python (65.2k questions)
javascript (44.3k questions)
reactjs (22.7k questions)
java (20.8k questions)
c# (17.4k questions)
html (16.3k questions)
r (13.7k questions)
android (13k questions)
Remove duplicates on column based in apache beam java sdk
How do I remove multiple occurrences of row based on SessionId in apache beam java skd.
I have tried with Distinct as well as Deduplicate but that takes entire row based and removes.
import org.apache...
Ashok
Votes: 0
Answers: 1
Exception while writing multipart empty csv file from Apache Beam into netApp Storage Grid
Problem Statement
We are consuming multiple csv files into pcollections -> Apply beam SQL to transform data -> write resulted pcollection.
This is working absolutely fine if we have some data in...
Jaysukh Kalasariya
Votes: 0
Answers: 2
Apache Beam Python - SQL Transform with named PCollection Issue
I am trying to execute the below code in which I am using Named Tuple for PCollection and SQL transform for doing a simple select.
As per the video link (4:06) : https://www.youtube.com/watch?v=zx4p-U...

Murli Krishnan
Votes: 0
Answers: 0
Apache Beam - Multiple Pcollection - Dataframetransform Issue
I am running a below sample in apache beam
import apache_beam as beam
from apache_beam import Row
from apache_beam import Pipeline
from apache_beam.options.pipeline_options import PipelineOptions
from...

Murli Krishnan
Votes: 0
Answers: 0