python (65.2k questions)
javascript (44.3k questions)
reactjs (22.7k questions)
java (20.8k questions)
c# (17.4k questions)
html (16.3k questions)
r (13.7k questions)
android (13k questions)
apache beam rows to tfrecord in order to GenerateStatistics
I have built a pipeline that read some data, does some manipulations and create some apache beam Row objects (Steps 1 and 2 in the code below). I then would like to generate statistic and write them t...
DarioB
Votes: 0
Answers: 1
Apache beam Initializer
In my dataflow job, I need to initialize a Config factory and log certain messages in an audit log before actual processing begins.
I have placed the Config factory initialization code + audit logging...

Kul
Votes: 0
Answers: 1
why creating a beam dataframe from beam.rows do not work but it does from beam.select
I have simplified my problem with the following two cases:
case 1:
import apache_beam as beam
from apache_beam.dataframe.convert import to_dataframe
input_file = "gs://bucket/inputfile.parquet&q...
DarioB
Votes: 0
Answers: 0
NameError: name 'funt1' is not defined [while running 'format data-ptransform-40']
I am running dataflow pipeline to load into biqquery but I am getting the below error
wrapper = lambda x: [fn(x)]
File "/home/mprabakaran/mergefile.py", line 93, in <lambda>
NameErro...
manoj
Votes: 0
Answers: 2