python (65.1k questions)
javascript (44.2k questions)
reactjs (22.7k questions)
java (20.8k questions)
c# (17.4k questions)
html (16.3k questions)
r (13.7k questions)
android (12.9k questions)
Generate Kafka message with Headers using Apache Spark
I have an ETL (spark-scala). After writing in a table, a message with "header" must be sent to Kafka. I couldn't add the header in the message. I have a spark DataFrame with the "key&qu...
sherifdewichita
Votes: 0
Answers: 1
Spark spark-sql-kafka - java.lang.NoClassDefFoundError: org/apache/kafka/common/serialization/ByteArraySerializer
I am experimenting with spark reading from a kafka topic through "Structured Streaming + Kafka Integration Guide".
Spark version: 3.2.1
Scala version: 2.12.15
Following their guide on the sp...
abaschkim
Votes: 0
Answers: 1
Driver memory not getting cleaned up in Spark Structured Streaming
I am using my Spark Structured Streaming job to perform my ETL in AWS platform
My Driver memory is not getting cleared-up. The job is reading the events from Kinesis and writing to S3
Below are the my...
Ankur Shrivastava
Votes: 0
Answers: 1
changing parquet file column value without changing name of the parquet file so that _spark_metadata is not changed
So I have a spark structured streaming job that stores flattened json messages dataframes in a hdfs location in parquet format in date partitioned folders i.e /source/wireless/active_portal/activation...
Queen Anonymous
Votes: 0
Answers: 1