python (65.1k questions)
javascript (44.2k questions)
reactjs (22.7k questions)
java (20.8k questions)
c# (17.4k questions)
html (16.3k questions)
r (13.7k questions)
android (12.9k questions)
parquet4s not returning all records
I have a simple Scala application that uses parquet4s with fs2 to read a set of partitioned records (spread across directories, generated by a Spark job).
When I run the app, it only returns a fractio...
samirbajaj
Votes: 0
Answers: 1
Python Statsmodel Logistic Regression iterate through Parquet file
I am trying to run a logistic regression model on a very large dataset with 2.3 billion observations in Python. I need a standard regression output. Statsmodels with parquet seemed promising:
https://...
HLRA
Votes: 0
Answers: 1
Kafka-connect file sink connector write in parquet file format
I am looking for kafka-connect connector that will write from kafka to local file system in parquet file format.
I don't want to use the hdfs or s3 sink connector for the same.
Aman Jain
Votes: 0
Answers: 1
Is it possible to reopen ParquetWriter after close() is called?
I'm currently using ParquetWriter to write Avro records to parquet files. I can use the write() and close() methods to write and close files as needed. Now I have a use case where I need to reopen a c...
WZH
Votes: 0
Answers: 0