python (65.1k questions)
javascript (44.2k questions)
reactjs (22.7k questions)
java (20.8k questions)
c# (17.4k questions)
html (16.3k questions)
r (13.7k questions)
android (12.9k questions)
Pandas - MemoryError: Unable to allocate 220. MiB
So I have a data frame of orders, with the order date as the index, which I set so:
df = df.set_index('ORDER_ENTRY_DATE', drop=False)
In the code below I create a new feature, containing the total am...
yalexx
Votes: 0
Answers: 0
Trying to create RDD from external dataset(word_list.txt) but getting error
valfile = sc. textFile("/Users/Downloads/word_list-1.txt")My_file: org.apache.spark.rdd. RDD[String] = /Users/Downloads/word_list-1.txt Mappartitions RDD[1] at textFile at <console>:23...
Raju
Votes: 0
Answers: 0
What is the fastest way to pull massive amounts of data from Snowflake Database into AWS SageMaker?
What would be the fastest way to pull in very large datasets from Snowflake into my SageMaker instance in AWS? How does the snowflake python connector (what I currently use) compare to lets say a spar...
someone
Votes: 0
Answers: 1
Parquet writes boolean values as null
I am trying to write a dataframe in parquet format to hdfs. The dataframe contains all values before writing but when i write it to the disk, it converts the boolean values (true,false) to null.
Here ...
Khan Saab
Votes: 0
Answers: 0