python (65.1k questions)
javascript (44.2k questions)
reactjs (22.7k questions)
java (20.8k questions)
c# (17.4k questions)
html (16.3k questions)
r (13.7k questions)
android (12.9k questions)
cant't figure out how to fix the error ''An error occurred while calling z:org.apache.spark.api.python.PythonRDD.collectAndServe.''
How to fix this error when I write the mapping function in pyspark. The error I get is (Py4JJavaError: An error occurred while calling z:org.apache.spark.api.python.PythonRDD.collectAndServe.)
def use...
jharna
Votes: 0
Answers: 0
Sort by key (Month) using RDDs in Pyspark
I have this RDD and wanna sort it by Month (Jan --> Dec). How can i do it in pyspark?
Note: Don't want to use spark.sql or Dataframe.
+-----+-----+
|Month|count|
+-----+-----+
| Oct| 1176|
| Sep|...
Asif_Javed
Votes: 0
Answers: 2
Explode single DataFrame row into multiple ones across Year-Month column?
I'm trying to explode a dataframe row made of a "yearMonth" column into multiple rows where each one is a day of that month
this is the example. I want to go from this:
Key
YearMonth
...
Alessio Iannini
Votes: 0
Answers: 2
How do I manually select the bandwidth in rdrobust?
I am using an election data frame with margin of victory as my running variable and I want to manually select the bandwidth in rdrobust.
I now that I should use the h() command for that, but I don't u...
Caroline Cavallari
Votes: 0
Answers: 1