Why did optimizing my Spark config slow it down?
I'm running Spark 2.2 (legacy codebase constraints) on an AWS EMR cluster (version 5.8), and I run a mix of Spark and Hadoop jobs on the cluster daily. I noticed that the configs that are submitte...
NateH06
Votes: 0
Answers: 1
EMR Spark deploy mode when using Docker
I am deploying a Spark job on AWS EMR and packaging all my dependencies using Docker. My spark-submit command, built in Python, looks like this:
...
cmd = (
f"spark-submit --deploy-mo...
Muhammad Raihan Muhaimin
Votes: 0
Answers: 1
Trino iceberg connector "getTablesWithParameter for GlueHiveMetastore is not implemented"
I'm running Trino on EMR version 6.5. I have added the Iceberg connector for Trino, and I want it to use a Glue catalog. This is the configuration in iceberg.properties:
connector.name=i...
taraf
Votes: 0
Answers: 1
Launch EMR cluster via Lambda inside a VPC using boto3
I am trying to launch an EMR cluster using AWS Lambda code written with boto3 and Python. The Lambda is able to launch the cluster when there is no VPC configuration associated with it. However, as soon as...
floating_hammer
Votes: 0
Answers: 0