Skip to content

Instantly share code, notes, and snippets.

@dotneet
Last active October 24, 2016 06:24
Show Gist options
  • Select an option

  • Save dotneet/c8f39836bc9fd87dc742 to your computer and use it in GitHub Desktop.

Select an option

Save dotneet/c8f39836bc9fd87dc742 to your computer and use it in GitHub Desktop.
Sparkのチューニングに関するメモ ref: http://qiita.com/devneko/items/8feeeabfcd9049874e5f
sqlContext.setConf("spark.sql.parquet.compression.codec", "snappy")
val jsonDataFrame = sqlContext.read.json(jsonFilePath)
val jsonDataFrame.select("id","type","arg1","arg2","arg3","time").write.format("parquet").save(parquetPath)
spark.serializer org.apache.spark.serializer.KryoSerializer
spark.executor.memory 6g
spark.driver.memory 2g
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment