Hey Sonal G. to pass the checkpoint directory to spark, I made below change to the 0.6.0 Jar and then it worked. Attaching the code block which was modified. This is for the first time setup of EMR where we have a fresh context, and to pass a directory for checkpointing.
In spark-submit we have to add --conf spark.checkpoint.dir=s3://OUR_BUCKET/checkpoints/
In spark/client/src/main/java/zingg/spark/client/SparkClient.java
public void checkAndSetCheckpoint(SparkSession session) {
SparkContext sc = session.sparkContext();
if (sc.getCheckpointDir().isEmpty()) {
String checkpointDir = sc.getConf().get("spark.checkpoint.dir", DEFAULT_CHECKPOINT_DIR);
sc.setCheckpointDir(checkpointDir);
}
} Let me know what do you think of it.