How to Diagnose Zingg Matching Phase Errors When Running on EMR with the FEBRL Example?
Hey Zingg Team,

We are trying to run Zingg on the febrl example. When we run the command below on EMR:

/opt/zingg/scripts/zingg.sh --phase match --conf /opt/zingg/examples/febrl/config.json --zinggDir s3://OUR_BUCKET/checkpoints/

we get the error below. How can we get more information about where it is failing?

26/06/22 11:55:34 WARN SparkContext: Spark is not running in local mode, therefore the checkpoint directory must not be on the local filesystem. Directory '/tmp/checkpoint' appears to be on the local filesystem.
26/06/22 11:55:34 WARN PipeUtilReader: Reading Pipe [name=test, format=csv, preprocessors=null, props={path=s3://OUR_BUCKET/examples/febrl/test.csv, header=false, delimiter=,}]
26/06/22 11:55:35 INFO ClientConfigurationFactory: Set initial getObject socket timeout to 2000 ms.
26/06/22 11:55:38 WARN Email: Unable to send email Can't send command to SMTP host
26/06/22 11:55:38 WARN Client: Apologies for this message. Zingg has encountered an error. Error in matching phase