AWS EMR Spark Python Logging
Answer:
I've found that the log output from a particular EMR step almost never ends up in the controller or stderr logs that the AWS console shows alongside the step.
Usually what I want is in the job's container logs, and usually in stdout.
These typically live at a path like s3://mybucket/logs/emr/spark/j-XXXXXX/containers/application_XXXXXXXXX/container_XXXXXXX/.... You might need to poke around in the various application_... and container_... directories under containers to find the right one.
The innermost container directory should have a stdout.log and stderr.log.
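Rather than clicking through every application_... and container_... directory by hand, you can list the candidate files with a script. The following is only a rough sketch, assuming boto3 is installed and AWS credentials are configured. The function names (containers_prefix, pick_container_logs, list_container_logs) are made up for illustration, and the bucket, log root, and cluster id are placeholders taken from the example path above. The filter matches any file whose name starts with stdout or stderr, since the exact extension may differ on your cluster.

```python
import re


def containers_prefix(log_root: str, cluster_id: str) -> str:
    """Build the S3 key prefix where container logs are expected to live.

    log_root and cluster_id are placeholders, e.g. "logs/emr/spark" and "j-XXXXXX".
    """
    return f"{log_root.strip('/')}/{cluster_id}/containers/"


# Matches keys whose final path component starts with "stdout" or "stderr".
_LOG_NAME = re.compile(r"/(stdout|stderr)[^/]*$")


def pick_container_logs(keys):
    """Keep only the keys that look like container stdout/stderr files."""
    return [key for key in keys if _LOG_NAME.search(key)]


def list_container_logs(bucket: str, log_root: str, cluster_id: str):
    """List stdout/stderr keys under the containers/ prefix of one cluster."""
    import boto3  # imported here so the helpers above work without boto3

    s3 = boto3.client("s3")
    paginator = s3.get_paginator("list_objects_v2")
    keys = []
    for page in paginator.paginate(
        Bucket=bucket, Prefix=containers_prefix(log_root, cluster_id)
    ):
        # "Contents" is absent when a page has no objects.
        keys.extend(obj["Key"] for obj in page.get("Contents", []))
    return pick_container_logs(keys)
```

Calling list_container_logs("mybucket", "logs/emr/spark", "j-XXXXXX") with your own values should return the stdout and stderr keys for every container, which you can then download and search for your job's output.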