AWS EMR Spark Python Logging
Answer :
I've found that EMR's logging for particular steps almost never winds up in the controller or stderr logs that get pulled alongside the step in the AWS console.
Usually I find what I want in the job's container logs (and usually it's in stdout).
These are typically at a path like s3://mybucket/logs/emr/spark/j-XXXXXX/containers/application_XXXXXXXXX/container_XXXXXXX/...
. You might need to poke around within the various application_...
and container_...
directories within containers
.
That last container directory should have a stdout.log
and stderr.log
.
Comments
Post a Comment