How to redirect entire output of spark-submit to a file
spark-submit
prints most of it's output to STDERR
To redirect the entire output to one file, you can use:
spark-submit something.py > results.txt 2>&1
Or
spark-submit something.py &> results.txt
If you are running the spark-submit on a cluster the logs are stored with the application Id. You can see the logs once the application finishes.
yarn logs --applicationId <your applicationId> > myfile.txt
Should fetch you the log of your job
The applicationId of your job is given when you submit the spark job. You will be able to see that in the console where you are submitting or from the Hadoop UI.