diff --git a/docs/submitting-applications.md b/docs/submitting-applications.md index 3ecbf2308cd44..46204634a74d4 100644 --- a/docs/submitting-applications.md +++ b/docs/submitting-applications.md @@ -22,6 +22,9 @@ For Python, you can use the `--py-files` argument of `spark-submit` to add `.py` files to be distributed with your application. If you depend on multiple Python files we recommend packaging them into a `.zip` or `.egg`. +As Python can not read files from assembly jar which package by JDK1.7+, so packaging pyspark into a +`.zip`(the name contains "spark-pyspark") and use `--py-files` argument of `spark-submit` to distribute it. + # Launching Applications with spark-submit Once a user application is bundled, it can be launched using the `bin/spark-submit` script.