Mirror of Apache Spark Website
HTML Other
Switch branches/tags
Nothing to show
Latest commit f57f070 Jul 13, 2017 @marmbrus marmbrus Add pyspark instructions
Permalink
Failed to load latest commit information.
.github add Pull Request Template Nov 21, 2016
_layouts Update for 2.2.0 Jul 11, 2017
css CSS tweaks Jun 10, 2016
graphx Port wiki page Committers to committers.html, Contributing to Spark a… Nov 21, 2016
images Remove unused images from Spark site; crush large PNGs; remove obsole… Jul 4, 2016
js Update for 2.2.0 Jul 11, 2017
mllib Replace most http links with https as a best practice, where possible May 10, 2017
news Update for 2.2.0 Jul 11, 2017
releases/_posts Add pyspark instructions Jul 13, 2017
screencasts Use site.baseurl, not site.url, to work with Jekyll 3.3. Require Jeky… Nov 15, 2016
site Add pyspark instructions Jul 13, 2017
sql Port wiki page Committers to committers.html, Contributing to Spark a… Nov 21, 2016
streaming Replace most http links with https as a best practice, where possible May 10, 2017
talks Added PySpark talk Sep 11, 2013
.gitignore Add .DS_Store in .gitignore Jun 20, 2017
README.md Use site.baseurl, not site.url, to work with Jekyll 3.3. Require Jeky… Nov 15, 2016
_config.yml Replace most http links with https as a best practice, where possible May 10, 2017
committers.md Update Sandy. Jul 5, 2017
community.md Break out security info into dedicated page, to match other ASF proje… Jun 9, 2017
content Add possibly temporary content -> site symlink to see if it makes the… Jul 13, 2016
contributing.md Replace most http links with https as a best practice, where possible May 10, 2017
developer-tools.md Replace most http links with https as a best practice, where possible May 10, 2017
doap.rdf SPARK-10175 fix links in Spark DOAP file Aug 29, 2015
documentation.md Replace most http links with https as a best practice, where possible May 10, 2017
downloads.md Add pyspark instructions Jul 13, 2017
examples.md Patch references to docs/programming-guide.html to docs/rdd-programmi… Jul 12, 2017
faq.md Use AMPLab direct link in FAQ Jul 9, 2017
favicon.ico Initial port of Spark website from spark-project.org wordpress to Jek… Aug 23, 2013
improvement-proposals.md jekyll build Jun 26, 2017
index.md Replace most http links with https as a best practice, where possible May 10, 2017
mailing-lists.md Update site look and add pages for Streaming and MLlib Jan 22, 2014
powered-by.md More dead link fixing May 11, 2017
release-process.md Add a note for searching PySpark and SparkR version changes in releas… Jun 20, 2017
research.md replace with valid url to rdd paper Sep 16, 2016
robots.txt Replace most http links with https as a best practice, where possible May 10, 2017
security.md Add note about CVE-2017-7678 Jul 12, 2017
sitemap.xml Patch references to docs/programming-guide.html to docs/rdd-programmi… Jul 12, 2017
third-party-projects.md Replace most http links with https as a best practice, where possible May 10, 2017
trademarks.md Replace most http links with https as a best practice, where possible May 10, 2017
versioning-policy.md Port wiki Useful Developer Tools and Profiling Spark Apps to /develop… Nov 22, 2016

README.md

Generating the website HTML

In this directory you will find text files formatted using Markdown, with an .md suffix.

Building the site requires Jekyll 3.3.0 or newer. The easiest way to install jekyll is via a Ruby Gem. This will create a directory called site containing index.html as well as the rest of the compiled directories and files.

To install Jekyll and its required dependencies, execute sudo gem install jekyll pygments.rb and sudo pip install Pygments. See also https://github.com/apache/spark/blob/master/docs/README.md

You can generate the html website by running jekyll build in this directory. Use the --watch flag to have jekyll recompile your files as you save changes.

In addition to generating the site as HTML from the markdown files, jekyll can serve the site via a web server. To build the site and run a web server use the command jekyll serve which runs the web server on port 4000, then visit the site at http://localhost:4000.

Docs sub-dir

The docs are not generated as part of the website. They are built separately for each release of Spark from the Spark source repository and then copied to the website under the docs directory. See the instructions for building those in the readme in the Spark project's /docs directory.

Pygments

We also use pygments for syntax highlighting in documentation markdown pages.

To mark a block of code in your markdown to be syntax highlighted by jekyll during the compile phase, use the following syntax:

{% highlight scala %}
// Your scala code goes here, you can replace scala with many other
// supported languages too.
{% endhighlight %}

You probably don't need to install that unless you want to regenerate the pygments CSS file. It requires Python, and can be installed by running sudo easy_install Pygments.