# **Python Example**

This notebook will show you how to use **Apache Spark** with **Python** to perform a simple word count.

To run a cell, press **Shift+Enter**, which runs the current cell and moves to the next one, or click in a cell and press **Ctrl+Enter**, which runs the current cell and stays on it.

**This notebook covers:**
* *Part 1:* Import Required Libraries
* *Part 2:* Spark Context
* *Part 3:* Word Count


## Import Required Libraries

This section imports `SparkConf` and `SparkContext` from the `pyspark` package, the two classes used in the rest of the notebook.

In [None]:
from pyspark import SparkConf
from pyspark import SparkContext

## Spark Context

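This section shows how to configure and initialize a basic `SparkContext`. The configuration points at a Spark master at `spark://localhost:7077` and sets the application name.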

In [None]:
conf = SparkConf().setMaster('spark://localhost:7077').setAppName('Word Count Python App')
sc = SparkContext(conf=conf)

## Word Count

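This section contains the word count application: it loads a text file, counts how many times each word appears, and lists the 15 most frequent words.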

In [None]:
# load a text file as an RDD with one element per line
text_file = sc.textFile("/srv/spark/LICENSE")

In [None]:
# count how many times each word appears in the file
counts = (text_file.flatMap(lambda line: line.split(' '))
                   .map(lambda word: (word, 1))
                   .reduceByKey(lambda a, b: a + b))
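
To make the three transformations concrete, here is a minimal pure-Python sketch of the same computation on a small in-memory list, with no Spark involved. The helper name `word_count` and the sample lines are illustrative assumptions and not part of the notebook. The real RDD pipeline is lazy and distributed, which this sketch does not try to model.

```python
from collections import defaultdict


def word_count(lines):
    """Mimic flatMap -> map -> reduceByKey on a plain list of strings."""
    # flatMap: split every line and flatten the results into one list of words
    words = [word for line in lines for word in line.split(' ')]
    # map: pair each word with an initial count of 1
    pairs = [(word, 1) for word in words]
    # reduceByKey: combine the values that share a key using a + b
    counts = defaultdict(int)
    for word, n in pairs:
        counts[word] = counts[word] + n
    return dict(counts)


# Hypothetical sample input standing in for the lines of a text file
sample_lines = ["the quick brown fox", "the lazy dog", "the end"]
result = word_count(sample_lines)
print(result["the"])
```

Note that `split(' ')` returns empty strings wherever two spaces are adjacent, so on indented or padded text the empty string can show up as a "word". Calling `split()` with no argument splits on any run of whitespace and avoids this.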

In [None]:
# top 15 words
counts.takeOrdered(15, key=lambda x: -x[1])
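
`takeOrdered(15, key=...)` returns the 15 elements with the smallest key values, so negating the count selects the 15 largest counts. A rough local analogue is `heapq.nsmallest` with the same key function. The sketch below assumes a small, hypothetical list of `(word, count)` pairs in place of the `counts` RDD.

```python
import heapq

# Hypothetical (word, count) pairs standing in for the contents of `counts`
pairs = [("the", 3), ("fox", 1), ("dog", 2), ("end", 5)]

# Smallest by negated count is the same as largest by count
top_two = heapq.nsmallest(2, pairs, key=lambda x: -x[1])
print(top_two)
```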

*Notebook written by [fscm](https://github.com/fscm)*