Skip to content

cteplovs/bigramCount

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Execution
 - Simply run the starter script `run.sh`
 - Results are stored in `src/results.txt` after the program completes


File Descriptions
- Dockerfile: configuration file for building Spark Docker image
- scripts: directory that stores scripts for initializing working components in Spark cluster
- src
  - bigramCount.py: the bigramCount program
  - text.txt: testing data
  - run.sh: start the program
- start-master.sh: a wrapper script to start a Master node container
- start-worker.sh: a wrapper script to start a Worker node container

About

Bigram Count with Spark

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages

  • Shell 50.5%
  • Python 49.5%