Switch branches/tags
Find file History
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
..
Failed to load latest commit information.
pycascading_data
README.md
cache.py
callback.py
copy_data_to_hdfs.sh
joins.py
map_types.py
merge_streams.py
pagerank.py
python_fields.py
reduce.py
subassembly.py
total_sort.py
udf_contexts.py
word_count.py

README.md

PyCascading examples

This folder showcases a number of features offered by Cascading and PyCascading. They use input files in the 'pycascading_data' folder, so before running the examples, make sure that:

  • in local mode, you cd first to the examples/ directory (or wherever pycascading_data/ is found), and use local_run.sh to run the example like

  • in Hadoop mode, you copy the data folder to HDFS first by running copy_data_to_hdfs.sh, or

    hadoop fs -put pycascading_data pycascading_data

    and then invoke remote_deploy.sh