Find file History
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
..
Failed to load latest commit information.
pycascading_data
README.md
cache.py
callback.py
copy_data_to_hdfs.sh
joins.py
map_types.py
merge_streams.py
pagerank.py
python_fields.py
reduce.py
subassembly.py
total_sort.py
udf_contexts.py
word_count.py

README.md

PyCascading examples

This folder showcases a number of features offered by Cascading and PyCascading. They use input files in the 'pycascading_data' folder, so before running the examples, make sure that:

  • in local mode, you cd first to the examples/ directory (or wherever pycascading_data/ is found), and use local_run.sh to run the example like
  • in Hadoop mode, you copy the data folder to HDFS first by running copy_data_to_hdfs.sh, or

    hadoop fs -put pycascading_data pycascading_data

    and then invoke remote_deploy.sh