chrismattmann/nutch-python

nutch-python

A Python client library for Apache Nutch that makes Nutch 1.x capabilities available through the Nutch REST server.

See the Nutch tutorial (https://wiki.apache.org/nutch/NutchTutorial) for instructions on installing Nutch 1.x and, alternatively, operating it from the command line.
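Because the library talks to a running Nutch REST server, it can help to confirm that the server is reachable before using the client. The following is a minimal sketch using only the Python standard library, not the Nutch-Python API. The address `http://localhost:8081`, the `admin` status path, and the JSON response are assumptions about a typical Nutch 1.x setup; `status_url` and `server_status` are hypothetical helper names. Adjust them to match your installation.

```python
import json
from urllib.parse import urljoin
from urllib.request import urlopen

# Assumption: the Nutch REST server listens on this address.
DEFAULT_SERVER = "http://localhost:8081"


def status_url(server=DEFAULT_SERVER, path="admin"):
    """Build the URL of the (assumed) server status endpoint."""
    return urljoin(server.rstrip("/") + "/", path)


def server_status(server=DEFAULT_SERVER, timeout=5):
    """Return the decoded JSON status, or None if the server
    cannot be reached or does not answer with valid JSON."""
    try:
        with urlopen(status_url(server), timeout=timeout) as resp:
            return json.loads(resp.read().decode("utf-8"))
    except (OSError, ValueError):
        # URLError and HTTPError are OSError subclasses;
        # JSON decoding errors are ValueError subclasses.
        return None


if __name__ == "__main__":
    status = server_status()
    print("server reachable" if status is not None else "server not reachable")
```

If this reports that the server is not reachable, check that the Nutch REST server has been started and that the address matches.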

This Python client library for Nutch is installable via Setuptools, Pip and Easy Install.

Installation (with pip)

  1. pip install nutch

Installation (without pip)

  1. python setup.py build
  2. python setup.py install
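After either installation route, one way to confirm that the package is visible to your interpreter is to query the installed distribution metadata. This is a small sketch that assumes Python 3.8 or later and that the distribution is registered under the name `nutch` (the name used in the pip command above); `installed_version` is a hypothetical helper, not part of Nutch-Python.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name="nutch"):
    """Return the installed version string of a distribution,
    or None if no distribution with that name is installed."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version()
    if found is None:
        print("nutch is not installed in this environment")
    else:
        print("nutch version:", found)
```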

Wiki Documentation

See the wiki for instructions on how to use Nutch-Python and its API.

New Command Line Tool

When you install Nutch-Python, you also get a new command line client tool, nutch-python, installed in your /path/to/python/bin directory.

The options and help for the command line tool can be seen by typing nutch-python without any arguments.
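If the tool does not seem to be available, a quick check is whether it is on your `PATH`. The sketch below locates `nutch-python` and runs it without arguments to capture the help text, as described above. `find_cli` and `show_help` are hypothetical helper names, and the assumption that the help text arrives on standard output or standard error is mine.

```python
import shutil
import subprocess


def find_cli(name="nutch-python"):
    """Return the full path of the command line tool, or None if
    it is not on PATH."""
    return shutil.which(name)


def show_help(name="nutch-python"):
    """Run the tool without arguments and return whatever it prints,
    or None if the tool cannot be found."""
    path = find_cli(name)
    if path is None:
        return None
    result = subprocess.run([path], capture_output=True, text=True)
    # Help may be written to either stream, so return both.
    return result.stdout + result.stderr


if __name__ == "__main__":
    text = show_help()
    print(text if text is not None else "nutch-python was not found on PATH")
```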

Questions, comments?

Send them to Chris A. Mattmann.

Contributors

  • Brian D. Wilson, JPL
  • Chris A. Mattmann, JPL
  • Aron Ahmadia, Continuum Analytics

License

Apache License, Version 2.0

About

Nutch-Python is a Python binding to the Apache Nutch™ REST services, allowing Nutch to be called natively from Python.
