Skip to content
This repository

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP

NLTK for App Engine

branch: master

NLTK lib changes for app engine

removed support for hunpos & stanford taggers
updated path to relevant app engine value.
taggers should into projects under
'taggers/name-of-tagger-project/name-of-tagger.pickle'
latest commit 9181f8991d
rutherford authored October 28, 2012
Octocat-spinner-32 appengine Initial commit for app engine fork October 28, 2012
Octocat-spinner-32 emacs cosmetics: get rid of even more CVS-style $Id$ lines February 29, 2012
Octocat-spinner-32 examples Introduce end-of-line normalization September 21, 2012
Octocat-spinner-32 javasrc updated copyright year to 2012 December 24, 2011
Octocat-spinner-32 nltk NLTK lib changes for app engine October 28, 2012
Octocat-spinner-32 papers Introduce end-of-line normalization September 21, 2012
Octocat-spinner-32 tools removed stale tools, moved nltk_data tools to nltk_data repository May 10, 2012
Octocat-spinner-32 web Merge branch 'master' of https://github.com/nltk/nltk October 18, 2012
Octocat-spinner-32 .gitattributes Introduce end-of-line normalization September 21, 2012
Octocat-spinner-32 .gitignore ignore more files May 16, 2012
Octocat-spinner-32 ChangeLog finalising 2.0.3 September 24, 2012
Octocat-spinner-32 INSTALL.txt new installation URL January 14, 2012
Octocat-spinner-32 LICENSE.txt updated copyright year to 2012 December 24, 2011
Octocat-spinner-32 MANIFEST.in updates to support building 2.0.1rc2 December 01, 2011
Octocat-spinner-32 Makefile misc cleanups May 15, 2012
Octocat-spinner-32 NOTICE.txt updated copyright year to 2012 December 24, 2011
Octocat-spinner-32 README.md NLTK lib changes for app engine October 28, 2012
Octocat-spinner-32 README.txt updated copyright year to 2012 December 24, 2011
Octocat-spinner-32 RELEASE-HOWTO finalising 2.0.3 September 24, 2012
Octocat-spinner-32 distribute_setup.py Support new Distribute Setup; remove old setup files; October 11, 2011
Octocat-spinner-32 setup.cfg updates to support building 2.0.1rc2 December 01, 2011
Octocat-spinner-32 setup.py updated copyright year to 2012 December 24, 2011
Octocat-spinner-32 tox.ini More test improvements: more powerful test runner; test for Russian s… May 16, 2012
README.md

Natural Language Toolkit (NLTK) for App Engine

I have tampered with the NLTK in order to get it running on Google cloud platform. So far, tokenizing and Part Of Speech (POS) tagging are working.

Quick Summary of Changes:

  • Changed path references to a relevant app engine path.
  • Removed support for hunpos, stanford taggers due to subprocess spawning requirements in these modules
  • Removed downloader module; gui not relevant on app engine.

Running on App Engine:

Feel free to use the sample app located under appengine directory as a basis for your project. It includes the Treebank Part of Speech Tagger but not the NLTK for App Engine or PyYAML libs. In any case, the steps to running NLTK for App Engine are:

  1. Add following entry to the base of your app.yaml
libraries:
- name: numpy
  version: "1.6.1"
  1. Download PyYAML and copy it's lib directory to your project root.

  2. Copy NLTK for App Engine to your project root. import nltk and play on.

Sample Code:

Sample App Engine app utilising above method(s) located under appengine directory

Redistributing

NLTK for App Engine source code is distributed under the same license as the NLTK project, that is the Apache 2.0 License.

Something went wrong with that request. Please try again.