NLTK for App Engine http://nltk.org


Natural Language Toolkit (NLTK) for App Engine

I have modified NLTK to get it running on Google Cloud Platform (App Engine). So far, tokenizing and part-of-speech (POS) tagging are working.

Quick Summary of Changes:

  • Changed path references to paths that are valid on App Engine.
  • Removed support for the hunpos and Stanford taggers, because these modules need to spawn subprocesses.
  • Removed the downloader module; its GUI is not relevant on App Engine.
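
To make the first bullet concrete, here is a minimal sketch of how data directories might be resolved relative to an application root. It is not the fork's actual code: the function name `data_search_paths` and the bundled `nltk_data` directory are assumptions made for illustration. The only behaviour taken from NLTK itself is that extra data directories can be listed in the `NLTK_DATA` environment variable, separated by the platform's path separator.

```python
import os


def data_search_paths(app_root, environ=None):
    """Return candidate data directories, environment entries first.

    Hypothetical helper for illustration only; not part of NLTK.
    """
    environ = os.environ if environ is None else environ
    # Directories named in NLTK_DATA, separated by os.pathsep (":" on Linux).
    from_env = [p for p in environ.get("NLTK_DATA", "").split(os.pathsep) if p]
    # Assumed location of data bundled with the deployed application.
    bundled = os.path.join(app_root, "nltk_data")
    return from_env + [bundled]
```

Passing the environment as a plain dictionary keeps the helper easy to exercise outside App Engine.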

Running on App Engine:

Feel free to use the sample app located under the appengine directory as a basis for your project. It includes the Treebank part-of-speech tagger but not the NLTK for App Engine or PyYAML libraries. Either way, the steps for running NLTK for App Engine are:

  1. Add the following entry to the base of your app.yaml:

         libraries:
         - name: numpy
           version: "1.6.1"

  2. Download PyYAML and copy its lib directory to your project root.

  3. Copy NLTK for App Engine to your project root. Then import nltk and play on.
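
Once the steps above are done, a quick way to check the setup is to tokenize and tag a sentence. The sketch below assumes the Treebank tagger data is available to the app, as it is in the sample app. `tag_sentence` is a hypothetical helper name; `nltk.word_tokenize` and `nltk.pos_tag` are the standard NLTK entry points. The optional `tokenize` and `tag` arguments exist only so the function can be tried without NLTK installed.

```python
def tag_sentence(text, tokenize=None, tag=None):
    """Tokenize `text` and return a list of (token, tag) pairs.

    Hypothetical helper; falls back to NLTK when no callables are given.
    """
    if tokenize is None or tag is None:
        # Imported lazily so the NLTK copy in the project root is only
        # needed when the defaults are actually used.
        import nltk
        tokenize = tokenize or nltk.word_tokenize
        tag = tag or nltk.pos_tag
    return tag(tokenize(text))
```

Calling `tag_sentence("NLTK is running on App Engine.")` from a request handler should return one (token, tag) pair per token if everything is in place.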

Sample Code:

A sample App Engine app that follows the steps above is located under the appengine directory.
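
For orientation, here is a rough sketch of what a tagging endpoint could look like as a plain WSGI application. This is not the sample app from the appengine directory: `make_tagging_app`, the `text` query parameter, and the JSON response format are all assumptions chosen for illustration. The `tag_text` argument is any callable that takes a string and returns (token, tag) pairs, for example a thin wrapper around NLTK's tokenizer and tagger.

```python
import json

try:
    from urllib.parse import parse_qs  # Python 3
except ImportError:
    from urlparse import parse_qs  # Python 2.7


def make_tagging_app(tag_text):
    """Build a WSGI app that tags the `text` query parameter.

    Hypothetical factory for illustration; `tag_text` is supplied by the caller.
    """
    def app(environ, start_response):
        query = parse_qs(environ.get("QUERY_STRING", ""))
        # A missing parameter is treated as an empty string.
        text = query.get("text", [""])[0]
        body = json.dumps(tag_text(text)).encode("utf-8")
        start_response("200 OK", [
            ("Content-Type", "application/json"),
            ("Content-Length", str(len(body))),
        ])
        return [body]
    return app
```

Taking the tagger as an argument keeps the request handling separate from NLTK, so the handler can be tested locally with a stub before deploying.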

Redistributing

The NLTK for App Engine source code is distributed under the same license as the NLTK project, that is, the Apache 2.0 License.
