GitbookIO/tokenize-english

tokenize-english

JavaScript tokenizer for English sentences.

Installation

$ npm install tokenize-english

Usage

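// Create a base tokenizer, then wrap it with the English-specific rules
var tokenize = require('tokenize-text')();
var tokenizeEnglish = require('tokenize-english')(tokenize);

// Split a text into sentences; abbreviations such as "Jan." and "Sen." are not treated as sentence ends
var tokens = tokenizeEnglish.sentences("On Jan. 20, former Sen. Barack Obama became the 44th President of the U.S. Millions attended the Inauguration.");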

/*
[ { index: 0,
    value: 'On Jan. 20, former Sen. Barack Obama became the 44th President of the U.S.',
    offset: 74 },
  { index: 74,
    value: ' Millions attended the Inauguration.',
    offset: 36 } ]
*/
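
Judging from the output above, `index` looks like the start position of each sentence in the input and `offset` looks like its length in characters (74 + 36 covers the whole 110-character string). The sketch below checks that reading against the token objects copied from the example; it does not call the library. `sliceToken` is a hypothetical helper and is not part of tokenize-english.

```javascript
// Input string and token objects copied verbatim from the example above.
const text = "On Jan. 20, former Sen. Barack Obama became the 44th President of the U.S. Millions attended the Inauguration.";

const tokens = [
  { index: 0,
    value: 'On Jan. 20, former Sen. Barack Obama became the 44th President of the U.S.',
    offset: 74 },
  { index: 74,
    value: ' Millions attended the Inauguration.',
    offset: 36 }
];

// Hypothetical helper (an assumption, not a library function):
// treats `index` as the start position and `offset` as the length.
function sliceToken(source, token) {
  return source.slice(token.index, token.index + token.offset);
}

// Check that each token's value is exactly the slice it describes.
const allMatch = tokens.every(function (t) {
  return sliceToken(text, t) === t.value;
});

// Token values keep their leading whitespace, so trim them for display.
const sentences = tokens.map(function (t) {
  return t.value.trim();
});

console.log(allMatch);
console.log(sentences);
```

Because the second token's value begins with a space, trimming is worth doing before displaying sentences or comparing them with other strings.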
