JavaScript
Latest commit 75c7385 Sep 7, 2017 @spencermountain spencermountain committed on GitHub Create LICENSE
Permalink
Failed to load latest commit information.
builds 10.7.2 Aug 22, 2017
demo new debug mode for clientside Apr 13, 2017
docs noUnits method May 27, 2017
scripts 10.7.2 Aug 22, 2017
src move compressed data Jul 23, 2017
test 10.7.1 Jul 23, 2017
.esformatter move tests around Feb 3, 2017
.eslintrc 10 tests failing Jul 5, 2017
.gitignore v10.5.1 Jun 7, 2017
.npmignore cleanup scripts folder Feb 21, 2017
.travis.yml test build and src in travis now Jun 7, 2017
LICENSE Create LICENSE Sep 7, 2017
README.md yolo Jul 6, 2017
changelog.md 10.7.2 Aug 22, 2017
compromise.d.ts init typefile May 1, 2017
package.json 10.7.2 Aug 22, 2017
scratch.js cleanup lex Jul 23, 2017

README.md

natural language processing, actually in the browser
by Spencer Kelly and many contributors

var nlp = require('compromise')

nlp('Wee-ooh, I look just like buddy holly.').sentences().toPastTense()
// 'Wee-ooh, I looked just like buddy holly.'

nlp('..then consider me Miles Davis!').people().out('freq')
// [{ text:'Miles Davis', count:1 }]
210k
        one javascript file        
86%
    on the Penn treebank    
🙏
    npm install compromise    
IE9+
      caniuse, youbetcha      
with deliberate, rule-based nlp,
compromise makes working with text easy
no jargon,   |   no config,   |   no training       🙌 you can do it!

API doc       Demos       QuickStart       Tutorials

nouns! verbs! adjectives!
people, places, organizations
seven hundred and fifty == 750
like a regex for a sentence
all your base are belong
contractions, style, mood..

⚡️ Client-side!

<script src="https://unpkg.com/compromise@latest/builds/compromise.min.js"></script>
<script>
  var doc = nlp('dinosaur')

  var str = doc.nouns().toPlural().out('text')
  console.log(str)
  // 'dinosaurs'
</script>

🌋 Server-side!

var nlp = require('compromise')

var doc = nlp('London is calling')
doc.sentences().toNegative()
// 'London is not calling'

Toss in text,

even if it's just one word:

doc = nlp('Ludwig van Beethoven wrote to Josephine Brunsvik')

doc.people().out('list')
// ['ludwig van beethoven', 'josephine brunsvik']

doc.match('#TitleCase van #LastName').out()
// 'Ludwig van Beethoven'

doc.match('#PastTense to').hyphenate().out()
// 'wrote-to'

    Throw stuff around:

doc = nlp('a bottle of beer on the wall.')
doc.nouns().first().toPlural()
doc.out('text')
//'The bottles of beer on the wall.'
doc = nlp('ninety five thousand and fifty two')
doc.values().toNumber().out('text')
// '95052'

doc = nlp('the 23rd of December')
doc.values().add(2).toText()
doc.out('text')
// 'the twenty fifth of December'
doc = nlp("the guest-singer's björk   at seven thirty.").normalize().out('text')
// 'The guest singer is Bjork at 7:30.'
  • Tense: - switch between conjugations of any verb
let doc = nlp('she sells seashells by the seashore.')
doc.sentences().toFutureTense().out('text')
//'she will sell seashells...'

doc.verbs().conjugate()
// [{ PastTense: 'sold',
//    Infinitive: 'sell',
//    Gerund: 'selling', ...
// }]
doc = nlp('that opera about richard nixon visiting china')
doc.topics().data()
// [
//   { text: 'richard nixon' },
//   { text: 'china' }
// ]
var lexicon={
  'boston': 'MusicalGroup'
}
doc = nlp('i heard Boston\'s set in Chicago', lexicon)
doc.match('#MusicalGroup').length
// 1

//alternatively, fix it all 'in-post':
doc.match('heard #Possessive set').terms(1).tag('MusicalGroup')
doc.match('#MusicalGroup').length
// 1
doc = nlp('We like Roy! We like Roy!').sentences().out('array')
// ['We like Roy!', 'We like Roy!']

doc = nlp('Tony Hawk').out('html')
/*
<span>
  <span class="nl-Person nl-FirstName">Tony</span>
  <span>&nbsp;</span>
  <span class="nl-Person nl-LastName">Hawk</span>
</span>
*/

and yes, ofcourse, there's a lot more stuff.

Join in - we're fun, using semver, and moving fast. get involved

        Twitter        
      Slack group      
      Mailing-list      
        Projects        
      Pull-requests      

    🌎 Other Languages?

      okay! we've got work-in-progress forks for German and French, in the same philosophy.
      Get involved!

    ☂️ Isn't javascript too...

      yeah..
      it wasn't built to compete with the stanford tagger, and may not fit every project.
      all string stuff is synchronous too, and parallelizing is weird.
      See here for information about speed & performance.

    💃 Can it run on my arduino-watch?

      Only if it's water-proof!
      See quickStart for all sorts of funny environments.

    Partial builds?

      compromise can't really be tree-shaken, because it's one function.
      .. and the tagging methods are competitive, so it's not recommended to pull things out.
      It's best to load the library fully, given it's smaller than this gif.
      A plug-in scheme is in the works.


Don't forget about:

For the former promise-library, see jnewman/compromise (Thanks Joshua!)

(also don't forget 🙇 NLTK, GATE, Stanford, and Illinois toolkit )