GitHub - noscripter/compromise: Natural language processing in javascript

natural language processing, actually in the browser

_{by
Spencer Kelly and

contributors}

var nlp = require('compromise')

nlp('Wee-ooh, I look just like buddy holly.').sentences().toPastTense()
// 'Wee-ooh, I looked just like buddy holly.'

nlp('..then consider me Miles Davis!').people().out('freq')
// [{ text:'Miles Davis', count:1 }]

210k

one javascript file

86%

on the Penn treebank

🙏

npm install compromise

IE9+

caniuse, youbetcha

_{with deliberate, rule-based nlp,}
compromise makes working with text easy

no jargon, | no config, | no training

_{🙌 you can do it!}

Part-of-Speech tags	_{nouns, verbs, adjectives..}	Verb conjugation	_{change tense of a verb or sentence}
Number parsing	_{'seven hundred and fifty' -> 750}	Named-entities	_{the people, places, orgs..}
Template-matches	_{match natural-language forms}	Text cleanup	_{contractions, hyphenation, punctuation}

API docs | Demos

Client-side!

<script src="https://unpkg.com/compromise@latest/builds/compromise.min.js"></script>
<script>
  var doc = nlp('dinosaur')

  var str = doc.nouns().toPlural().out('text')
  console.log(str)
  // 'dinosaurs'
</script>

🌋 Server-side!

var nlp = require('compromise')

var doc = nlp('London is calling')
doc.sentences().toNegative()
// 'London is not calling'

Grab some words,

you can use pre-defined selections (like .nouns()) or grab any pattern with .match()

doc = nlp('Ludwig van Beethoven wrote to Josephine Brunsvik')

doc.people().out('list')
// ['ludwig van beethoven', 'josephine brunsvik']

doc.match('#TitleCase van #LastName').out()
// 'Ludwig van Beethoven'

doc.match('#PastTense to').hyphenate().out()
// 'wrote-to'

Plural/singular:

grab the noun-phrases, make em plural:

doc = nlp('a bottle of beer on the wall.')
doc.nouns().first().toPlural()
doc.out('text')
//'The bottles of beer on the wall.'

Number parsing:

parse written-out numbers, and change their form:

doc = nlp('ninety five thousand and fifty two')
doc.values().toNumber().out('text')
// '95052'

doc = nlp('the 23rd of December')
doc.values().add(2).toText()
doc.out('text')
// 'the twenty fifth of December'

Normalization:

handle the craziness:

doc = nlp("the guest-singer's björk   at seven thirty.").normalize().out('text')
// 'The guest singer is Bjork at 7:30.'

Tense:

all your base are belong:

let doc = nlp('she sells seashells by the seashore.')
doc.sentences().toFutureTense().out('text')
//'she will sell seashells...'

doc.verbs().conjugate()
// [{ PastTense: 'sold',
//    Infinitive: 'sell',
//    Gerund: 'selling', ...
// }]

Named-entities:

get the people, places, organizations:

doc = nlp('that opera about richard nixon visiting china')
doc.topics().data()
// [
//   { text: 'richard nixon' },
//   { text: 'china' }
// ]

Error correction:

make it say what you'd like:

var lexicon={
  'boston': 'MusicalGroup'
}
doc = nlp('i heard Boston\'s set in Chicago', lexicon)
doc.match('#MusicalGroup').length
// 1

//alternatively, fix it all 'in-post':
doc.match('heard #Possessive set').terms(1).tag('MusicalGroup')
doc.match('#MusicalGroup').length
// 1

Handy outputs:

get sensible data:

doc = nlp('We like Roy! We like Roy!').sentences().out('array')
// ['We like Roy!', 'We like Roy!']

doc = nlp('Tony Hawk').out('html')
/*
<span>
  <span class="nl-Person nl-FirstName">Tony</span>
  <span>&nbsp;</span>
  <span class="nl-Person nl-LastName">Hawk</span>
</span>
*/

and yes, ofcourse, there's a lot more stuff.

Join in - we're fun, using semver, and moving fast. get involved

Don't forget about:

✨ naturalNode - decidedly fancier, statistical nlp in javascript
🍭 SuperScript - clever conversation engine in js
💗 NodeBox Linguistics - conjugation, inflection in javascript
🎀 reText - very impressive text utilities in javascript
💎 jsPos - javascript build of the time-tested Brill-tagger
🚗 spaCy - speedy, multilingual tagger in C/python

For the former promise-library, see jnewman/compromise (Thanks Joshua!)

(also don't forget 🙇 NLTK, GATE, Stanford, and Illinois toolkit 🙇 )

Name		Name	Last commit message	Last commit date
Latest commit History 2,342 Commits
builds		builds
demo		demo
docs		docs
scripts		scripts
src		src
test		test
.esformatter		.esformatter
.eslintrc		.eslintrc
.gitignore		.gitignore
.npmignore		.npmignore
.travis.yml		.travis.yml
README.md		README.md
changelog.md		changelog.md
compromise.d.ts		compromise.d.ts
package.json		package.json
scratch.js		scratch.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

no jargon, | no config, | no training

API docs | Demos

Client-side!

🌋 Server-side!

Grab some words,

Plural/singular:

Number parsing:

Normalization:

Tense:

Named-entities:

Error correction:

Handy outputs:

and yes, ofcourse, there's a lot more stuff.

Join in - we're fun, using semver, and moving fast. get involved

Don't forget about:

About

Releases

Packages

Languages

noscripter/compromise

Folders and files

Latest commit

History

Repository files navigation

no jargon, | no config, | no training

API docs | Demos

Client-side!

🌋 Server-side!

Grab some words,

Plural/singular:

Number parsing:

Normalization:

Tense:

Named-entities:

Error correction:

Handy outputs:

and yes, ofcourse, there's a lot more stuff.

Join in - we're fun, using semver, and moving fast. get involved

Don't forget about:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages