Search results
36 packages found
General natural language (tokenizing, stemming (English, Russian, Spanish), part-of-speech tagging, sentiment analysis, classification, inflection, phonetics, tfidf, WordNet, jaro-winkler, Levenshtein distance, Dice's Coefficient) facilities for node.
- natural language processing
- artificial intelligence
- statistics
- Porter stemmer
- Lancaster stemmer
- tokenizer
- bigram
- trigram
- quadgram
- ngram
- stemmer
- bayes
- classifier
- phonetic
Run Methodius from the command line. Analyze text for ngrams and frequencies with ease.
The easiest way to get n-gram chunks from strings or token arrays!
Character-based n-gram search index with Unicode support. Useful for implementing autocomplete in a functional programming style.
a tiny package to visualize ngram similarity in reasonably sized chunks of text
JavaScript search engine
- JavaScript
- search
- engine
- search-engine
- bitap
- typescript
- fulltext
- string-search
- TFIDF
- BM25
- KMeans
- Naive-Bayes
- spelling
- ngram
Determining the similarity of alphanumeric text based on trigram matching
JS version of ngram-fingerprint from Open Refine
Probabilistic data structures for large or streaming data sets.
Library for similarity identification
Takes in a text/file/stream and generates random sentences that sound like they could have been in the text
Get keywords (tokens) from any search query
String ngram splitter.
Minor modifications to the original `natural` node package: General natural language (tokenizing, stemming (English, Russian, Spanish), part-of-speech tagging, sentiment analysis, classification, inflection, phonetics, tfidf, WordNet, jaro-winkler, Levens
- natural language processing
- artificial intelligence
- statistics
- Porter stemmer
- Lancaster stemmer
- tokenizer
- bigram
- trigram
- quadgram
- ngram
- stemmer
- bayes
- classifier
- phonetic
Word generation based on n-gram models, and a cli utility to generate said models
Object-stream to create ngram tokens from strings
General natural language (tokenizing, stemming (English, Russian, Spanish), part-of-speech tagging, sentiment analysis, classification, inflection, phonetics, tfidf, WordNet, jaro-winkler, Levenshtein distance, Dice's Coefficient) facilities for node.
- natural language processing
- artificial intelligence
- statistics
- Porter stemmer
- Lancaster stemmer
- tokenizer
- bigram
- trigram
- quadgram
- ngram
- stemmer
- bayes
- classifier
- phonetic
Search by ngram similarity. An imitation of the Python NGram module.
A simple and fast module for creating and adding to ngram models and generating markov chains using these models.