General natural language facilities for Node.js: tokenizing, stemming (English, Russian, Spanish), part-of-speech tagging, sentiment analysis, classification, inflection, phonetics, tf-idf, WordNet, Jaro-Winkler distance, Levenshtein distance, and Dice's coefficient.
- natural language processing
- artificial intelligence
- statistics
- Porter stemmer
- Lancaster stemmer
- tokenizer
- bigram
- trigram
- quadgram
- ngram
- stemmer
- bayes
- classifier
- phonetic
- metaphone
- inflector
- WordNet
- tf-idf
- logistic regression
- doublemetaphone
- double
- jaro-winkler distance
- levenshtein distance
- string distance
- part-of-speech tagger
- Eric Brill
- Brill tagger
- sentiment analysis
- maximum entropy modelling