Natural language processing (NLP) refers to the branch of computer science—and more specifically, the branch of artificial intelligence or AI—concerned with giving computers the ability to understand text and spoken words in much the same way human beings can.
Natural Language Projects :
- Part-of-Speech (POS) Tagging
- Spam Filtering
- Text Summarization
- Speech to Text
- Sentiment Analysis
- Information Retrieval
- Text / Document Classification
- Question Answering
- Machine Translation
Text Preprocessing Steps :
- Lower casing
- Removal of punctuation
- Removal of stopwords
- Removal of frequent words
- Removal of rare words
- Stemming
- Lemmatization
- Removal of emojis
- Removal of emoticons
- Conversion of emoticons to words
- Conversion of emojis to words
- Removal of URLs
- Removal of HTML tags
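
A few of the preprocessing steps above can be sketched with the Python standard library alone. This is a minimal illustration, not a reference pipeline: the `preprocess` function, the regular expressions, and the tiny `STOPWORDS` set are assumptions made for the example. Stemming, lemmatization, and emoji handling are left out because they usually rely on a library such as NLTK or spaCy.

```python
import re
import string

# Tiny illustrative stopword list (an assumption; real projects usually
# load a fuller list from a library such as NLTK or spaCy).
STOPWORDS = {"a", "an", "the", "is", "are", "to", "of", "and", "in", "at"}

URL_RE = re.compile(r"https?://\S+|www\.\S+")
HTML_TAG_RE = re.compile(r"<[^>]+>")
PUNCT_TABLE = str.maketrans("", "", string.punctuation)


def preprocess(text):
    """Return a list of cleaned tokens for one document."""
    text = HTML_TAG_RE.sub(" ", text)    # removal of HTML tags
    text = URL_RE.sub(" ", text)         # removal of URLs
    text = text.lower()                  # lower casing
    text = text.translate(PUNCT_TABLE)   # removal of punctuation
    tokens = text.split()                # naive whitespace tokenization
    return [t for t in tokens if t not in STOPWORDS]  # removal of stopwords


print(preprocess("<p>Check the docs at https://example.com, they ARE great!</p>"))
# ['check', 'docs', 'they', 'great']
```

The order matters in this sketch: URLs are removed before punctuation is stripped, otherwise `https://example.com` would collapse into a single junk token that the URL pattern no longer matches.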
Word-Vectorization :
- One-hot encoding
- BoW (Bag of Words)
- TF-IDF (Term Frequency - Inverse Document Frequency)
- Word embedding
- Word2Vec :
  - CBOW (Continuous Bag of Words)
  - Skip-gram
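
The vectorization schemes listed above can be illustrated on a toy corpus. The sketch below is hedged: the corpus, the helper names (`one_hot`, `bag_of_words`, `tf_idf`, `context_pairs`), and the unsmoothed `log(N / df)` form of IDF are assumptions chosen for clarity, and libraries often use smoothed variants. The last part does not train Word2Vec; it only shows how CBOW and skip-gram frame their training examples differently.

```python
import math
from collections import Counter

# Toy tokenized corpus (illustrative only).
docs = [
    ["good", "movie"],
    ["bad", "movie"],
    ["good", "good", "plot"],
]

# Vocabulary: every distinct token, sorted so the column order is stable.
vocab = sorted({tok for doc in docs for tok in doc})
index = {tok: i for i, tok in enumerate(vocab)}


def one_hot(token):
    """Vector with a single 1 at the token's vocabulary position."""
    vec = [0] * len(vocab)
    vec[index[token]] = 1
    return vec


def bag_of_words(doc):
    """Raw count of each vocabulary word in the document."""
    counts = Counter(doc)
    return [counts[tok] for tok in vocab]


def tf_idf(doc):
    """Term frequency times unsmoothed inverse document frequency."""
    counts = Counter(doc)
    vec = []
    for tok in vocab:
        tf = counts[tok] / len(doc)
        df = sum(1 for d in docs if tok in d)  # documents containing tok
        vec.append(tf * math.log(len(docs) / df))
    return vec


def context_pairs(tokens, window=1):
    """Yield (context_words, center_word) for each position."""
    for i, center in enumerate(tokens):
        context = tokens[max(0, i - window):i] + tokens[i + 1:i + 1 + window]
        yield context, center


sentence = ["the", "cat", "sat"]
# CBOW: predict the center word from its context words.
cbow = [(ctx, center) for ctx, center in context_pairs(sentence)]
# Skip-gram: predict each context word from the center word.
skip_gram = [(center, c) for ctx, center in context_pairs(sentence) for c in ctx]

print(vocab)
print(one_hot("good"), bag_of_words(docs[2]), tf_idf(docs[2]))
print(cbow)
print(skip_gram)
```

Note that "movie" gets a TF-IDF weight of zero in the third document simply because it does not occur there, while "plot" outweighs "good" despite appearing less often, since it is rarer across the corpus.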
Models :
MACHINE LEARNING MODELS :
- Naive Bayes
- Random Forest
- XGBoost
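
As an illustration of the first model in this list, here is a minimal multinomial Naive Bayes text classifier with add-one (Laplace) smoothing, applied to a toy spam-filtering task. The class name `NaiveBayes`, its methods, and the four training documents are assumptions made for the example; in practice a library implementation would normally be used instead.

```python
import math
from collections import Counter, defaultdict


class NaiveBayes:
    """Multinomial Naive Bayes over tokenized documents (illustrative sketch)."""

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)      # documents per class
        self.word_counts = defaultdict(Counter)  # word counts per class
        for doc, label in zip(docs, labels):
            self.word_counts[label].update(doc)
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict(self, doc):
        n_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n in self.class_counts.items():
            total = sum(self.word_counts[label].values())
            score = math.log(n / n_docs)  # log prior
            for w in doc:
                if w not in self.vocab:
                    continue  # ignore words never seen in training
                # Add-one smoothing keeps unseen (word, class) pairs non-zero.
                count = self.word_counts[label][w] + 1
                score += math.log(count / (total + len(self.vocab)))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


train_docs = [
    ["win", "cash", "now"],
    ["cheap", "cash", "offer"],
    ["meeting", "at", "noon"],
    ["lunch", "at", "noon"],
]
train_labels = ["spam", "spam", "ham", "ham"]

model = NaiveBayes().fit(train_docs, train_labels)
print(model.predict(["cash", "offer", "now"]))
print(model.predict(["lunch", "meeting"]))
```

Working in log space avoids numerical underflow when many small probabilities are multiplied together.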
DEEP LEARNING MODELS :
- RNN
- LSTM
- GRU
- Bidirectional LSTM
- Stacked LSTM
- Transformers ("Attention Is All You Need")
- BERT
- ALBERT
- DistilBERT
- GPT
- ELMo
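
The Transformer-based models in this list are built on scaled dot-product attention, introduced in "Attention Is All You Need". The sketch below computes it in plain Python for a single head with no masking, batching, or learned projections. The function names and toy inputs are assumptions made for the example, and real implementations use a tensor library.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """softmax(Q K^T / sqrt(d_k)) V, with matrices given as lists of rows."""
    d_k = len(keys[0])
    output = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        weights = softmax(scores)
        # Weighted average of the value vectors.
        output.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return output


queries = [[1.0, 0.0]]
keys = [[1.0, 0.0], [0.0, 1.0]]
values = [[1.0, 0.0], [0.0, 1.0]]
print(attention(queries, keys, values))
```

Here the query matches the first key more closely, so the output leans toward the first value vector while still mixing in part of the second.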