nlpPandas performs basic preprocessing on an NLP dataset: it takes a pandas DataFrame as input and returns a preprocessed DataFrame.
>>> import nlp-pandas
>>> nlpPandas = nlp-pandas.pass_data(data = some_df, target_column = some_column)
"""
- preprocessor "strong": remove NaN, lowercase, remove special characters, remove numbers, remove website links, remove emails, remove newlines (\n), collapse repeated whitespace
- preprocessor "base": remove NaN, remove website links, remove emails, remove newlines (\n), collapse repeated whitespace, *separate numbers from adjacent letters with whitespace ("3boys" -> "3 boys")
- preprocessor "weak": remove NaN, remove website links, remove emails, remove newlines (\n), collapse repeated whitespace, *separate numbers from adjacent letters with whitespace ("3boys" -> "3 boys")
- preprocessor "custom": under development
"""
>>> nlpPandas.use_preprocessor(preprocessor = "base")
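
To make the "base" steps concrete, here is a minimal sketch of what such a pipeline could look like. It is not the package's actual implementation: the function names `clean_text_base` and `preprocess_base` and the regular expressions are assumptions chosen for illustration, and the real preprocessor may match links and emails differently.

```python
import re

# Hypothetical patterns; the package's own rules may differ.
URL_PATTERN = re.compile(r"(?:https?://|www\.)\S+")
EMAIL_PATTERN = re.compile(r"\S+@\S+\.\S+")
DIGIT_LETTER_PATTERN = re.compile(r"(\d)([A-Za-z])")


def clean_text_base(text):
    """Apply the "base" steps listed above to a single string."""
    text = URL_PATTERN.sub(" ", text)        # remove website links
    text = EMAIL_PATTERN.sub(" ", text)      # remove emails
    text = text.replace("\n", " ")           # remove newlines
    text = DIGIT_LETTER_PATTERN.sub(r"\1 \2", text)  # "3boys" -> "3 boys"
    return re.sub(r"\s+", " ", text).strip()  # collapse repeated whitespace


def preprocess_base(df, target_column):
    """Return a copy of a pandas DataFrame with NaN rows dropped and text cleaned."""
    out = df.dropna(subset=[target_column]).copy()   # remove NaN
    out[target_column] = out[target_column].astype(str).map(clean_text_base)
    return out
```

With a DataFrame `some_df` and a text column named `some_column`, `preprocess_base(some_df, "some_column")` would return a new DataFrame and leave the original untouched.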
"""
- analyzer (under development) "base": count the occurrences of each word (returns a dictionary)
"""
>>> nlpPandas.use_analyzer(analyzer = "base")
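
Since the analyzer is still under development, the following is only a sketch of the word-count idea, not the package's code. The function name `word_counts` is hypothetical, and it assumes words are separated by whitespace.

```python
from collections import Counter


def word_counts(texts):
    """Count whitespace-separated words across an iterable of strings.

    Accepts a list or a pandas Series; non-string entries (e.g. NaN) are skipped.
    """
    counts = Counter()
    for text in texts:
        if isinstance(text, str):
            counts.update(text.split())
    return dict(counts)
```

For a preprocessed DataFrame, something like `word_counts(some_df["some_column"])` would give one dictionary covering the whole column.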
pip install nlpPandas