GitHub - len-sla/NLP_Flair_Texhero_DistilBERT: using Flair, Texthero and DistilBERT to get good results

Project Name

Predict which tweets are about real dissters and which ones are not in a couple lines of code. Data for calculation taken from https://www.kaggle.com/c/nlp-getting-started

General info

If you need to get quickly some initial results in typical NLP task than using packages Flair, Texthero and DistilBERT would give quite good results.

Libraries and useful links

Status

Project is: in progress,

Inspiration

Project inspired by Kaggle nootebook

result on leaderboard

Second attempt

Rev_B_real_or_not.ipynb Results were worse compared with initial simple automatic approach. That proves how good/opimised Flair Framework is to get best results. Tweaking does not give better results. Maybe more extensive text cleaning and deciphering abbreviation and other shorcuts could do better result. Things like hero.visualization.wordcloud, kmeans, custom_pipeline were checked.

Info

Created by [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
README.md		README.md
Rev_B_real_or_not.ipynb		Rev_B_real_or_not.ipynb
pca.JPG		pca.JPG
real-or-not.JPG		real-or-not.JPG
real_or_not.ipynb		real_or_not.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project Name

General info

Libraries and useful links

Status

Inspiration

result on leaderboard

Second attempt

Info

About

Releases

Packages

Languages

len-sla/NLP_Flair_Texhero_DistilBERT

Folders and files

Latest commit

History

Repository files navigation

Project Name

General info

Libraries and useful links

Status

Inspiration

result on leaderboard

Second attempt

Info

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages