Twitter-Sentiment-Analysis-Using-Python

Twitter Sentiment Analysis uses NLP and machine learning to classify tweets as positive, negative, or neutral. This project processes tweets individually or as part of a larger dataset to understand emotions and opinions.

Dataset Details

The dataset used is the Sentiment140 Dataset, containing 1,600,000 tweets extracted via the Twitter API. The columns in this dataset are:

target: The polarity of the tweet (positive or negative)
ids: Unique ID of the tweet
date: The date of the tweet
flag: The query (if no query exists, it's "NO QUERY")
user: The name of the user who tweeted
text: The text of the tweet

You can dowload the dataset from here 👉 : https://www.kaggle.com/datasets/kazanova/sentiment140

Project Pipeline

The steps involved in the machine learning pipeline are:

Import necessary dependencies
Read and load the dataset
Perform exploratory data analysis (EDA)
Visualize target variables
Preprocess the data
Split data into train and test sets
Transform the dataset using TF-IDF Vectorizer
Create a function for model evaluation
Build the model
Evaluate the model

Model Evaluation

Accuracy

Logistic Regression performs better than SVM, which in turn performs better than Bernoulli Naive Bayes.

F1-score

Class 0:
- Bernoulli Naive Bayes: 0.90
- SVM: 0.91
- Logistic Regression: 0.92
Class 1:
- Bernoulli Naive Bayes: 0.66
- SVM: 0.68
- Logistic Regression: 0.69

AUC Score

All three models have the same ROC-AUC score.

Conclusion

Logistic Regression is the best model for this dataset.
Following Occam’s Razor, Logistic Regression is the simplest and most effective model due to the lack of assumptions in the dataset.
This project demonstrates how Twitter Sentiment Analysis can be used to understand public emotions in tweets. We preprocess the data and feed it into ML models to achieve the best accuracy.

Key Takeaways

Twitter Sentiment Analysis identifies and classifies sentiments in text.
Logistic Regression, SVM, and Naive Bayes are effective algorithms for this task.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
code.ipynb		code.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Twitter-Sentiment-Analysis-Using-Python

Dataset Details

You can dowload the dataset from here 👉 : https://www.kaggle.com/datasets/kazanova/sentiment140

Project Pipeline

Model Evaluation

Accuracy

F1-score

AUC Score

Conclusion

Key Takeaways

About

Releases

Packages

Languages

sathvik995/Twitter-Sentiment-Analysis-Using-Python

Folders and files

Latest commit

History

Repository files navigation

Twitter-Sentiment-Analysis-Using-Python

Dataset Details

You can dowload the dataset from here 👉 : https://www.kaggle.com/datasets/kazanova/sentiment140

Project Pipeline

Model Evaluation

Accuracy

F1-score

AUC Score

Conclusion

Key Takeaways

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages