Skip to content

Estrellas-857/IDS703NLP_Final_Project

Repository files navigation

Sentiment Analysis of Movie Reviews

This is the repo for Duke IDS 703 NLP Final Project Fall 2023!!!

Group Members: George Wang, Yanzheng Wu, Yi Chen

About The Project

This project employs two approaches to perform sentiment analysis on movie reviews. The first approach is utilized by Naive Bayes classifier, and moreover a Markov Chain Text Generator was developed to create synthetic movie reviews. The second solution is based on a discriminative neural network, which combines Convolutional Neural Networks (CNNs) and Long Short-Term Memory networks (LSTMs). Our goal is to accurately classify reviews as either positive or negative. Click here to read our report.

Dataset

The dataset we choosed consists of movie reviews labeled as positive or negative.

https://www.kaggle.com/datasets/lakshmi25npathi/imdb-dataset-of-50k-movie-reviews/data

Conclusion

The CNN & LSTM model offers superior accuracy on real data for sentiment analysis due to their ability to capture nuanced sequential patterns, suffer from longer training times and require significant computational resources. Conversely, Naive Bayes offers greater interpretability and require less computational power, making them suitable for rapid development cycles and resource-constrained environments. However, they may not match the performance of CNN & LSTM models on tasks involving complex data patterns.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published