To Run: Create the virtual environment:

```shell
python3 -m venv env
```

Activate the virtual environment:

```shell
source env/bin/activate
```

Install the requirements:

```shell
pip install -r requirements.txt
# or
pip3 install -r requirements.txt
```
To configure the mocks, edit the `.env` file. By default the mocks are set to the following:

```
MOCK_AWARD_CATEGORIES=False
MOCK_AWARD_PRESENTERS=False
MOCK_AWARD_WINNERS=False
MOCK_AWARD_NOMINEES=False
MOCK_HOSTS=False
MOCK_RED_CARPET=False
MOCK_SENTIMENT=False
```
Turning any of these on (setting it to `True`) enables the corresponding mock, which uses the data from `gg_apifake.py`.
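One plausible way these `MOCK_*` flags could be read is sketched below. This is an illustrative example only: the helper `mock_enabled` is hypothetical, and the project's actual `.env` loading code may differ.

```python
import os

def mock_enabled(name: str) -> bool:
    """Hypothetical helper: check whether a MOCK_* flag is turned on.

    Values in a .env file are strings, so "True"/"False" must be
    parsed explicitly rather than treated as Python booleans.
    """
    return os.environ.get(name, "False").strip().lower() == "true"

# Example: simulate a .env file that enables the hosts mock
os.environ["MOCK_HOSTS"] = "True"
print(mock_enabled("MOCK_HOSTS"))      # True
print(mock_enabled("MOCK_SENTIMENT"))  # False (unset flags default to off)
```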
To run the program: supplying no arguments runs it with the default values. You need to supply `--output_results` in order to get console output. Adding `--save_json` saves the JSON files to `gg_{year}_generated_answers.json` in the format expected by the autograder.
```shell
# This will print the results and save the JSON files
python Runner.py --output_results --year 2013 --save_json

# This will save the JSON files
python Runner.py --year 2013 --save_json

# This will print the results
python Runner.py --output_results --year 2013
```
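The flag behavior described above could be wired up with `argparse` roughly as follows. This is a sketch of the interface, not the actual contents of `Runner.py`, whose argument parsing may be implemented differently.

```python
import argparse

# Illustrative sketch of the CLI described above (not Runner.py itself)
parser = argparse.ArgumentParser(description="Golden Globes tweet analysis")
parser.add_argument("--year", type=int, default=2013,
                    help="award show year to analyze")
parser.add_argument("--output_results", action="store_true",
                    help="print the results to the console")
parser.add_argument("--save_json", action="store_true",
                    help="save gg_{year}_generated_answers.json for the autograder")

# Parse an example invocation instead of real sys.argv
args = parser.parse_args(["--output_results", "--year", "2013"])
print(args.year, args.output_results, args.save_json)  # 2013 True False
```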
To scrape our data from IMDb, we wrote the file `scraper.py`. The requirements for the scraper are installed with the `requirements.txt` file.
- Hosts
- Award Categories
- Presenters (not 100% accurate)
- Nominees (not 100% accurate)
- Winners
- Red Carpet
  - Best Dressed
  - Worst Dressed
  - Most Controversial
  - Three Most Discussed
- Sentiment Analysis
  - Sentiments regarding hosts
  - Most positive winner
  - Least positive winner
- Our code groups similar awards together under one award name with a set of aliases. These can be found in `award_aliases.json`. Because of this, the code may not work with all the subparts when mocking Award Categories.
- The files `TimeToJson.py` and `IntervalTester.py` create plots in the folder `test_tweets_time/` that visualize where tweets about certain awards fall on a chronological scale (we used these to identify presenters and nominees).
- The files in `saved_jsons/` are for internal use by the award category recognition function.
- The main Runner takes on average 4.5 minutes to run on a MacBook Pro with an M1 chip (with video and other programs running in the background). The extra sections add around 30 seconds in total.
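The alias-grouping idea behind `award_aliases.json` can be sketched as a lookup from alias strings to a canonical award name. The data below is invented for illustration, and the real file's structure and the project's matching logic may differ.

```python
# Hypothetical alias table; award_aliases.json's real contents may differ.
award_aliases = {
    "best motion picture - drama": {
        "best drama", "best picture drama", "best motion picture drama",
    },
    "best director - motion picture": {
        "best director", "best director motion picture",
    },
}

def canonical_award(name: str):
    """Map an alias (or the canonical name itself) to its canonical award name."""
    cleaned = name.strip().lower()
    for canonical, aliases in award_aliases.items():
        if cleaned == canonical or cleaned in aliases:
            return canonical
    return None  # no known alias matched

print(canonical_award("Best Drama"))  # best motion picture - drama
```

Grouping by canonical name this way lets tweets that mention any variant of an award name be counted toward the same award.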