A web application that returns a movie's or TV series' IMDb metadata given its title. Built this to learn and gain familiarity with Scrapy, Flask, and Heroku. I built this in a weekend in April 2014 to experiement with the above technologies for the very first time, so it's very rough. Check out how I parsed JSON then if you dare.
https://movie-crawlr.herokuapp.com
- Getting Started with Python on Heroku
- Scrapy Tutorial
- Flask Quickstart
- Jinja Template Designer Documentation
- Bootstrap CSS
- Google Python Style Guide
- XPath
- The Open Movie Database API - used this to get IMDb's movie id for queried movies.
- Could not install Scrapy in
virtualenv
error: distutils.errors.DistutilsError: Setup script exited with error: command 'cc' failed with exit status 1
- Solution on Stack Overflow
- Accidentally adding venv to git
- Fixed using
git reset HEAD^
- Pushing to Heroku
error: distutils.errors.DistutilsError: Setup script exited with error: command 'gcc' failed with exit status 1
- Solution on Stack Overflow
- Bash script could not run on Heroku
- Substituted
wget
withcurl
in get_imdb_url.sh