Webreg Scrapy

A Scrapy-based web scraper that retrieves UCI course information from the UCI University Registrar. I built it as a tool for the UCI Course API.

Usage

Use this scraper to grab course information and import it into a PostgreSQL database.

Process

The scraper is hosted on Heroku.
It executes the department spider to grab an updated list of departments.
It executes a course spider for each department in the department list.
It uploads all the information to the AWS RDS PostgreSQL database.
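
The steps above can be sketched as a small orchestration function. This is a minimal sketch, not the project's actual code: `run_scrape` and its three callables are hypothetical stand-ins for the department spider, the course spider, and the database upload.

```python
from typing import Callable, Dict, Iterable, List


def run_scrape(
    fetch_departments: Callable[[], Iterable[str]],
    fetch_courses: Callable[[str], List[Dict]],
    upload: Callable[[List[Dict]], None],
) -> int:
    """Crawl every department, upload the results, and return the course count."""
    all_courses: List[Dict] = []
    # Step 1: refresh the department list first, since it drives the course crawl.
    for department in fetch_departments():
        # Step 2: one course crawl per department.
        all_courses.extend(fetch_courses(department))
    # Step 3: upload everything in one batch once all crawls have finished.
    upload(all_courses)
    return len(all_courses)
```

Passing the three stages in as callables keeps the ordering visible and lets each stage be swapped for a stub when testing.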

Requirements

PostgreSQL

Development

Installing Dependencies

From within the root directory:

pip install -r requirements.txt

Running the Scraper

Start a PostgreSQL server with the required relations (tables) set up, then run:

# Crawl courses into the database
scrapy crawl course_scrapy
# Crawl courses into the database and also write them to courses.json
scrapy crawl course_scrapy -o courses.json
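
Once the second command has written courses.json, the export can be inspected from Python. A minimal sketch, assuming the file is the JSON array that Scrapy's `-o` option produces for a `.json` target; the `department` key is a hypothetical field name, so substitute whatever items.py actually defines.

```python
import json
from collections import Counter
from pathlib import Path
from typing import Dict, List


def load_courses(path: str = "courses.json") -> List[Dict]:
    """Read the exported feed: a JSON array with one object per scraped item."""
    return json.loads(Path(path).read_text(encoding="utf-8"))


def count_by_field(courses: List[Dict], field: str = "department") -> Counter:
    """Count items per value of `field`; items missing the field are grouped under None."""
    return Counter(course.get(field) for course in courses)
```

A quick count per department is a cheap sanity check that a crawl covered what it was expected to cover.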

Handling UCI Data Changes

Change items.py to define the new or changed fields
Change the way course_spider.py parses the pages
Change models.py to reflect the database schema
Change pipelines.py to manage the insertion of the new data
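
To see why all four files move together, here is a dependency-free sketch of one record travelling from item to parser to schema to insert. It is an illustration only: a dataclass stands in for the Scrapy item, `sqlite3` stands in for PostgreSQL, and every name (`CourseItem`, `parse_row`, `create_schema`, `insert_course`, and the column and key names) is hypothetical rather than taken from the project.

```python
import sqlite3
from dataclasses import astuple, dataclass
from typing import Dict


# items.py role: the fields a scraped course carries.
@dataclass
class CourseItem:
    department: str
    number: str
    title: str


# course_spider.py role: turn raw scraped values into an item.
def parse_row(raw: Dict[str, str]) -> CourseItem:
    return CourseItem(
        department=raw["dept"].strip(),
        number=raw["num"].strip(),
        title=raw["title"].strip(),
    )


# models.py role: the table needs a column for every item field.
def create_schema(conn: sqlite3.Connection) -> None:
    conn.execute(
        "CREATE TABLE IF NOT EXISTS course (department TEXT, number TEXT, title TEXT)"
    )


# pipelines.py role: the insert lists the same fields in the same order.
def insert_course(conn: sqlite3.Connection, item: CourseItem) -> None:
    conn.execute(
        "INSERT INTO course (department, number, title) VALUES (?, ?, ?)",
        astuple(item),
    )
```

In this sketch, adding a field means touching each of the four commented places, which mirrors the four files listed above.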

Roadmap

View the project roadmap here

Contributing

See CONTRIBUTING.md for contribution guidelines.
