goodreads Scraper

This project extracts the top 30 books on the Goodreads shelves and the most read books this week for six genres: Science Fiction, Travel, Thriller, Poetry, Fantasy, and Business. The extracted data is stored in a SQLite database. The project is useful for tracking the most popular books in these genres and analyzing trends in reading habits. It can also serve as a starting point for building a recommendation system or for further data analysis on book trends.

TBD: Finalize the Scrapy scripts.

Installation

To install the project and its dependencies, follow these steps:

1. Clone the repository to your local machine.
2. Navigate to the project directory.
3. Run the run.sh script to install Poetry and the project dependencies.
4. Run the run.sh script to scrape the top 30 books from Goodreads and store them in a SQLite database:

   ./run.sh scrape

5. (Optional) Run the run.sh script to run the project tests:

   ./run.sh unittest

6. (Optional) Run the run.sh script to lint the code:

   ./run.sh lint
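
The scrape step stores its results in a SQLite database. As a rough illustration of what that storage and a follow-up query could look like, here is a minimal sketch using Python's standard sqlite3 module. The table name `books`, its columns, the helper functions, and the placeholder rows are all assumptions made for this example; the project's actual schema and database location may differ.

```python
import sqlite3

# Hypothetical schema for illustration only; the real project may use
# different table and column names.
SCHEMA = """
CREATE TABLE IF NOT EXISTS books (
    genre    TEXT    NOT NULL,
    position INTEGER NOT NULL,
    title    TEXT    NOT NULL,
    author   TEXT    NOT NULL,
    PRIMARY KEY (genre, position)
)
"""


def save_books(conn, rows):
    """Store (genre, position, title, author) rows, replacing old entries."""
    conn.execute(SCHEMA)
    with conn:  # commits on success, rolls back on error
        conn.executemany(
            "INSERT OR REPLACE INTO books (genre, position, title, author) "
            "VALUES (?, ?, ?, ?)",
            rows,
        )


def top_books(conn, genre, limit=30):
    """Return up to `limit` (position, title, author) rows for one genre."""
    cursor = conn.execute(
        "SELECT position, title, author FROM books "
        "WHERE genre = ? ORDER BY position LIMIT ?",
        (genre, limit),
    )
    return cursor.fetchall()


# Placeholder rows, not real scraped data. An in-memory database keeps
# the sketch self-contained.
conn = sqlite3.connect(":memory:")
save_books(
    conn,
    [
        ("Poetry", 2, "Placeholder Title B", "Placeholder Author B"),
        ("Poetry", 1, "Placeholder Title A", "Placeholder Author A"),
        ("Travel", 1, "Placeholder Title C", "Placeholder Author C"),
    ],
)
poetry = top_books(conn, "Poetry")
```

Using the genre and position as a composite primary key means a repeated scrape overwrites the previous ranking for that slot instead of adding duplicate rows. A real run would pass a file path to `sqlite3.connect` instead of `":memory:"`.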
