Blook is a tool for turning blogs into ebooks. Give blook a blog's URL and it'll create a .epub from the content.
Clone this repo and install the prerequisites:
$ git clone https://github.com/kchodorow/blook.git
$ pip install beautifulsoup4 ebooklib
To use it, specify the main URL of the blog, e.g.:
$ python blook.py http://avc.com/
If you only want the latest N entries (for example, the blog is very large and you don't want every entry), you can give a limit:
$ python blook.py --limit 73 avc.com
This downloads the 73 most recent entries.
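As a rough illustration of what the limit does, here is a minimal sketch of parsing the two commands above and trimming a list of entries. It assumes an argparse-style command line and a list ordered newest first; `parse_args`, `apply_limit`, and the placeholder entries are hypothetical names for this example and are not taken from blook.py.

```python
import argparse


def parse_args(argv):
    """Parse a blog URL and an optional --limit (assumed CLI shape)."""
    parser = argparse.ArgumentParser(prog="blook.py")
    parser.add_argument("url", help="main URL of the blog")
    parser.add_argument("--limit", type=int, default=None,
                        help="only keep the latest N entries")
    return parser.parse_args(argv)


def apply_limit(entries, limit):
    """Return the first `limit` entries, or all of them when no limit is set.

    Assumes `entries` is ordered newest first.
    """
    if limit is None:
        return list(entries)
    return list(entries[:limit])


args = parse_args(["--limit", "73", "avc.com"])
entries = ["post-%d" % i for i in range(100)]  # placeholder data, newest first
selected = apply_limit(entries, args.limit)
print(args.url, len(selected))
```

Without --limit, `args.limit` is None and every entry is kept.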
If you find a blog that this does not parse correctly, please file an issue.
Alternatively, please feel free to add a new blog format:
- Add a file to filters/.
- Extend base.BaseEntry and base.BaseListing with the appropriate code (look at siat.py for examples).
- Add your filter to the filter list.
- Add some tests to filters/your_filter_test.py following the filters/nhl_test.py model.
- Make sure your test passes by running python -m filters.your_filter_test.
- Submit a pull request!
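The testing step might look roughly like the sketch below. It is self-contained and does not use blook's real classes: `ExampleEntry` and its `title` method are hypothetical stand-ins for a class extending base.BaseEntry, and the regex is a stdlib-only placeholder for the BeautifulSoup parsing a real filter would probably do. Check base.py and filters/nhl_test.py for the actual interface and test layout.

```python
import re
import unittest


class ExampleEntry:
    """Hypothetical stand-in for a class that extends base.BaseEntry."""

    def __init__(self, html):
        self.html = html

    def title(self):
        # Take the text of the first <h1> element, or None if there is none.
        match = re.search(r"<h1[^>]*>(.*?)</h1>", self.html, re.S)
        return match.group(1).strip() if match else None


class ExampleEntryTest(unittest.TestCase):
    def test_title(self):
        entry = ExampleEntry("<html><h1> Hello </h1><p>Body</p></html>")
        self.assertEqual(entry.title(), "Hello")

    def test_missing_title(self):
        self.assertIsNone(ExampleEntry("<p>No heading</p>").title())


def run_tests():
    """Run the test case and return the unittest result object."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(ExampleEntryTest)
    return unittest.TextTestRunner(verbosity=0).run(suite)


result = run_tests()
```

A test file meant to be run with python -m would more typically end with a call to unittest.main() under an `if __name__ == "__main__":` guard; the explicit runner here just keeps the sketch easy to embed.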