Skip to content
This repository has been archived by the owner on Apr 26, 2023. It is now read-only.
/ pws_remastered Public archive

A fresh take on poorlydrwanlines.com comic image scraper

Notifications You must be signed in to change notification settings

baduker/pws_remastered

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

54 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pws_remastered

 a remake of...
┌─┐┌─┐┌─┐┬─┐┬ ┬ ┬    
├─┘│ ││ │├┬┘│ └┬┘    
┴  └─┘└─┘┴└─┴─┘┴     
┬ ┬┬─┐┬┌┬┐┌┬┐┌─┐┌┐┌  
│││├┬┘│ │  │ ├┤ │││  
└┴┘┴└─┴ ┴  ┴ └─┘┘└┘  
┌─┐┌─┐┬─┐┌─┐┌─┐┌─┐┬─┐
└─┐│  ├┬┘├─┤├─┘├┤ ├┬┘
└─┘└─┘┴└─┴ ┴┴  └─┘┴└─
        version: 0.1.4

This is a remake of one of my first ever non-trivial Python scripts for a popular comic page poorlydrwanlines.com.

I've decided to give it another shot because I feel like I've learned a thing or two over the past year and the initial script lacks in certain areas.

learn.png

Here's a list of some the major changes and improvements over the first edition.

What's new?

  1. The script comes with a pre-loaded JSON to speed up scraping but also allows to narrow down the scraping scope to e.g. a year and/or a month and gives a nice little peek into the statistics (Still to be implemented)
  2. The scraping algorithm has been updated and now supports LXML and XPath.
  3. A new, colorful logo is there to great you each time you launch the script. Long live the ASCII art!

Requirements

  • Python 3.6+
  • requests
  • colorama
  • lxml

Create a new virtual environment with Python 3.6+ and then just type:

pip3 install -r requirements.txt

The dump

As of December 2019, the entire dump of comics can be found here.

About

A fresh take on poorlydrwanlines.com comic image scraper

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages