Skip to content

๐Ÿ Hydra: a multithreaded site-crawling link checker in Python standard library

License

Notifications You must be signed in to change notification settings

jquery/hydra-link-checker

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

10 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Hydra: multithreaded site-crawling link checker in Python

Tests status badge

A Python program that crawls slithers ๐Ÿ a website for links and prints a YAML report of broken links.

Requires

Python 3.6 or higher.

There are no external dependencies, Neo.

Usage

Run in a terminal:

python hydra.py [URL]

Ensure URL is an absolute url including schema, i.e. https://example.com.

The report will be YAML formatted. To save the output to a file, run:

python hydra.py [URL] > [PATH/TO/FILE.yaml]

You can add the current date to the filename using a command substitution, such as:

python hydra.py [URL] > /path/to/$(date '+%Y_%m_%d')_report.yaml

To see how long Hydra takes to check your site, add time:

time python hydra.py [URL]

Test

Run:

python -m unittest tests/test.py

About

๐Ÿ Hydra: a multithreaded site-crawling link checker in Python standard library

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Languages

  • Python 89.9%
  • HTML 10.1%