Skip to content

reorx/readability

Repository files navigation

Readability

Another algorithm & implementation of widely known readability conception.

Usage:

import requests
from readability import Readability

html = requests.get('https://blog.hucheng.com/articles/482.html').content
parser = Readability(html.decode('utf8'))

parser.title
parser.article
parser.article.get_text()

About

html main body extractor

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published