Skip to content

Web scraping (web harvesting or web data extraction) is a computer software technique of extracting information from websites.

License

Notifications You must be signed in to change notification settings

snehi23/Web-Scraping

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

Web-Scraping

Web scraping (web harvesting or web data extraction) is a computer software technique of extracting information from websites.

ASNs (Autonomous System Numbers) are one of the building blocks of the Internet. This project is to create a mapping from each ASN in use to the company that owns it. For example, ASN 36375 is used by the University of Michigan - https://bgp.he.net/AS36375

The site https://bgp.he.net/ has lots of useful information about ASNs. Starting at https://bgp.he.net/report/world crawl and scrape the linked country reports to make a structure mapping each ASN to info about that ASN. Sample structure: {3320: {'Country': 'DE', 'Name': 'Deutsche Telekom AG', 'Routes v4': 13547, 'Routes v6': 268}, 36375: {'Country': 'US', 'Name': 'University of Michigan', 'Routes v4': 14, 'Routes v6': 1}}

About

Web scraping (web harvesting or web data extraction) is a computer software technique of extracting information from websites.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages