Web Scraping using Python

Web scraping simply refers to the extracting and processing of information/data from websites. This abitity to scrape data from the internet is very useful. Through scraping we can transform unstructured data (HTML format) on the web into structured data (database or spreadsheet) for easier manipulation.

We can perform scraping through various ways, but I prefer Python since it is easy and has some powerful libraries required for this task.

Dependencies used :

Urllib3 (fetches the web page):

It is a Python module used for fetching URLs. It contains useful functions and classes which help us with URL actions (basic and digest authentication, redirections, cookies, etc).

BeautifulSoup (Scrapes the data) :

It is a Python library for pulling data out of HTML and XML files.It works with a parser to provide idiomatic ways of navigating, searching, and modifying the parse tree. We can use it to extract tables, lists, paragraph and you can also put filters to extract information from web pages.

Examples :

Run the python files present in this repository to get the following results :

Image_Scraping.py

IMDb_Review_Scraping.py

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
images		images
IMDb_Review_Scraping.py		IMDb_Review_Scraping.py
Image_Scraping.py		Image_Scraping.py
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web Scraping using Python

Dependencies used :

Examples :

About

Releases

Packages

Languages

HeliosX7/web-scraping-python-beautifulsoup

Folders and files

Latest commit

History

Repository files navigation

Web Scraping using Python

Dependencies used :

Examples :

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages