IPhone-12-Amazon-Data-Extraction

Beautiful Soup is a Python library for extracting data from HTML and XML documents. It helps pull particular content from a web page, strip the HTML markup, and save the information. As a web scraping tool, it cleans up and parses documents that have been downloaded from the web. In this project, data is scraped from the Amazon web pages that list the iPhone 12. First, a function builds the URL for each successive page of iPhone 12 results (the URL pattern was found by inspecting the pages). Beautiful Soup then parses the HTML of each page into a tree, which makes the content easier to navigate. Finally, the extracted links are made active and saved to the attached CSV file.
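The workflow described above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the function names, the `BASE_URL` value, the `k`/`page` query parameters, and the sample HTML are all assumptions made for the example, and "making links active" is interpreted here as turning relative `href` values into absolute URLs. The sketch parses an inline HTML string so it runs without network access; in the real project the HTML would come from `requests.get(url).text`.

```python
import csv
import io
from urllib.parse import urlencode, urljoin

from bs4 import BeautifulSoup

BASE_URL = "https://www.amazon.com"  # assumed base URL, for illustration only


def build_page_urls(query, pages):
    """Return one search URL per results page (the URL pattern is an assumption)."""
    return [
        f"{BASE_URL}/s?{urlencode({'k': query, 'page': n})}"
        for n in range(1, pages + 1)
    ]


def extract_links(html, base_url=BASE_URL):
    """Parse HTML into a tree and return (title, absolute_url) pairs."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for anchor in soup.find_all("a", href=True):
        title = anchor.get_text(strip=True)
        if title:  # skip anchors with no visible text
            # urljoin turns a relative href into a full, clickable URL
            rows.append((title, urljoin(base_url, anchor["href"])))
    return rows


def write_csv(rows, handle):
    """Write the extracted rows to an open text file handle as CSV."""
    writer = csv.writer(handle)
    writer.writerow(["title", "url"])
    writer.writerows(rows)


# Hypothetical markup standing in for a downloaded results page.
SAMPLE_HTML = """
<div>
  <a href="/dp/EXAMPLE1"><span>Example phone, 64 GB</span></a>
  <a href="/dp/EXAMPLE2"><span>Example phone, 128 GB</span></a>
  <a href="/help"></a>
</div>
"""

page_urls = build_page_urls("iphone 12", 2)
rows = extract_links(SAMPLE_HTML)

buffer = io.StringIO()  # stands in for open("output.csv", "w", newline="")
write_csv(rows, buffer)
```

To write a real file, the `io.StringIO()` buffer would be replaced with `open("output.csv", "w", newline="")`; the file name here is hypothetical. Real Amazon pages use different markup than the sample, so the `find_all` selector would need to be adapted after inspecting the page.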

Libraries Used:

bs4 (which provides BeautifulSoup), re, requests

Programming Language

Python

IDE Used

Jupyter Notebook