scraper

A basic web scraper built completely in Python

Setup and Installation

First, get these files with git:

git clone https://github.com/gotlougit/scraper.git

Then run the setup.py file contained in this repository:

python3 setup.py install

Usage

Scraper is designed to be used to get information about the webpage through the tags and attributes present. The functions present are designed to get the webpage, get the HTML tags out of the webpage and then searching for a variety of tags with a specific attribute, and then getting the value of that attribute.

Get a webpage with the getPage() function, like this:

import scraper
page = scraper.getPage("www.github.com")

Get a list of tags using getTags(), for example:

tags = scraper.getTags(page)

Find the values of attributes of specific tags using the getTagAttributes() function.

attributes = scraper.getTagAttributes(tags,'a','title') #searches for the title attribute in the <a> tag

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
scraper		scraper
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

scraper

scraper

LICENSE

LICENSE

README.md

README.md

setup.py

setup.py

Repository files navigation

scraper

Setup and Installation

Usage

About

Releases

Packages

Languages

License

gotlougit/scraper

Folders and files

Latest commit

History

Repository files navigation

scraper

Setup and Installation

Usage

About

Resources

License

Stars

Watchers

Forks

Languages