webscrapping

Sample repository with basic web scraping examples

Welcome DSAC Students.

I'm Sumanth. You can find all the resources here.

Presentation Link

Environment Needed

Python 3 🐍
The Python packages listed below 📦
- pip install lxml
- pip install Scrapy
- pip install requests
- pip install gTTS (Optional)
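
To confirm the environment is ready, a small check like the one below may help. It is only a sketch: the `REQUIRED` and `OPTIONAL` mappings and the `missing_packages` helper are names made up for this example, and it relies only on the standard library.

```python
from importlib.util import find_spec

# pip name -> import name. The two differ for some packages
# (pip install Scrapy -> import scrapy, pip install gTTS -> import gtts).
REQUIRED = {"lxml": "lxml", "Scrapy": "scrapy", "requests": "requests"}
OPTIONAL = {"gTTS": "gtts"}


def missing_packages(packages):
    """Return the pip names whose import name cannot be found."""
    return [
        pip_name
        for pip_name, module_name in packages.items()
        if find_spec(module_name) is None
    ]


if __name__ == "__main__":
    for label, packages in (("required", REQUIRED), ("optional", OPTIONAL)):
        missing = missing_packages(packages)
        if missing:
            print(f"Missing {label} packages: {', '.join(missing)}")
        else:
            print(f"All {label} packages are installed.")
```

Any package reported as missing can be installed with the matching `pip install` command from the list above.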

Why should I use these packages?

LXML:

lxml is a Pythonic, mature binding for the libxml2 and libxslt libraries. It provides safe and convenient access to these libraries using the ElementTree API.
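
As a rough illustration, the sketch below parses a small HTML snippet with lxml and pulls values out with XPath. The `PAGE` markup and the `extract_books` helper are invented for this example, and the page is inlined so nothing is downloaded.

```python
from lxml import html

# Hypothetical page content, inlined so the example needs no network access.
PAGE = """
<html><body>
  <h1>Books</h1>
  <ul>
    <li class="book"><a href="/b/1">Dune</a></li>
    <li class="book"><a href="/b/2">Emma</a></li>
  </ul>
</body></html>
"""


def extract_books(page_source):
    """Return (title, link) pairs for every <li class="book"> entry."""
    tree = html.fromstring(page_source)
    # XPath selects the link text and the href attribute of each book entry.
    titles = tree.xpath('//li[@class="book"]/a/text()')
    links = tree.xpath('//li[@class="book"]/a/@href')
    return list(zip(titles, links))


print(extract_books(PAGE))
```

On a real site the page source would typically come from an HTTP response body, and the XPath expressions would need to match that site's markup.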

Scrapy:

Scrapy is a fast high-level web crawling and web scraping framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.

Requests:

Requests is a simple yet elegant HTTP library. It lets you send HTTP/1.1 requests with very little code: there is no need to manually add query strings to your URLs or to form-encode your PUT and POST data. For JSON payloads, just pass the json argument.
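
The sketch below shows both ideas under a few assumptions: the `example.com` URLs and the helper names are placeholders. The requests are prepared but never sent, so the example runs offline; `fetch_json` shows what an actual call could look like.

```python
import json

import requests


def build_search_request(base_url, query, page=1):
    """Prepare a GET request; Requests builds the query string from params."""
    request = requests.Request("GET", base_url, params={"q": query, "page": page})
    return request.prepare()


def build_json_post(url, payload):
    """Prepare a POST request; the json argument serializes the payload."""
    return requests.Request("POST", url, json=payload).prepare()


def fetch_json(url, params=None, timeout=10):
    """Send a GET request and decode a JSON response (needs network access)."""
    response = requests.get(url, params=params, timeout=timeout)
    response.raise_for_status()  # raise an exception on 4xx/5xx responses
    return response.json()


prepared_get = build_search_request("https://example.com/search", "scrapy")
print(prepared_get.url)

prepared_post = build_json_post("https://example.com/api", {"name": "DSAC"})
print(prepared_post.headers["Content-Type"])
print(json.loads(prepared_post.body))
```

Passing a `timeout` to `requests.get` is a deliberate choice here, since without one a request can wait indefinitely on an unresponsive server.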

gTTS:

gTTS (Google Text-to-Speech) is a Python library and CLI tool that interfaces with Google Translate's text-to-speech API. It can write spoken mp3 data to a file, to a file-like object (bytestring) for further audio manipulation, or to stdout. It can also pre-generate Google Translate TTS request URLs to feed to an external program.

