python_beautifulsoup

In this repository you can find the code present in this post on devinsimplewords blog.
This post explain how to use the beautifulsoup package and here you can find two different Python files.
The first one is parse_html_file.py and it analyzes the html file present in the html_file folder.
The second one is web_scraping.py and it performs a simple web scraping task on the 'HelloWorld' Wikipedia page.

Steps to run the script:

prepare a virtualenv and install what it is present in requirements.txt file using the following command:
```
pip install -r requirements.txt
```
If you don't know how to create and activate a virtualenv, please check this post

Run the script using the following command:

python parse_html_file.py

or

python web_scraping.py

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
html_file		html_file
.gitignore		.gitignore
README.md		README.md
parse_html_file.py		parse_html_file.py
requirements.txt		requirements.txt
web_scraping.py		web_scraping.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

html_file

html_file

.gitignore

.gitignore

README.md

README.md

parse_html_file.py

parse_html_file.py

requirements.txt

requirements.txt

web_scraping.py

web_scraping.py

Repository files navigation

python_beautifulsoup

About

Releases

Packages

Languages

devinsimplewords/python_beautifulsoup

Folders and files

Latest commit

History

Repository files navigation

python_beautifulsoup

About

Resources

Stars

Watchers

Forks

Languages