Skip to content

This Python script scrapes data from the Supremo Tribunal Federal (STF) website jurisprudencia.stf.jus.br. It utilizes Playwright for browser automation and Selectolax for HTML parsing.

License

Notifications You must be signed in to change notification settings

mvdiogo/stf-web-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

STF Web Scraper

This Python script scrapes data from the Supremo Tribunal Federal (STF) website jurisprudencia.stf.jus.br. It utilizes Playwright for browser automation and Selectolax for HTML parsing.

Pre requisites

Before running the script, make sure you have the following installed:

  • Python 3.x
  • Playwright Python library (playwright)
  • BeautifulSoup Python library (BeautifulSoup)
  • Browser

You can install the dependencies using pip:

pip install playwright beautifulsoup4

You can install the Chromium using apt-get:

sudo apt-get install chromium

Usage

  1. Clone this repository to your local machine:
git clone https://github.com/mvdiogo/stf-web-scraper.git
  1. Navigate to the project directory:
cd stf-web-scraper
  1. Run the script:
python app.py
  1. The script will launch a browser window, navigate to the STF website, scrape the data based on the specified base and subject, and print the results to the console.

Contributing

Contributions are welcome! If you find any issues or want to add new features, feel free to open an issue or submit a pull request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

About

This Python script scrapes data from the Supremo Tribunal Federal (STF) website jurisprudencia.stf.jus.br. It utilizes Playwright for browser automation and Selectolax for HTML parsing.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages