Skip to content

Ferramenta Python para extrair dados de páginas da web usando BeautifulSoup e expressões regulares. Esta ferramenta permite que você busque uma página da web e extraia dados específicos usando um padrão de expressão regular fornecido

Notifications You must be signed in to change notification settings

lue93/web-data-extractor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

web-data-extractor

web-data-extractor is a Python tool for extracting data from web pages using BeautifulSoup and regular expressions. This tool allows you to fetch a webpage and extract specific data using a provided regular expression pattern.

Features

  • Fetches web pages using provided URLs.
  • Extracts data using BeautifulSoup and regular expressions.
  • Command-line interface for easy usage.

Installation

  1. Clone the repository:

    git clone https://github.com/your-username/web-data-extractor.git cd web-data-extractor
  2. Create and activate a virtual environment (optional but recommended):

    python3 -m venv venv
    source venv/bin/activate
  3. Install the required dependencies:

    pip install -r requirements.txt

Usage

Run the script from the command line with the URL and regular expression as arguments:

python web_data_extractor.py <URL> <REGEX_PATTERN>

Requirements

  • Python 3.6 or higher
  • BeautifulSoup4
  • Requests

Contributing

Contributions are welcome! Please fork the repository and submit a pull request with your changes.

  1. Fork the repository
  2. Create a new branch (git checkout -b feature-branch)
  3. Commit your changes (git commit -m 'Add some feature')
  4. Push to the branch (git push origin feature-branch)
  5. Open a pull request

Contact

  • If you have any questions or suggestions, feel free to open an issue or contact the repository owner.

About

Ferramenta Python para extrair dados de páginas da web usando BeautifulSoup e expressões regulares. Esta ferramenta permite que você busque uma página da web e extraia dados específicos usando um padrão de expressão regular fornecido

Topics

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages