Crawlly: Web Page Crawler

Crawlly is a straightforward Python tool for web page crawling and scraping. It uses the requests library to send a GET request to the specified URL and the BeautifulSoup library to parse the HTML content of the response. The script then extracts all anchor `<a>` tags from the page and prints the URL of each link it finds.
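
The core behavior described above can be illustrated with a minimal sketch (an illustrative reconstruction based on this description, not the actual contents of crawlly.py; the function name and example URL are placeholders):

    import requests
    from bs4 import BeautifulSoup

    def crawl(url):
        # Send a GET request to the target page.
        response = requests.get(url)
        response.raise_for_status()

        # Parse the HTML content of the response.
        soup = BeautifulSoup(response.text, "html.parser")

        # Extract every anchor <a> tag and print the URL it points to.
        for anchor in soup.find_all("a", href=True):
            print(anchor["href"])

    if __name__ == "__main__":
        crawl("https://example.com")  # placeholder URL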

Features:

  • Lightweight and efficient web page crawler.
  • Scrapes and extracts URLs from a provided web page.
  • Utilizes the requests library for HTTP requests and BeautifulSoup for HTML parsing.

Installation:

  1. Clone the repository to your local machine using the following command:

    git clone https://github.com/Toothless5143/Crawlly.git && cd Crawlly
  2. Install the required dependencies by executing the following command (the expected contents of requirements.txt are sketched after this list):

    pip install -r requirements.txt
  3. Launch the tool by running the following command:

    python3 crawlly.py
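
For reference, requirements.txt is expected to list only the two libraries mentioned above (an assumption based on the description, not a copy of the actual file):

    # requirements.txt (assumed contents)
    requests
    beautifulsoup4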

License: This tool is open source and available under the MIT License.