GoScavenger

This repository contains a simple web scraper written in Go. The scraper is designed to connect to a specified website using HTTPS, retrieve HTML content, and extract specific data based on HTML class names.

Features

Connects to websites using HTTPS.
Reads and processes HTTP response headers.
Handles both fixed Content-Length and Transfer-Encoding: chunked responses.
Extracts content from HTML based on class names, ID and HTML tags.

Files in the Repository

main.go: Contains the main function that drives the web scraping process.
scraper.go: Includes the FindStringInTag, FindContentByID and FindContentByClass functions, which are used for parsing HTML and extracting content.

Getting Started

To use this scraper, you need to have Go installed on your machine. Download and install Go if you haven't already.

Installation

Clone the repository to your local machine:

git clone https://github.com/araujo88/GoScavenger.git
cd GoScavenger

Usage

Open main.go.
Modify the server variable to specify the website you want to scrape.
Optionally, adjust the request headers according to your requirements.
Run the scraper:

go run .

The output will be printed to the console.

Contributing

Contributions to improve this simple web scraper are welcome. Feel free to fork the repository and submit pull requests.

License

This project is licensed under the GPL License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
go.mod		go.mod
go.sum		go.sum
main.go		main.go
scraper.go		scraper.go
scraper_test.go		scraper_test.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GoScavenger

Features

Files in the Repository

Getting Started

Installation

Usage

Contributing

License

About

Releases

Packages

Languages

License

araujo88/GoScavenger

Folders and files

Latest commit

History

Repository files navigation

GoScavenger

Features

Files in the Repository

Getting Started

Installation

Usage

Contributing

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages