GitHub - gtchakama/nodejs-web-scrapper: This code provides a web scraping API that extracts information from the HTML of a given website.

Web Scraping API

This code provides a web scraping API that extracts information from the HTML of a given website. It uses the following modules:

express: to create an instance of the Express application and define routes
axios: to make HTTP requests to the website to be scraped
cheerio: to parse the HTML and extract data from it

How to use

Clone or download this repository.
Install the required dependencies by running npm install in the project directory.
Start the server by running node index.js.
Make a GET request to the /scrape endpoint with a url query parameter set to the URL of the website you want to scrape. Example: http://localhost:3000/scrape?url=https://www.example.com.

The API will respond with a JSON object containing the extracted data from the website:

{
  "title": "Example Domain",
  "description": "Example Domain. This domain is established to be used for illustrative examples in documents. You may use this domain in examples without prior coordination or asking for permission.",
  "paragraph": "This domain is established to be used for illustrative examples in documents. You may use this domain in examples without prior coordination or asking for permission."
}

If there is an error during the scraping process, the API will respond with a 500 status code and an error message.

Dependencies

express: ^4.17.1
axios: ^0.21.1
cheerio: ^1.0.0-rc.3

License

This code is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
index.js		index.js
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web Scraping API

How to use

Dependencies

License

About

Releases

Packages

Languages

gtchakama/nodejs-web-scrapper

Folders and files

Latest commit

History

Repository files navigation

Web Scraping API

How to use

Dependencies

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages