
Node.js Crawler

Node Crawler is a web scraping tool built with Node.js that extracts data from websites and exports it to JSON. It is a flexible, customizable solution for scraping data for a variety of purposes.

Note: Websites may change their structure over time, so make sure the class names and selectors used in this script are up to date.

By default, the crawler fetches data from the URL specified in index.ts. You can change this URL by editing the paginationURLsToVisit array, for example:

const paginationURLsToVisit = ["https://scrapeme.live/shop"];

The crawler outputs the product details to the console. To save the data instead, modify the code in index.ts to write it to a file.
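
For reference, here is a minimal sketch of such a change, assuming the scraped items are collected into a products array; the Product shape below is illustrative, not the exact type used in this repository:

import { writeFileSync } from "fs";

// Illustrative shape for a scraped item; adapt it to the fields
// your selectors actually extract.
interface Product {
  name: string;
  price: string;
}

// Call this after the crawl loop instead of logging to the console.
function exportProducts(products: Product[]): void {
  writeFileSync("products.json", JSON.stringify(products, null, 2), "utf-8");
  console.log(`Saved ${products.length} products to products.json`);
}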

Installation

To install the necessary dependencies, run: yarn

Dependencies

This project relies on the following dependencies:

  • axios
  • cheerio
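
As a rough sketch of how the two fit together, axios downloads a page's HTML and cheerio parses it with jQuery-style selectors. The .product, h2, and .price selectors below are placeholders, not necessarily the ones used in this repository; verify them against the live page before relying on them.

import axios from "axios";
import * as cheerio from "cheerio";

// Download one shop page and pull out product names and prices.
async function scrapePage(url: string): Promise<{ name: string; price: string }[]> {
  const { data: html } = await axios.get<string>(url);
  const $ = cheerio.load(html);
  const products: { name: string; price: string }[] = [];
  $(".product").each((_, el) => {
    products.push({
      name: $(el).find("h2").text().trim(),
      price: $(el).find(".price").text().trim(),
    });
  });
  return products;
}

scrapePage("https://scrapeme.live/shop").then((items) => console.log(items));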

Development

To run the project in development mode with live reloading, run: yarn dev

To run a specific file:

yarn run ts-node src/category.ts
