hackernews-web-scraper

A lightweight TypeScript webscraper to retrieve the homepage of hackernews.

System Requirements

This repo utilizes Node's experimental fetch. To run this script, you will need to use Node 18 or higher (see .nvmrc for the exact version this was designed for).

Instructions To Run

Clone the repo
npm i
npm start

It'll console.log the results of the scrape.

To get pages beyond the first page, you can set process.env.page to the page number you want to get. For example, to get the second page, you can run page=2 npm start.

Response Body

It'll be an array of objects, each object being a post on the homepage of hackernews. If a property does not exist on that object, it'll return null. The object will have the following properties:

{
  rank,
  title,
  link,
  author,
  age,
  points,
  comments
}

Contributions

Have an idea that makes this even better? Contributions are welcome! Please open an issue or PR. See CONTRIBUTING.md for more details.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
src		src
.gitignore		.gitignore
.nvmrc		.nvmrc
.prettierrc		.prettierrc
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

hackernews-web-scraper

System Requirements

Instructions To Run

Response Body

Contributions

About

Releases

Languages

License

deeheber/hackernews-web-scraper

Folders and files

Latest commit

History

Repository files navigation

hackernews-web-scraper

System Requirements

Instructions To Run

Response Body

Contributions

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Languages