Skip to content

Simple boilerplate to start crawling with Puppeteer + TypeScript + DB(TypeORM) + Docker

Notifications You must be signed in to change notification settings

tsugitta/easy-puppeteer-crawling-boilerplate

Repository files navigation

Easy Puppeteer Crawling Boilerplate

The environment you can implement a simple crawling application right now with Puppeteer, TypeScript and DB (TypeORM) on Docker

Usage

  1. Install dependencies (TypeScript related packages are required also in local for coding)
$ npm i
  1. Write your awesome crawling codes in src/crawl.ts

if you need to store data, define entities as well. see TypeORM. connection settings are already done.

  1. Run crawling
$ docker-compose run app
  1. Extract data

if necessary, extract data from postgres container.

$ docker-compose exec postgres psql -P pager=off -U postgres -c "select * from table;" -A -F $'\t' | sed '$d' > result.tsv

About

Simple boilerplate to start crawling with Puppeteer + TypeScript + DB(TypeORM) + Docker

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages