VOZ crawler

cloning all comments of the Stock channel on VOZ

Packages (includes frontend and backend)

location:

frontend: app/frontend
backend: app/backend

# run at root project

## up the related services
docker-compose up -d 

# generate db (do this at the first time)
./scripts/clone.sh

# install dependencies
yarn install

# build common libs
yarn build:shared 

# start frontend service
yarn start:fe

# start backend service
yarn start:be # run backend server

Crawler service

Setup

# prepare env for crawl job (just support centos env)
./scripts/setupEnv.sh

# crawling data
./scripts/crawl.sh

Data

data/comments.csv
data/comments.xlsx

Database

location: ./crawler/databases/*
update the database backup: ./script/dump.sh

Configuration

By default, the script is crawling data from Stock with 30 Codes.

./spiders/voz_stock.py
stockCodes=[
    ...
]

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
.vscode		.vscode
app		app
crawler		crawler
scripts		scripts
.gitignore		.gitignore
README.md		README.md
SECURITY.md		SECURITY.md
docker-compose.yml		docker-compose.yml
package.json		package.json
scrapy.cfg		scrapy.cfg
yarn.lock		yarn.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VOZ crawler

Packages (includes frontend and backend)

Crawler service

Setup

Data

Database

Configuration

About

Releases

Packages

Languages

karrot0195/voz-crawler

Folders and files

Latest commit

History

Repository files navigation

VOZ crawler

Packages (includes frontend and backend)

Crawler service

Setup

Data

Database

Configuration

About

Topics

Resources

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages