Skip to content

liao961120/ptt-terms

master
Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Build Status Support Python Version Support Scrapy Version

Modification

To modify the behavior of the spider, edit the files marked with # in the directory tree below.

Directory structure of PTTdict/:

.
├── run.sh                # scrapy crawl parameters
├── view.json             # Auto-generated (for viewing)
├── scrapy.cfg
├── setup.py
│
├── PTTdict
│   ├── __init__.py
│   ├── items.py          # Define item fields
│   ├── middlewares.py
│   ├── pipelines.py
│   ├── postprocess
│   │   ├── __pycache__/
│   │   └── tidyup.py     # Process items before output
│   ├── __pycache__/
│   ├── settings.py       # Setting for item piplines
│   └── spiders
│       ├── dict.py       # Spider for scraping PTT wiki
│       ├── __init__.py
│       └── __pycache__/
└── data
    ├── dict_constr.R     # Filter & convert to data frame
    ├── index.Rmd         # Build Web Site
    ├── _site.yml
    └── style.css

About

Scrapy project for PTT 鄉民百科

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published