A crawler that collects information on all structured notes from the Taiwan TDCC website.

Taiwan Structured Product Information Crawler


This repository provides a StructuredProductCrawler class that crawls the Taiwan TDCC (Taiwan Depository & Clearing Corporation) website for structured product information.

Tutorial


from tdcc import StructuredProductCrawler
crawler = StructuredProductCrawler()
all_products = crawler.crawl()

crawl() returns a pandas DataFrame with the following columns:

| Column Name | Description |
| --- | --- |
| URL | the product's partial URL |
| UID | product ID |
| NAME | product name |
| CURRENCY | product denomination |
| MATURITY | maturity date |
| UNDERLYING | underlying asset type |
| PRINCIPAL_PROTECTION | % of principal protection |
| PI | professional investor |
| ISSUE_DATE | issue date |
| ISSUER | issuer |
| MASTER_AGENT | master agent |
| DISTRIBUTOR | distributor |
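
Because the result is an ordinary DataFrame, the usual pandas operations apply to it. The sketch below shows one way to filter and summarize by the columns listed above. It does not call the crawler: the rows are made-up placeholders, the helper `filter_products` is hypothetical and not part of the tdcc package, and it assumes PRINCIPAL_PROTECTION can be parsed as a number, which the actual crawl() output may or may not satisfy.

```python
import pandas as pd

# Placeholder rows standing in for crawler.crawl() output.
# All values are invented for illustration; real data will differ.
sample = pd.DataFrame(
    {
        "UID": ["A001", "A002", "A003"],
        "NAME": ["Note A", "Note B", "Note C"],
        "CURRENCY": ["USD", "TWD", "USD"],
        "PRINCIPAL_PROTECTION": ["100", "90", "100"],
    }
)


def filter_products(df, currency, min_protection):
    """Hypothetical helper: keep products in `currency` whose principal
    protection is at least `min_protection` percent."""
    # Assumption: the column may arrive as text, so coerce it to numbers;
    # unparseable values become NaN and are excluded by the comparison.
    protection = pd.to_numeric(df["PRINCIPAL_PROTECTION"], errors="coerce")
    mask = (df["CURRENCY"] == currency) & (protection >= min_protection)
    return df.loc[mask].reset_index(drop=True)


# Products denominated in USD with full principal protection.
usd_protected = filter_products(sample, "USD", 100)

# Number of products per denomination currency.
counts = sample.groupby("CURRENCY")["UID"].count().to_dict()
```

With real data, `sample` would be replaced by the DataFrame returned from `crawler.crawl()`; it is worth inspecting `all_products.dtypes` first to confirm how each column is actually encoded.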

Installation

To install the released version from PyPI, type:


pip install tdcc

To get the newest one from this repo (note that we are in the alpha stage, so there may be frequent updates), type:


pip install git+https://github.com/jn8029/tdcc.git

To-do

TBC
