Skip to content

Using Python to extract the financial data from XBRL instance documents.

Notifications You must be signed in to change notification settings

steffen-zou/Extract-financial-data-from-XBRL

Repository files navigation

In this project, I use Python to automate the getting of financial data by extracting the financial data from XBRL instance documents.

To view the project Extract_financial_data_from_XBRL.ipynb, click here which will display the project using Jupyter Notebook Viewer, as GitHub sometimes is not able to load this project.

The data_files folder contains the XBRL instance documents that are used in this project.

The stock_db_dump.sql file is a PostgreSQL database backup that contains a table that this project insert/update records to. The command to restore the backup can be found in the PostgreSQL documentation.

Below is the description of each column of the table that this project insert/update records to.

Column Description
company_name Company name
filings The type of filing that the financials data is retrieved from (e.g. 10-K, 20-F, 40-F)
reporting_currency Reporting currency
fiscal_year_ended Fiscal year ended
cash_and_equivalents Cash and cash equivalents
short_term_investments Short-term investments
current_debt Current portion of debt/notes
revenue Revenue
cost_of_revenue Cost of revenue
revenue_1_fy_ago Revenue from one fiscal year ago
operating_cash_flow Operating cash flow
capital_expenditure Capital expenditure. It consists of the following items under "Cash flows from investing activities":
  • Purchases of property, plant and equipment
  • Capitalized software development costs
  • Capitalized internal-use software
  • Purchase of intangible assets

About

Using Python to extract the financial data from XBRL instance documents.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published