v1.0.0 - SEC Note Report Extractor

@Charlie-El Charlie-El released this 10 Jun 08:17
· 4 commits to main since this release

v1.0.0 Release Notes

This is the first public release of the project. It includes a Windows desktop application for end users and a source repository for reproducible builds.

Included in this release

  • Annual extraction workflow
  • Quarterly extraction workflow
  • Optional remapped workbook output (requires a mapping workbook)
  • Unmatched company output
  • Chinese and English README files
  • Packaged application attached through GitHub Releases

Core rules

  • The application runs on local SEC note datasets by default
  • Quarterly mode prefers each company's latest quarterly filing
  • If, for the same company, the latest annual filing in the note data has a later filed_date than the latest quarterly filing, the application uses the annual filing instead
  • A remapped workbook is generated only when a mapping workbook is provided
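
The quarterly-mode selection rule above can be sketched as follows. This is a minimal illustration under stated assumptions, not the application's actual code: the `Filing` record, its `company_id` and `kind` fields, and the `pick_filing` helper are hypothetical names; only `filed_date` appears in these notes. The fallback to the annual filing when a company has no quarterly filing is also an assumption.

```python
from dataclasses import dataclass
from datetime import date
from typing import Iterable, Optional


@dataclass(frozen=True)
class Filing:
    # Hypothetical record; only `filed_date` is named in the release notes.
    company_id: str
    kind: str  # "annual" or "quarterly"
    filed_date: date


def pick_filing(filings: Iterable[Filing]) -> Optional[Filing]:
    """Choose one filing for a single company in quarterly mode."""
    filings = list(filings)
    annual = [f for f in filings if f.kind == "annual"]
    quarterly = [f for f in filings if f.kind == "quarterly"]
    latest_annual = max(annual, key=lambda f: f.filed_date, default=None)
    latest_quarterly = max(quarterly, key=lambda f: f.filed_date, default=None)

    if latest_quarterly is None:
        # Assumption: with no quarterly filing, fall back to the annual one.
        return latest_annual
    if latest_annual is not None and latest_annual.filed_date > latest_quarterly.filed_date:
        # The annual filing was filed later, so it replaces the quarterly one.
        return latest_annual
    return latest_quarterly
```

In this sketch a tie on `filed_date` keeps the quarterly filing, since the rule only switches when the annual filing is strictly later.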

Output files

Annual mode:

  • 美股年报提取_源表.xlsx (source table)
  • 美股年报提取_未匹配公司.xlsx (unmatched companies)
  • 美股年报提取_修改映射.xlsx (modified mapping, optional)

Quarterly mode:

  • 美股季报提取_源表.xlsx (source table)
  • 美股季报提取_未匹配公司.xlsx (unmatched companies)
  • 美股季报提取_修改映射.xlsx (modified mapping, optional)
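
As a rough sketch of how the file lists above follow from the mode and the optional mapping workbook, the names could be derived like this. The `output_names` helper and its parameters are hypothetical and only illustrate the naming pattern; the application may build these names differently.

```python
from typing import List, Optional

# File name prefixes taken from the lists above.
PREFIX = {"annual": "美股年报提取", "quarterly": "美股季报提取"}


def output_names(mode: str, mapping_path: Optional[str] = None) -> List[str]:
    """Return the expected output file names for a run (hypothetical helper)."""
    prefix = PREFIX[mode]
    names = [f"{prefix}_源表.xlsx", f"{prefix}_未匹配公司.xlsx"]
    if mapping_path is not None:
        # The remapped workbook is produced only when a mapping workbook is given.
        names.append(f"{prefix}_修改映射.xlsx")
    return names
```

For example, `output_names("quarterly")` yields the two mandatory files, and passing any mapping path adds the third.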

Data source

Official SEC download page:
https://www.sec.gov/data-research/sec-markets-data/financial-statement-notes-data-sets

The application is designed for the SEC Financial Statement and Notes Data Sets, which provide structured text and numeric information from financial statements and footnotes.
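
For orientation, here is a small sketch of reading submission rows from a local copy of the data. It assumes the datasets ship tab-delimited text files with a header row and that the submissions file carries `adsh`, `cik`, `name`, `form`, and `filed` (a YYYYMMDD date) columns; check these against the readme distributed with each dataset. Mapping `filed` to the `filed_date` name used in these notes is also an assumption, `read_submissions` is a hypothetical helper, and the sample rows are made-up placeholders rather than real filings.

```python
import csv
import io
from datetime import datetime

# Made-up placeholder rows in the assumed tab-delimited layout.
SAMPLE = (
    "adsh\tcik\tname\tform\tfiled\n"
    "0000000000-25-000001\t1\tEXAMPLE CO\t10-K\t20250215\n"
    "0000000000-24-000009\t1\tEXAMPLE CO\t10-Q\t20241101\n"
)


def read_submissions(handle):
    """Parse submission rows and attach a parsed `filed_date` to each."""
    rows = []
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        # Assumption: `filed` holds the filing date as YYYYMMDD.
        row["filed_date"] = datetime.strptime(row["filed"], "%Y%m%d").date()
        rows.append(row)
    return rows


rows = read_submissions(io.StringIO(SAMPLE))
```

With a real dataset, the same function could be given a file opened with `open(path, encoding="utf-8", newline="")` in place of the in-memory sample.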

Public release note

  • No private business datasets are included in the repository
  • No real business output files are included in the repository
  • Packaged application files are distributed via GitHub Releases instead of being committed to Git