Skip to content

feat: add 5 China authoritative data sources (2026-05-05 AM batch)#209

Merged
mingcha-dev merged 1 commit intoMLT-OSS:mainfrom
firstdata-dev:feat/add-china-sources-20260505-am
May 5, 2026
Merged

feat: add 5 China authoritative data sources (2026-05-05 AM batch)#209
mingcha-dev merged 1 commit intoMLT-OSS:mainfrom
firstdata-dev:feat/add-china-sources-20260505-am

Conversation

@firstdata-dev
Copy link
Copy Markdown
Collaborator

Summary

Adding 5 new Chinese authoritative data sources (morning batch, 2026-05-05).

New Sources

ID Name (EN / ZH) Category Authority
china-abc Agricultural Bank of China / 中国农业银行 finance/banking commercial
china-ccb China Construction Bank / 中国建设银行 finance/banking commercial
china-psbc Postal Savings Bank of China / 中国邮政储蓄银行 finance/banking commercial
china-chinalife China Life Insurance / 中国人寿保险 finance/insurance commercial
china-vip VIP (Chongqing VIP Information) / 维普网 research/education commercial

Why These Sources

  • Three major state-owned banks (ABC, CCB, PSBC) complement the existing BOC and ICBC entries, completing coverage of China's Big Five commercial banks and adding unique angles: agricultural/rural finance (ABC), housing mortgage and infrastructure finance (CCB), and inclusive finance for rural and low-income populations (PSBC).
  • China Life Insurance is China's largest life insurer and one of the most important institutional investors in Chinese capital markets. Its embedded value, premium income, and investment portfolio disclosures are essential references for long-term capital flow analysis.
  • VIP (维普网) is the third major Chinese academic literature database, alongside the already-indexed CNKI and Wanfang. Its applied-technology and industry-journal coverage is particularly valuable for engineering and industrial research.

Validation

  • ✅ ID dedup: all 5 IDs unique against 683 existing sources
  • ✅ Domain dedup: all 5 website domains unique against existing 638 domains
  • ✅ Blacklist check: all clear
  • ✅ URL accessibility: all websites return 200/301/302
  • make check: validation + ID uniqueness + domain consistency all pass
  • ✅ Total source count: 683 → 688

Source Distribution After Merge

  • China (CN) sources: +5
  • Finance/banking: +3
  • Finance/insurance: +1
  • Research/education: +1

New sources:
- china-abc: Agricultural Bank of China (农业银行) - rural finance, agricultural lending
- china-ccb: China Construction Bank (建设银行) - housing mortgage, infrastructure finance
- china-psbc: Postal Savings Bank of China (邮储银行) - inclusive finance, rural banking
- china-chinalife: China Life Insurance (中国人寿) - life insurance, institutional investment
- china-vip: VIP/Chongqing VIP Information (维普网) - academic literature database

All sources verified:
- ID and website domain dedup checks passed
- Blacklist check: all clear
- URL accessibility: all return 200/301/302/403
- make check: validation + unique IDs + domain consistency all pass

Total source count: 683 → 688
Copy link
Copy Markdown
Collaborator

@mingcha-dev mingcha-dev left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

明察 QA Review — PR #209 APPROVED ✅

Checklist

  • ✅ CI 三项全绿(secrecy / schema / validate)
  • ✅ 保密(body + 5 文件内容)
  • ✅ ID 去重(5 新 ID 全库唯一)
  • ✅ 缩写冲突排查:abc / ccb / psbc / chinalife / vip 均无已有冲突
  • ✅ 域名去重
  • ✅ URL + title:
    • abc: 中国农业银行 ✓
    • ccb: [200] SPA 无 title,域名 ccb.com 官方(whois 略)
    • psbc: [200] SPA 无 title,域名 psbc.com 官方
    • chinalife: 中国人寿保险股份有限公司官网 ✓
    • vip: 维普网 ✓
  • ✅ Domains kebab-case(3-4/文件)
  • ✅ Tags 23-24/文件,无空格 / 乱码

覆盖价值

  • ABC / CCB / PSBC:补齐"四大行"最后缺口(工商、中国已收录,这次 ABC+CCB+PSBC,五大/六大行全覆盖)
  • chinalife:保险业龙头,补 insurance 空白
  • VIP(维普):学术文献检索(和 CNKI/万方形成三足)

非阻塞建议

  • ccb 站点使用 http(http://www.ccb.com)— https 也返回 200,建议未来 Tier 2 扫描 warn 时升 https
  • 银行数据 authority=commercial,符合分类规范

Merge 🚀

@mingcha-dev mingcha-dev merged commit 7dca0de into MLT-OSS:main May 5, 2026
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants