feat: add 5 Chinese authoritative data sources (2026-04-22 AM) #167
Merged
firstdata-dev merged 2 commits into main, Apr 22, 2026
Conversation
- china-cass: Chinese Academy of Social Sciences (中国社会科学院)
  Economics, sociology, political science, law, history research
- china-catas: Chinese Academy of Tropical Agricultural Sciences (中国热带农业科学院)
  Tropical crops, rubber, coconut, spices; Hainan-based national lab
- china-iap: Institute of Atmospheric Physics, CAS (中国科学院大气物理研究所)
  Climate models, East Asian monsoon, ocean heat content, atmospheric chemistry
- china-scidb: Science Data Bank / ScienceDB (中国科学数据)
  CAS open multidisciplinary research data repository, FAIR data principles
- china-plant-csdb: China Plant Thematic Database (中国植物主题数据库)
  CAS flora database with 35,000+ species, distributions, herbarium records
mingcha-dev (Contributor) reviewed on Apr 22, 2026:
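🔍 Mingcha QA (明察): PR #167 (5 data sources, morning batch)
🔴 Two duplicates/issues
- china-scidb (scidb.cn): second occurrence! Already flagged in PR #165: the existing china-cas entry has data_url = scidb.cn. Must be removed.
- china-cass (cass.cn → cass.ac.cn): ⚠️ cass.cn is on the blacklist (recorded in PR #126), but this entry uses cass.ac.cn. Need to confirm whether the two count as the same domain.
① ID duplicate check (main) ✅
①b Website cross-deduplication
- scidb.cn → already covered by the china-cas data_url 🔴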
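The website cross-deduplication step can be sketched roughly as follows. This is only an illustration, not the project's actual check: the record layout (dicts with `id`, `website`, and `data_url` keys) and the function names are assumptions; only `data_url` and the source IDs come from the review.

```python
from urllib.parse import urlparse


def host_of(url: str) -> str:
    """Return the lowercase hostname of a URL, without a leading 'www.'."""
    # urlparse only fills in the host when a scheme is present, so add one if missing.
    if "://" not in url:
        url = "https://" + url
    host = (urlparse(url).hostname or "").lower()
    return host[4:] if host.startswith("www.") else host


def find_url_overlaps(existing: list[dict], candidates: list[dict]) -> list[tuple[str, str, str]]:
    """Return (candidate_id, existing_id, host) for each candidate whose
    website or data_url host is already used by an existing source."""
    seen: dict[str, str] = {}
    for src in existing:
        for field in ("website", "data_url"):  # 'website' is an assumed field name
            if src.get(field):
                seen.setdefault(host_of(src[field]), src["id"])

    overlaps = []
    for cand in candidates:
        hosts = {host_of(cand[f]) for f in ("website", "data_url") if cand.get(f)}
        for host in sorted(hosts):
            if host in seen:
                overlaps.append((cand["id"], seen[host], host))
    return overlaps


existing = [{"id": "china-cas", "data_url": "scidb.cn"}]
candidates = [
    {"id": "china-scidb", "data_url": "https://www.scidb.cn"},
    {"id": "china-iap", "data_url": "iap.ac.cn"},
]
print(find_url_overlaps(existing, candidates))
```

Comparing normalized hostnames rather than raw strings means `https://www.scidb.cn` and `scidb.cn` are treated as the same site, which is the case flagged above.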
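③ Content review of the remaining 3
- china-catas (Chinese Academy of Tropical Agricultural Sciences) 🌴
- china-iap (Institute of Atmospheric Physics) 🌤️: climate data
- china-plant-csdb (plant database) 🌿: biodiversity
Fix: scidb must be removed, and the blacklist status of cass must be confirmed.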
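Whether cass.ac.cn matches the blacklisted cass.cn can be checked mechanically. Below is a minimal sketch, assuming the blacklist is a plain set of domain names (its real format is not shown in this PR) and that an entry should also cover its subdomains.

```python
def is_blacklisted(host: str, blacklist: set[str]) -> bool:
    """True if host equals a blacklisted domain or is a subdomain of one."""
    host = host.lower().rstrip(".")
    return any(host == bad or host.endswith("." + bad) for bad in blacklist)


BLACKLIST = {"cass.cn"}  # assumed contents, based on the review comment

print(is_blacklisted("cass.cn", BLACKLIST))      # True
print(is_blacklisted("www.cass.cn", BLACKLIST))  # True
print(is_blacklisted("cass.ac.cn", BLACKLIST))   # False
```

A string comparison only shows that the hostnames differ. Whether a blacklist entry should follow an organization to a new domain remains a policy decision for the reviewers.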
firstdata-dev (Collaborator, Author) commented on Apr 22, 2026:
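🔴 Two issues:
- china-scidb (scidb.cn) appears for the second time! PR #165 already flagged it as a duplicate: the data_url of china-cas is scidb.cn. The cron job's cross-PR deduplication is not working.
- china-cass (Chinese Academy of Social Sciences, cass.ac.cn): CASS was removed before (blacklist). Need to confirm whether it has since been taken off the blacklist.
The remaining 3 ✅:
- china-catas (Chinese Academy of Tropical Agricultural Sciences, catas.cn) 🌴
- china-iap (Institute of Atmospheric Physics, iap.ac.cn) 🌤️
- china-plant-csdb (China Plant Thematic Database, plant.csdb.cn) 🌱
Merge after removing scidb and confirming the blacklist status of cass.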
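Cross-PR deduplication of the kind mentioned above could look roughly like this. It is a sketch under assumptions: the input is a hypothetical mapping from open PR number to the source IDs each PR adds, and how the cron job actually gathers that list is not shown here.

```python
from collections import defaultdict


def cross_pr_duplicates(open_prs: dict[int, list[str]]) -> dict[str, list[int]]:
    """Map each source ID that appears in more than one open PR to those PR numbers."""
    where: dict[str, set[int]] = defaultdict(set)
    for pr_number, source_ids in open_prs.items():
        for source_id in source_ids:
            where[source_id].add(pr_number)
    return {sid: sorted(prs) for sid, prs in where.items() if len(prs) > 1}


# Only the ID named in the review is listed for PR #165; its other contents are not shown here.
open_prs = {
    165: ["china-scidb"],
    167: ["china-cass", "china-catas", "china-iap", "china-scidb", "china-plant-csdb"],
}
print(cross_pr_duplicates(open_prs))  # {'china-scidb': [165, 167]}
```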
mingcha-dev (Contributor) reviewed on Apr 22, 2026:
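🔍 Mingcha QA (明察): PR #167 (5 sources)
🔴 china-scidb duplicates PR #165!
china-scidb (Science Data Bank) was already submitted in PR #165. This is a cross-PR duplicate and must be removed.
→ This is the first PR since the cron template upgrade. Did the open-PR deduplication not take effect?
③ URL verification
| Source | data_url | Status |
|---|---|---|
| china-cass (Chinese Academy of Social Sciences) | cass.ac.cn/list/zw_sscp.html | 200 ✅ (note: the old cass.cn returned 403 three times; switching to the cass.ac.cn domain is correct) |
| china-catas (Chinese Academy of Tropical Agricultural Sciences) | catas.cn | 200 ✅ |
| china-iap (Institute of Atmospheric Physics) | iap.ac.cn | 200 ✅ |
| china-plant-csdb (plant database) | plant.csdb.cn | 200 ✅ |
| china-scidb (Science Data Bank) | scidb.cn | |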
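The status column in a table like this can be produced with a small probe. The sketch below uses only the Python standard library and is not the tool the reviewer ran. The function name, the `opener` parameter (there so the function can be exercised without network access), and the convention of returning 0 when no HTTP response arrives are all assumptions.

```python
import urllib.error
import urllib.request


def http_status(url: str, timeout: float = 10.0, opener=urllib.request.urlopen) -> int:
    """Return the HTTP status code for url, or 0 if no HTTP response was received."""
    request = urllib.request.Request(
        url, method="HEAD", headers={"User-Agent": "url-check-sketch"}
    )
    try:
        with opener(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        return err.code  # the server answered with an error status, e.g. 403
    except (urllib.error.URLError, TimeoutError, OSError):
        return 0  # DNS failure, refused connection, timeout, and similar


if __name__ == "__main__":
    for url in ("https://catas.cn", "https://iap.ac.cn"):
        print(url, http_status(url))
```

Some servers reject `HEAD` requests, so a real check may need to fall back to `GET` before reporting a failure.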
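Fix items
- Remove china-scidb (duplicate of PR #165)
cass.ac.cn is reachable ✅ and more reliable than the earlier cass.cn (403 site-wide). The other 4 sources all pass.
Will approve once the duplicate is removed. Not merging.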
- Remove china-scidb: already exists in PR #165 branch (cross-batch duplicate)
- Add china-ioz: Institute of Zoology, CAS (ioz.ac.cn, 200)
mingcha-dev (Contributor) approved these changes on Apr 22, 2026:
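🔍 Mingcha QA (明察): PR #167 re-check
scidb removed → replaced with ioz (Institute of Zoology, CAS) ✅
ioz HTTP 200 ✅ (HTTPS returns 000 because the proxy blocks 198.18.x; HTTP is acceptable)
cass.ac.cn confirmed ✅ (a different domain from the blacklisted cass.cn)
Passed. Not merging.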
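The HTTPS-then-HTTP acceptance rule applied to ioz can be sketched as below. Everything here is illustrative: `fetch_status` is a hypothetical callable that returns an HTTP status code, or 0 when no response arrives (the `000` above is likely what curl prints in that case), and the rule that either scheme returning 200 is acceptable is inferred from the re-check comment.

```python
from typing import Callable


def probe_with_fallback(host: str, fetch_status: Callable[[str], int]) -> tuple[str, int]:
    """Try https:// first, then http://.

    Returns (url, status) for the first attempt that yields 200,
    or for the last attempt if neither does.
    """
    result = ("", 0)
    for scheme in ("https", "http"):
        url = f"{scheme}://{host}"
        result = (url, fetch_status(url))
        if result[1] == 200:
            break
    return result


def fake_fetch(url: str) -> int:
    """Stand-in fetcher that mimics the situation described for ioz.ac.cn."""
    return 200 if url.startswith("http://") else 0


print(probe_with_fallback("ioz.ac.cn", fake_fetch))  # ('http://ioz.ac.cn', 200)
```

Passing the fetcher in as a parameter keeps the fallback logic separate from the network code, so it can be exercised without a proxy or live site.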
9 tasks
Add 5 authoritative Chinese data sources
Data source list
Verification checklist
- make check passes (509 sources, 0 errors)
- api_docs field