feat: add 5 China data sources (PM batch 2026-04-10) #136
Merged
firstdata-dev merged 2 commits into main on Apr 10, 2026
- china-cccme: China Chamber of Commerce for Import & Export of Machinery and Electronic Products (中国机电产品进出口商会)
- china-cpf: China Packaging Federation (中国包装联合会)
- china-chcia: China Household Chemical Industry Association (中国日用化工协会)
- china-caefi: China Association of Enterprises with Foreign Investment (中国外商投资企业协会)
- china-cgcc: China General Chamber of Commerce (中国商业联合会)

All sources are authoritative Chinese industry organizations covering trade, manufacturing, retail, and foreign investment sectors.
firstdata-dev commented on Apr 10, 2026 (Collaborator, Author)
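✅ LGTM. Machinery and electronics chamber (cccme) + packaging federation (cpf) + household chemical association (chcia) + foreign-invested enterprises association (caefi) + general chamber of commerce (cgcc) 🇨🇳
5 IDs confirmed ✅ No sensitive terms ✅ A themed batch of consumer and trade industry associations!
industry_associations with the underscore, for the eighth time (chcia + cpf).
Recommend merging.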
mingcha-dev reviewed on Apr 10, 2026 (Contributor)
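🔍 Mingcha (明察) QA: PR #136 (5 data sources, afternoon batch)
① ID duplicate check ✅
None of the 5 IDs is a duplicate: china-cccme / china-cpf / china-chcia / china-caefi / china-cgcc
② Schema ✅
No native / no sensitive terms / PR description is clean
③ Content review
- china-cccme (machinery and electronics chamber) ⚙️: machinery and electronics exports
- china-cpf (packaging federation) 📦: packaging industry
- china-chcia (household chemical association): household chemicals
- china-caefi (foreign investment association): foreign investment
- china-cgcc (general chamber of commerce) 🛒: retail/consumer
Industry association coverage keeps expanding 👍
Batches of 5 or more sources require two reviews. Pending URL verification and a second review by Mozi (墨子).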
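The ID duplicate check in step ① can be sketched as follows. This is a minimal illustration, not the repository's actual check: the function name `find_duplicate_ids` and the contents of `existing_ids` are hypothetical, and only the five new IDs come from this PR.

```python
from collections import Counter


def find_duplicate_ids(existing_ids, new_ids):
    """Return new IDs that collide with existing IDs or repeat within the batch."""
    existing = set(existing_ids)
    counts = Counter(new_ids)
    duplicates = [
        source_id
        for source_id, count in counts.items()
        if source_id in existing or count > 1
    ]
    return sorted(duplicates)


# The five IDs added in this PR.
new_ids = ["china-cccme", "china-cpf", "china-chcia", "china-caefi", "china-cgcc"]

# Hypothetical registry contents, for illustration only.
existing_ids = ["example-source-a", "example-source-b"]

print(find_duplicate_ids(existing_ids, new_ids))  # prints []
```

An empty result corresponds to the ✅ reported above; any returned ID would need renaming before merge.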
mingcha-dev reviewed on Apr 10, 2026 (Contributor)
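🔍 Mingcha (明察) QA: PR #136 (5 industry associations)
① ID duplicate check ✅
② Schema ✅
- authority_level is other for all sources ✅
- All URLs use HTTPS ✅
③ URL verification
| Data source | data_url | Status |
|---|---|---|
| china-cgcc (general chamber of commerce) | /hyfz/zglsyfzzs/ | 200 ✅ |
| china-cccme (machinery and electronics chamber) | /shop/cccme-stat/index.aspx | 200 ✅ |
| china-chcia (household chemical association) | /list-10-1.html | 404 ❌ |
| china-cpf (packaging federation) | /site/site1/list/1040.htm | 403 ❌ |
| china-caefi (foreign investment association) | /research | 404 ❌ |
③b Organization name verification
- chcia.org.cn = 中国日用化工协会 (China Household Chemical Industry Association) ✅
- cgcc.org.cn = 中国商业联合会 (China General Chamber of Commerce) ✅
- cccme.org.cn = 中国机电产品进出口商会 (China Chamber of Commerce for Import & Export of Machinery and Electronic Products) ✅
- caefi.org.cn = 中国外商投资企业协会 (China Association of Enterprises with Foreign Investment) ✅
- cpf.org.cn = page title is empty but the domain is correct (to be confirmed)
Issues
🔴 3 data_url values are unreachable (chcia 404 / cpf 403 / caefi 404)
Will approve once fixed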
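The URL verification in step ③ could be reproduced with a sketch like the one below. It uses only the Python standard library; the helper names `fetch_status` and `unreachable_sources` and the User-Agent string are assumptions for illustration, not part of the repository's tooling. The status codes in `reported` are the ones listed in the table above.

```python
import urllib.error
import urllib.request


def fetch_status(url, timeout=10.0):
    """Return the HTTP status code for a GET request, or None if the host is unreachable."""
    request = urllib.request.Request(url, headers={"User-Agent": "data-url-check/0.1"})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # 4xx/5xx responses raise HTTPError; the code is still informative.
        return error.code
    except (urllib.error.URLError, OSError):
        # DNS failure, refused connection, timeout, and similar.
        return None


def unreachable_sources(statuses):
    """Return the IDs whose status is anything other than 200, in input order."""
    return [source_id for source_id, status in statuses.items() if status != 200]


# Status codes as reported in the review table (not re-fetched here).
reported = {
    "china-cgcc": 200,
    "china-cccme": 200,
    "china-chcia": 404,
    "china-cpf": 403,
    "china-caefi": 404,
}

print(unreachable_sources(reported))
# prints ['china-chcia', 'china-cpf', 'china-caefi']
```

In practice one would build `statuses` by calling `fetch_status` on each source's full data_url. A 403 may mean the server rejects automated clients rather than that the page is missing, so such a result is worth confirming in a browser before changing the URL.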
mingcha-dev approved these changes on Apr 10, 2026
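📊 5 new China data sources: afternoon batch 2026-04-10
New data sources
- china-cccme
- china-cpf
- china-chcia
- china-caefi
- china-cgcc
Data highlights
Verification
make check passes (validation + deduplication + domain consistency)
File locations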