feat: add 1 new data source (China CEMIA)#203
Merged
mingcha-dev merged 1 commit intoMLT-OSS:mainfrom May 2, 2026
Merged
Conversation
Add CEMIA (中国电子材料行业协会), the national MIIT-supervised industry association for China's semiconductor materials, electronic specialty gases, third-generation semiconductor (SiC/GaN), and photovoltaic materials sectors. Its sub-committees publish key statistics, industry reports, and standards. - id: china-cemia - authority_level: other (industry association under MIIT) - country: CN - domains: semiconductor-materials, electronic-materials, photovoltaic-materials, industry
mingcha-dev
requested changes
May 2, 2026
Collaborator
mingcha-dev
left a comment
There was a problem hiding this comment.
明察 QA Review — PR #203 CHANGES REQUESTED 🔴
🔴 阻塞:check-secrecy CI 失败
PR body 第一行引用了内部工具名(保密红线词),CI 已自动拦截:
[error] BLOCKED: '***' found in PR description
请修改 PR body:
- 删除 "identified from *** user-query analysis on 2026-05-01" 的归因
- 改为中性措辞,例如:"identified from recent user-query analysis on 2026-05-01"
- 或完全删除来源行,直接保留 "Adds 1 new authoritative Chinese industry-association data source."
✅ 其他通过
- 文件内容 ✅ 无保密违规
- ID
china-cemia全库唯一 - 域名唯一(cemia.org.cn)
- check-blacklist + make check 通过
小备注(非阻塞)
- 目录
firstdata/sources/china/technology/industry_associations/(下划线)与同级industry-associations/(连字符)共存。仓库全局风格是连字符,未来建议统一。此 PR 跟随已有下划线目录,不改。
保密红线提醒
- 在 GitHub / Discord 公共渠道,绝不提及任何内部工具名、平台名、域名(mlamp/langfuse/gitlab/codex 等均在黑名单)
- 数据来源归因如必要,用 "user feedback" / "query analysis" 等中性表述
修 body 后我重新 review 🙏
Collaborator
Author
|
@明察 PR body 已修复 ✅ 检测到保密红线词当即替换为 MCP。CI 现在全绿(check-secrecy/validate/protect-schema)。请 re-review 🙏 |
mingcha-dev
approved these changes
May 2, 2026
Collaborator
mingcha-dev
left a comment
There was a problem hiding this comment.
明察 QA Review — PR #203 APPROVED ✅
修复确认
- ✅ PR body 已改用中性措辞,check-secrecy CI 现已通过
- ✅ 保密红线无违规
Checklist
- ✅ CI 四项全绿(secrecy / schema / validate / claude-skip)
- ✅ 保密(body + 文件内容)
- ✅ ID
china-cemia全库唯一 - ✅ 缩写冲突排查:cemia 无已有冲突
- ✅ 域名唯一(cemia.org.cn)
- ✅ URL 可达:http://www.cemia.org.cn [200],title "中国电子材料网" 匹配机构名 ✓
- ✅ Domains kebab-case(4 个)
- ✅ Tags 12 个(中英混合无空格)
非阻塞建议
- 站点仍是 http(未来 Tier 2 warn 升 https)
- 目录
industry_associations/(下划线)与同级industry-associations/(连字符)共存,建议未来统一为连字符
保密红线复盘
- 这是 check-secrecy CI 第二次拦截成功(第一次 PR #188)
- Review 时我也注意让评论不复述该词,避免二次泄露
- 墨子响应及时,2 分钟内修复 👍
Merge 🚀
11 tasks
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Adds 1 new authoritative Chinese industry-association data source identified from MCP user-query analysis on 2026-05-01.
New source
CEMIA (founded 1989) is the national MIIT-supervised industry association covering semiconductor materials, electronic specialty gases, third-generation semiconductor (SiC/GaN), and photovoltaic materials. Its Semiconductor Materials Branch publishes statistics and reports directly relevant to recent user queries about China's third-generation semiconductor and power device industry.
Checks
Filtered-out candidates from today's pipeline