mcp-data-platform-v0.8.0
Release Notes: v0.8.0
Highlights
Lineage-Aware Semantic Enrichment - Downstream datasets now automatically inherit column metadata from upstream sources via DataHub lineage. Query an undocumented Elasticsearch index and get descriptions, glossary terms, and tags from its Cassandra source table.
New Features
Lineage-Aware Column Inheritance
When describing a table with undocumented columns, the platform now:
- Queries DataHub for upstream lineage
- Fetches schemas from upstream datasets
- Matches columns and inherits metadata
- Returns enriched response with provenance tracking
Configuration:
semantic:
provider: datahub
instance: primary
lineage:
enabled: true
max_hops: 2
inherit:
- glossary_terms
- descriptions
- tags
prefer_column_lineage: true
column_transforms:
- strip_prefix: "rxtxmsg.payload."
aliases:
- source: "cassandra.prod_fuse.system_sale"
targets:
- "elasticsearch.default.jakes-sale-*"Response format:
{
"column_context": {
"amount": {
"description": "Net sale amount",
"glossary_terms": [{"name": "Net Sale Amount"}],
"inherited_from": {
"source_dataset": "urn:li:dataset:cassandra.system_sale",
"source_column": "initial_net",
"hops": 1,
"match_method": "name_transformed"
}
}
},
"inheritance_sources": ["urn:li:dataset:cassandra.system_sale"]
}Match methods:
| Method | Description |
|---|---|
column_lineage |
DataHub column-level lineage edges |
name_exact |
Exact column name match |
name_transformed |
Match after strip prefix/suffix |
alias |
Explicit alias configuration |
Dependencies
- Requires
mcp-datahubv0.4.0 (forGetColumnLineageandGetSchemasmethods)
Breaking Changes
None. Lineage inheritance is opt-in (lineage.enabled: false by default).
Dependency Updates
| Dependency | From | To |
|---|---|---|
| Alpine (Docker) | 3.21 | 3.23 |
| actions/setup-python | 6.1.0 | 6.2.0 |
| actions/checkout | 6.0.1 | 6.0.2 |
| anchore/sbom-action | 0.21.1 | 0.22.0 |
| github/codeql-action | 4.31.10 | 4.31.11 |
Documentation
- Updated README with new banner image
- Added lineage inheritance documentation
- Updated llms.txt and llms-full.txt per llmstxt.org spec
Contributors
- Lineage-Aware Semantic Enrichment (#26)
- Docs/README updates (#19, #20)
- Dependency updates via Dependabot (#21-25)
Changelog
Others
- 5addde9: Add documentation for automatic column metadata inheritance via DataHub lineage (@cjimti)
- fc5259f: Docs/readme updates (#19) (@cjimti)
- f486db3: Document lineage-aware column metadata inheritance in DataHub, including configuration options, match methods, and use cases. (@cjimti)
- b20b97d: Document lineage-aware inheritance in DataHub, including configuration, metadata context, and provenance tracking. (@cjimti)
- 415e7e7: Lineage-Aware Semantic Enrichment (#26) (@cjimti)
- 2c667c1: Update README to replace logo with banner image and add the new banner file (#20) (@cjimti)
- f55c092: ci: bump actions/checkout from 6.0.1 to 6.0.2 (#24) (@dependabot[bot])
- 32f7e98: ci: bump actions/setup-python from 6.1.0 to 6.2.0 (#22) (@dependabot[bot])
- 34b4353: ci: bump anchore/sbom-action from 0.21.1 to 0.22.0 (#23) (@dependabot[bot])
- 13b2b15: ci: bump github/codeql-action from 4.31.10 to 4.31.11 (#25) (@dependabot[bot])
- afea984: docker: bump alpine from 3.21 to 3.23 (#21) (@dependabot[bot])
Installation
Homebrew (macOS)
brew install txn2/tap/mcp-data-platformClaude Code CLI
claude mcp add mcp-data-platform -- mcp-data-platformDocker
docker pull ghcr.io/txn2/mcp-data-platform:v0.8.0Verification
All release artifacts are signed with Cosign. Verify with:
cosign verify-blob --bundle mcp-data-platform_0.8.0_linux_amd64.tar.gz.sigstore.json \
mcp-data-platform_0.8.0_linux_amd64.tar.gz