Skip to content

mcp-data-platform-v0.8.0

Choose a tag to compare

@github-actions github-actions released this 27 Jan 09:12
· 423 commits to main since this release
f486db3

Release Notes: v0.8.0

Highlights

Lineage-Aware Semantic Enrichment - Downstream datasets now automatically inherit column metadata from upstream sources via DataHub lineage. Query an undocumented Elasticsearch index and get descriptions, glossary terms, and tags from its Cassandra source table.

New Features

Lineage-Aware Column Inheritance

When describing a table with undocumented columns, the platform now:

  1. Queries DataHub for upstream lineage
  2. Fetches schemas from upstream datasets
  3. Matches columns and inherits metadata
  4. Returns enriched response with provenance tracking

Configuration:

semantic:
  provider: datahub
  instance: primary

  lineage:
    enabled: true
    max_hops: 2
    inherit:
      - glossary_terms
      - descriptions
      - tags
    prefer_column_lineage: true

    column_transforms:
      - strip_prefix: "rxtxmsg.payload."

    aliases:
      - source: "cassandra.prod_fuse.system_sale"
        targets:
          - "elasticsearch.default.jakes-sale-*"

Response format:

{
  "column_context": {
    "amount": {
      "description": "Net sale amount",
      "glossary_terms": [{"name": "Net Sale Amount"}],
      "inherited_from": {
        "source_dataset": "urn:li:dataset:cassandra.system_sale",
        "source_column": "initial_net",
        "hops": 1,
        "match_method": "name_transformed"
      }
    }
  },
  "inheritance_sources": ["urn:li:dataset:cassandra.system_sale"]
}

Match methods:

Method Description
column_lineage DataHub column-level lineage edges
name_exact Exact column name match
name_transformed Match after strip prefix/suffix
alias Explicit alias configuration

Dependencies

  • Requires mcp-datahub v0.4.0 (for GetColumnLineage and GetSchemas methods)

Breaking Changes

None. Lineage inheritance is opt-in (lineage.enabled: false by default).

Dependency Updates

Dependency From To
Alpine (Docker) 3.21 3.23
actions/setup-python 6.1.0 6.2.0
actions/checkout 6.0.1 6.0.2
anchore/sbom-action 0.21.1 0.22.0
github/codeql-action 4.31.10 4.31.11

Documentation

  • Updated README with new banner image
  • Added lineage inheritance documentation
  • Updated llms.txt and llms-full.txt per llmstxt.org spec

Contributors

  • Lineage-Aware Semantic Enrichment (#26)
  • Docs/README updates (#19, #20)
  • Dependency updates via Dependabot (#21-25)

Changelog

Others

  • 5addde9: Add documentation for automatic column metadata inheritance via DataHub lineage (@cjimti)
  • fc5259f: Docs/readme updates (#19) (@cjimti)
  • f486db3: Document lineage-aware column metadata inheritance in DataHub, including configuration options, match methods, and use cases. (@cjimti)
  • b20b97d: Document lineage-aware inheritance in DataHub, including configuration, metadata context, and provenance tracking. (@cjimti)
  • 415e7e7: Lineage-Aware Semantic Enrichment (#26) (@cjimti)
  • 2c667c1: Update README to replace logo with banner image and add the new banner file (#20) (@cjimti)
  • f55c092: ci: bump actions/checkout from 6.0.1 to 6.0.2 (#24) (@dependabot[bot])
  • 32f7e98: ci: bump actions/setup-python from 6.1.0 to 6.2.0 (#22) (@dependabot[bot])
  • 34b4353: ci: bump anchore/sbom-action from 0.21.1 to 0.22.0 (#23) (@dependabot[bot])
  • 13b2b15: ci: bump github/codeql-action from 4.31.10 to 4.31.11 (#25) (@dependabot[bot])
  • afea984: docker: bump alpine from 3.21 to 3.23 (#21) (@dependabot[bot])

Installation

Homebrew (macOS)

brew install txn2/tap/mcp-data-platform

Claude Code CLI

claude mcp add mcp-data-platform -- mcp-data-platform

Docker

docker pull ghcr.io/txn2/mcp-data-platform:v0.8.0

Verification

All release artifacts are signed with Cosign. Verify with:

cosign verify-blob --bundle mcp-data-platform_0.8.0_linux_amd64.tar.gz.sigstore.json \
  mcp-data-platform_0.8.0_linux_amd64.tar.gz