GitHub - zillow/intake-nested-yaml-catalog: Supports a single YAML file hierarchical catalog to organize datasets and avoid a data swamp.

https://travis-ci.org/zillow/intake-nested-yaml-catalog.svg?branch=master

https://coveralls.io/repos/github/zillow/intake-nested-yaml-catalog/badge.svg?branch=master

Welcome to Intake plugin for nested YAML catalogs

This is an Intake plugin supporting a single YAML hierarchical catalog to organize datasets and avoid a data swamp.

Example of organizing the datasets by business domain entities:

metadata:
  hierarchical_catalog: true
entity:
  customer:
    customer_attributes:
      args:
        urlpath: s3://foo
      driver: parquet
  user:
    user_profile:
      args:
        urlpath: s3://foo
      driver: parquet

Can be accessed as:

df = catalog.entity.customer.customer_attributes.read()

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
docs		docs
intake_nested_yaml_catalog		intake_nested_yaml_catalog
.coveragerc		.coveragerc
.flake8		.flake8
.gitignore		.gitignore
.travis.yml		.travis.yml
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.rst		README.rst
setup.py		setup.py
ubuild.py		ubuild.py
uranium		uranium
versions.yaml		versions.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Welcome to Intake plugin for nested YAML catalogs

About

Releases

Packages

Languages

License

zillow/intake-nested-yaml-catalog

Folders and files

Latest commit

History

Repository files navigation

Welcome to Intake plugin for nested YAML catalogs

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages