A general python package for describing, loading and processing data
Taking the pain out of data access and distribution
Intake is an open-source package to:
- describe your data declaratively
- gather data sets into catalogs
- search catalogs and services to find the right data you need
- load, transform and output data in many formats
- work with third party remote storage and compute platforms
Documentation is available at Read the Docs.
Please report issues at https://github.com/intake/intake/issues
Recommended method using conda:
conda install -c conda-forge intake
You can also install using pip
, in which case you have a choice as to how many of the optional
dependencies you install, with the simplest having least requirements
pip install intake
Note that you may well need specific drivers and other plugins, which usually have additional dependencies of their own.
- Create development Python environment with the required dependencies, ideally with
conda
. The requirements can be found in the yml files in thescripts/ci/
directory of this repo.- e.g.
conda env create -f scripts/ci/environment-py311.yml
and thenconda activate test_env
- e.g.
- Install intake using
pip install -e .
- Use
pytest
to run tests. - Create a fork on github to be able to submit PRs.
- We respect, but do not enforce, pep8 standards; all new code should be covered by tests.