Intake-avro

The Apache Avro format is a popular method for information exchange, particularly for message and reord distribution in high-throughput systems.

In Intake, there are two plugins provided for reading avro data:

avro_table is appropriate for data which have a schema simple enough to be represented as columns with simple types, i.e., flat schema. The output of this plugin are dataframes, and the reading is optimized for speed.
avro_sequence is more generic and can handle all possible avro schema, but it produces generic python sequences of dictionaries, and is consequently much slower than the table reader.

The conda install instructions are:

conda install -c conda-forge fastavro uavro
conda install -c intake intake_avro

See the notebook in the examples/ directory.

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
conda		conda
docs		docs
examples		examples
intake_avro		intake_avro
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
readthedocs.yml		readthedocs.yml
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py
versioneer.py		versioneer.py

intake/intake-avro