An Avro file reader/writer for Python.
>>> import pyavroc >>> with open('myfile.avro') as fp: >>> reader = pyavroc.AvroFileReader(fp, types=True) >>> for record in reader: >>> print record
Comparison with original Avro Python API
pyavroc is a Python API on top of upstream Avro-C. This means it reads about 40 times faster than Avro's Python implementation. (The exact timings depend on the version of Python used).
|Name||Description||Relative speed (bigger is better)|
|python-avro||Avro's implementation (pure Python)||1|
|fastavro||python-avro improved, using Cython||10|
|pyavroc||Python/C API on upstream Avro-C||40|
Building the module
You will need to build Avro-C. This is available at https://github.com/apache/avro.git.
Then you can build pyavroc, linking against the Avro-C shared library.
The pyavroc repository contains the script
clone_avro_and_build.sh which automates this process:
Installing the module
- Make sure cmake is available (eg.
apt-get install cmake).
- This generates a build folder
python setup.py bdist_wheel -d buildmakes an executable wheel of the project and saves in the build/ folder
pip install pyavroc-0.7.2-cp36-cp36m-linux_x86_64.whl
pyavroc supports writing, both for records created as dictionaries, and for records created as Python objects.
More examples are available in the
Copyright 2015 Byhiras (Europe) Limited
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at:
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.