Skip to content

deepakagrawal/fastpandavro

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Fastpandavro

Converts avro file to pandas dataframe in parallel

Requirements

  • fastavro
  • joblib
  • multiporcessing
  • pandas

Installation Instruction

  • Downlowd the repository and Extract to any location
  • Go to the folder and run following command
# with admin rights
python setup.py install

# without admin rights
python setup.py install --user

How to convert Avro file to Pandas dataframe

import fastpandavro as fpa
import pandas as pd
df = fpa.avro_to_pandas(fname="test/sample.avro")
print(df.shape)

How to write pandas dataframe to avro file

fpa.pandas_to_avro(df, "output.avro", schema_file="test/schema.avsc")

About

Fast conversion of avro files to pandas dataframe

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages