Parquet is high-ratio compressed file format wildely used in bigdata systems.
This repo is a parquet handler library which features the function like convert big parquet to small ones so you can use pandas to convert the small parquets to csv files. When the parquet is too big to directly transform into a CSV using pandas, you can use the compromise method above.
splitParquet2csv:split a parquet by columns into several small CSV files.
mergeCSV: Merge all the csv-format files into a CSV file.
python3.5
pip install pandas pyarrow