A proof of concept demo to transform CSV to Arrow in JS using csvtojson and the new Arrow Builders

trxcllnt/csv-to-arrow-js

csv-to-arrow-js

A proof of concept demo to transform CSV to Arrow in JS using csvtojson and the new Arrow Builder stream APIs.

See this PR for more information: apache/arrow#4476
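To make the row-to-column idea concrete, here is a minimal sketch in plain JavaScript. It is not the code in this repository and it uses neither csvtojson nor the Arrow Builder APIs: `parseRows` and `toColumnBatches` are hypothetical helpers, the CSV parsing is a naive split on commas (no quoting support), and every row is assumed to have the same columns. It only illustrates how parsed rows can be regrouped into fixed-size, column-oriented batches, which is roughly the shape of data a RecordBatch stream carries.

```javascript
// Hypothetical helper: naive CSV parsing (no quoted fields, no escapes).
// The first line is treated as the header row.
function parseRows(csvText) {
  const [headerLine, ...lines] = csvText.trim().split(/\r?\n/);
  const names = headerLine.split(',');
  return lines.map((line) => {
    const cells = line.split(',');
    return Object.fromEntries(names.map((name, i) => [name, cells[i]]));
  });
}

// Hypothetical helper: regroup row objects into column-oriented batches.
// Yields one batch every `batchSize` rows, plus a final partial batch.
function* toColumnBatches(rows, batchSize) {
  let columns = {};
  let length = 0;
  for (const row of rows) {
    for (const [name, value] of Object.entries(row)) {
      if (!columns[name]) columns[name] = [];
      columns[name].push(value);
    }
    length += 1;
    if (length === batchSize) {
      yield { length, columns };
      columns = {};
      length = 0;
    }
  }
  if (length > 0) yield { length, columns };
}

// Usage: three rows split into batches of at most two rows each.
for (const batch of toColumnBatches(parseRows('a,b\n1,x\n2,y\n3,z'), 2)) {
  console.log(batch);
}
```

The generator keeps only one batch in memory at a time, which is the same reason a streaming approach suits a large input such as big.csv.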

Cloning

This demo uses the file big.csv from the PapaParse examples. I've committed this file with Git Large File Storage (git-lfs).

After installing git-lfs, run these commands to clone the repository and pull the large files:

git clone https://github.com/trxcllnt/csv-to-arrow-js.git
cd csv-to-arrow-js
git lfs pull

If you can't install git-lfs, you can download the "Large file" from the PapaParse demo page.
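If the clone was made without `git lfs pull`, a file tracked by git-lfs is checked out as a small text pointer whose first line is `version https://git-lfs.github.com/spec/v1`, not as the real data. The following sketch checks for that before running the demo. The helper names `looksLikeLfsPointer` and `fileIsLfsPointer` are hypothetical, and the path passed in the usage comment assumes big.csv sits in the repository root.

```javascript
const fs = require('fs');

// First line of every Git LFS pointer file.
const LFS_POINTER_PREFIX = 'version https://git-lfs.github.com/spec/v1';

// Hypothetical helper: true if the text starts like an LFS pointer.
function looksLikeLfsPointer(text) {
  return text.startsWith(LFS_POINTER_PREFIX);
}

// Hypothetical helper: read only the first few bytes of a file, so a
// large CSV is never loaded into memory just to run this check.
function fileIsLfsPointer(path) {
  const fd = fs.openSync(path, 'r');
  try {
    const buffer = Buffer.alloc(LFS_POINTER_PREFIX.length);
    const bytesRead = fs.readSync(fd, buffer, 0, buffer.length, 0);
    return looksLikeLfsPointer(buffer.toString('utf8', 0, bytesRead));
  } finally {
    fs.closeSync(fd);
  }
}

// Usage (path is an assumption):
// if (fileIsLfsPointer('big.csv')) console.error('Run `git lfs pull` first.');
```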

Running

# install the dependencies
npm install
# run the demo to convert big.csv to an Arrow RecordBatch stream.
# The Arrow table is printed to the console with the `arrow2csv` utility
npm start
# or time how long it takes
time node index.js > big.arrow
