Combine multiple XLS/X sheets into a single CSV
JavaScript Shell
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
bin
test
.gitignore
.travis.yml
LICENSE
README.md
index.js
package.json

README.md

sheetstack

Sheetstack is a command line utility that merges multiple XLS/X sheets into a single CSV.

It's a simple extension of J useful for processing files with the exact same layout split into several worksheets.

Sheetstack adds a grouping column to the resulting CSV. By default this is the name of the sheet, but it could be anything.

Install

npm install sheetstack

Usage

Let's say we have an xls file with two sheets, "dogs" and "cats":

name,best friend
Pluto,Mickey
Santa's Little Helper,Bart
Scooby Doo,Shaggy
name,best friend
Cat in the Hat,the fish
Garfield,Jon
Hello Kitty,you

The simplest use will combine all the files and output the result to stdout.

$ sheetstack file.xls

sheet,name,best friend
dogs,Pluto,Mickey
dogs,Santa's Little Helper,Bart
dogs,Scooby Doo,Shaggy
cats,Cat in the Hat,the fish
cats,Garfield,Jon
cats,Hello Kitty,you

Sheets

The --sheets option controls which sheets are included, and in what order.

$ sheetstack --sheets dog,cat file.xls

sheet,name,best friend
dogs,Pluto,Mickey
dogs,Santa's Little Helper,Bart
dogs,Scooby Doo,Shaggy

Groups

The --groups option specifies custom values for the grouping column, --group-name sets the value for the top of the column.

$ sheetstack --groups canis,felis --group-name genus

species,name,best friend
canis,Pluto,Mickey
canis,Santa's Little Helper,Bart
canis,Scooby Doo,Shaggy
felis,Cat in the Hat,the fish
felis,Garfield,Jon
felis,Hello Kitty,you

Removing leading lines

By default, sheetstack removes the first line from all sheets except for the first one. The can be changed with the --rm-lines setting. If --rm-lines is set to 0, no lines will be removed. Higher values will remove more lines, but no lines will be removed from the first sheet.

Output format

You can also specify row-separator and field-separator options, which are passed through to J:

$ sheetstack --row-sep '\r\n' --field-sep ;