Extract columnar data from tabulated fixed-width text
Ruby
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Failed to load latest commit information.
lib
test
.gitignore
COPYING.txt
README.md
detabulator.gemspec

README.md

Detabulator

Extract columnar data from tabulated fixed-width text.

Example

Given some tabulated data:

sample = <<END
            Column 1    Column 2
Drinks      Beer        Whiskey
Not drinks  Toothpaste  Mouthwash
END

This:

require "detabulator"
Detabulator.new.detabulate(sample)

Will produce this:

[["", "Column 1", "Column 2"],
 ["Drinks", "Beer", "Whiskey"],
 ["Not drinks", "Toothpaste", "Mouthwash"]] 

This is just a first release that does something useful.

Limitations

  • Double-width characters (e.g. Japanese) are not handled correctly.
  • Combining diacritics are not handled correctly.
  • When one cell contains much longer text and a space, an extra column is generated.