tabula 1.1.0 / tabula-java 0.9.1
We're proud to announce the first official release of Tabula 1.1! This version contains a rewrite of our processing backend which should provide a significant performance increase. The backend rewrite also improves support for RTL languages and fixes many other bugs.
If you have any issues with this version of Tabula, please let us know!
New features / bugfixes
- Tabula now extracts tables up to 7x faster than previous versions.
- Table auto-detection has also been improved. (#456, tabulapdf/tabula-java#56)
- If there’s an error during file upload & initial processing, warn the user. (#433)
- Allow running the jar distribution in "headless" mode. Users for the jar distribution will need to manually open their web browser to the Tabula page (normally http://127.0.0.1:8080/ ).
- Improved support for RTL languages like Hebrew and Arabic. (tabulapdf/tabula-java#66)
- Upgraded to jruby-188.8.131.52, improving encoding support.
- Lots and lots of other improvements — the extraction and processing backend has been completely rewritten!
Known Issues & Caveats
- Note: If you are using the .jar version (for Linux/etc), you now need to open your browser to the Tabula page (http://127.0.0.1:8080/) manually.
- Caveat: Tabula only works on text-based PDFs (ones where you can select text). Scanned documents do not work, and we do not recommend OCR for large files unless you have a data cleaning plan since even state-of-the-art OCR software can have significant error rates.
- _OS X Gatekeeper_: If you’re running Mac OS X 10.8 or later and get a message that says "Tabula can't be opened because it is from an unidentified developer" message, please let us know. (It shouldn't be happening anymore.) See this GateKeeper page for more assistance — your "Allow applications downloaded from" setting should be set to "Mac App Store and identified developers" or "Anywhere."
- Windows: tabula-win-1.1.0c.zip (PGP sig)
- Mac: tabula-mac-1.1.0c.zip (PGP sig)
- Linux/Other: tabula-jar-1.1.0c.zip (PGP sig)
$ shasum -a256 tabula*.zip 4df6dce00f3bf7393684cb832c4c3cf3e2539edb6d62d3a0042330254d593826 tabula-jar-1.1.0c.zip 70ec8a524e881ed66d6048776ed0ceb16a0c8b68d06e1295a39698e836274b04 tabula-mac-1.1.0c.zip 243270c5918229415223794da56dffd8ba102683d195579a7c9f6aa578a2765a tabula-win-1.1.0c.zip
You can also verify that you are downloading an authentic, unmodified version of Tabula using PGP. Files are signed with Mike Tigas’ key (0xA993E7156E0E9923), which is available here, on Keybase, or on most key servers.
.zip of the Tabula version you want and also download the corresponding
.zip.asc, then use the
gpg --verify command, such as:
$ gpg --verify tabula-jar-1.1.0c.zip.asc
You will want the output to contain
Good signature from "Mike Tigas <...>" somewhere in it.