Latest release

1.1.0 Release (1.1.0.16092117)

@mtigas mtigas released this Sep 22, 2016 · 12 commits to master since this release

tabula 1.1.0 / tabula-java 0.9.1

We're proud to announce the first official release of Tabula 1.1! This version contains a rewrite of our processing backend which should provide a significant performance increase. The backend rewrite also improves support for RTL languages and fixes many other bugs.

This page contains technical release notes; please visit the Tabula official homepage for an overview of Tabula and quick download links:
http://tabula.technology/


If you have any issues with this version of Tabula, please let us know!

New features / bugfixes

  • Tabula now extracts tables up to 7x faster than previous versions.
  • Table auto-detection has also been improved. (#456, tabulapdf/tabula-java#56)
  • If there’s an error during file upload & initial processing, warn the user. (#433)
  • Allow running the jar distribution in "headless" mode. Users for the jar distribution will need to manually open their web browser to the Tabula page (normally http://127.0.0.1:8080/ ).
  • Improved support for RTL languages like Hebrew and Arabic. (tabulapdf/tabula-java#66)
  • Upgraded to jruby-9.1.5.0, improving encoding support.
  • Lots and lots of other improvements — the extraction and processing backend has been completely rewritten!

Known Issues & Caveats

  • Note: If you are using the .jar version (for Linux/etc), you now need to open your browser to the Tabula page (http://127.0.0.1:8080/) manually.
  • Caveat: Tabula only works on text-based PDFs (ones where you can select text). Scanned documents do not work, and we do not recommend OCR for large files unless you have a data cleaning plan since even state-of-the-art OCR software can have significant error rates.
  • _OS X Gatekeeper_: If you’re running Mac OS X 10.8 or later and get a message that says "Tabula can't be opened because it is from an unidentified developer" message, please let us know. (It shouldn't be happening anymore.) See this GateKeeper page for more assistance — your "Allow applications downloaded from" setting should be set to "Mac App Store and identified developers" or "Anywhere."

Downloads

Windows & Linux users need to have Java installed to use Tabula. You can download Java here. The Mac version contains an integrated copy of Java.

Verification:

SHA-256

$ shasum -a256 tabula*.zip
4df6dce00f3bf7393684cb832c4c3cf3e2539edb6d62d3a0042330254d593826  tabula-jar-1.1.0c.zip
70ec8a524e881ed66d6048776ed0ceb16a0c8b68d06e1295a39698e836274b04  tabula-mac-1.1.0c.zip
243270c5918229415223794da56dffd8ba102683d195579a7c9f6aa578a2765a  tabula-win-1.1.0c.zip

PGP

You can also verify that you are downloading an authentic, unmodified version of Tabula using PGP. Files are signed with Mike Tigas’ key (0xA993E7156E0E9923), which is available here, on Keybase, or on most key servers.

Download the .zip of the Tabula version you want and also download the corresponding .zip.asc, then use the gpg --verify command, such as:

$ gpg --verify tabula-jar-1.1.0c.zip.asc

You will want the output to contain Good signature from "Mike Tigas <...>" somewhere in it.

Downloads