Skip to content

1.1.0-beta.1

Pre-release
Pre-release
Compare
Choose a tag to compare
@mtigas mtigas released this 31 Mar 16:13
· 721 commits to master since this release
v1.1.0-beta.1
6fe481d

tabula 1.1.0-beta.1 / tabula-java 0.9.0

We're proud to announce the first public beta preview of Tabula 1.1! This version contains several bugfixes and a rewrite of our processing backend which should provide a significant performance increase.

This is a beta release: because Tabula utilizes a new processing backend, users may encounter bugs that were not present in previous versions of Tabula. If you have any issues with this beta version of Tabula, please let us know!

New features / bugfixes

  • Tabula now extracts tables up to 7x faster than previous versions.
  • Table auto-detection has also been improved. (#456, tabulapdf/tabula-java#56)
  • If there’s an error during file upload & initial processing, warn the user. (#433)
  • Lots and lots of other improvements — the extraction and processing backend has been completely rewritten!

Known Issues & Caveats

  • Bug: This version of Tabula always shows that there is an update, even for the same version: "New version! Tabula 1.1.0-beta.1 is available (you have 1.1.0-beta1)".
  • Bug: Although the backend for the Tabula app has been rewritten using tabula-java instead of tabula-extractor, the "script" export option still generates commands to call the older tabula-extractor. To fix this, you may follow the download and example instructions for tabula-java, and replace the tabula command with java -jar tabula-0.9.0-jar-with-dependencies.jar in the generated script. (#484)
  • Caveat: Tabula only works on text-based PDFs (ones where you can select text). Scanned documents do not work, and we do not recommend OCR for large files unless you have a data cleaning plan since even state-of-the-art OCR software can have significant error rates.
  • _OS X Gatekeeper_: If you’re running Mac OS X 10.8 or later and get a message that says "Tabula can't be opened because it is from an unidentified developer" message, please let us know. (It shouldn't be happening anymore.) See this GateKeeper page for more assistance — your "Allow applications downloaded from" setting should be set to "Mac App Store and identified developers" or "Anywhere."

Downloads

Windows & Linux users need to have Java installed to use Tabula. You can download Java here. The Mac version contains an integrated copy of Java.

Verification:

SHA-1

$ shasum -a1 tabula*.zip
647602da17d365f107260afd8a8b5b3e1d687b98  tabula-jar-1.1.0-beta.1a.zip
054e5c2eb68149f3384b56cc615539e4050de479  tabula-mac-1.1.0-beta.1a.zip
bbdca4686b3818e9ec0be5d36e7bc48e944a77d6  tabula-win-1.1.0-beta.1a.zip

PGP

You can also verify that you are downloading an authentic, unmodified version of Tabula using PGP. Files are signed with Mike Tigas’ key (0xA993E7156E0E9923), which is available here, on Keybase, or on most key servers.

Download the .zip of the Tabula version you want and also download the corresponding .zip.asc, then use the gpg --verify command, such as:

$ gpg --verify tabula-jar-1.1.0-beta.1.zip.asc

You will want the output to contain Good signature from "Mike Tigas <...>" somewhere in it.