tabula 1.1.0-beta.1 / tabula-java 0.9.0
We're proud to announce the first public beta preview of Tabula 1.1! This version contains several bugfixes and a rewrite of our processing backend which should provide a significant performance increase.
This is a beta release: because Tabula utilizes a new processing backend, users may encounter bugs that were not present in previous versions of Tabula. If you have any issues with this beta version of Tabula, please let us know!
New features / bugfixes
- Tabula now extracts tables up to 7x faster than previous versions.
- Table auto-detection has also been improved. (#456, tabulapdf/tabula-java#56)
- If there’s an error during file upload & initial processing, warn the user. (#433)
- Lots and lots of other improvements — the extraction and processing backend has been completely rewritten!
Known Issues & Caveats
- Bug: This version of Tabula always shows that there is an update, even for the same version: "New version! Tabula 1.1.0-beta.1 is available (you have 1.1.0-beta1)".
- Bug: Although the backend for the Tabula app has been rewritten using tabula-java instead of tabula-extractor, the "script" export option still generates commands to call the older tabula-extractor. To fix this, you may follow the download and example instructions for tabula-java, and replace the
java -jar tabula-0.9.0-jar-with-dependencies.jarin the generated script. (#484)
- Caveat: Tabula only works on text-based PDFs (ones where you can select text). Scanned documents do not work, and we do not recommend OCR for large files unless you have a data cleaning plan since even state-of-the-art OCR software can have significant error rates.
- _OS X Gatekeeper_: If you’re running Mac OS X 10.8 or later and get a message that says "Tabula can't be opened because it is from an unidentified developer" message, please let us know. (It shouldn't be happening anymore.) See this GateKeeper page for more assistance — your "Allow applications downloaded from" setting should be set to "Mac App Store and identified developers" or "Anywhere."
- Windows: tabula-win-1.1.0-beta.1a.zip (PGP sig)
- Mac: tabula-mac-1.1.0-beta.1a.zip (PGP sig)
- Linux/Other: tabula-jar-1.1.0-beta.1a.zip (PGP sig)
$ shasum -a1 tabula*.zip 647602da17d365f107260afd8a8b5b3e1d687b98 tabula-jar-1.1.0-beta.1a.zip 054e5c2eb68149f3384b56cc615539e4050de479 tabula-mac-1.1.0-beta.1a.zip bbdca4686b3818e9ec0be5d36e7bc48e944a77d6 tabula-win-1.1.0-beta.1a.zip
You can also verify that you are downloading an authentic, unmodified version of Tabula using PGP. Files are signed with Mike Tigas’ key (0xA993E7156E0E9923), which is available here, on Keybase, or on most key servers.
.zip of the Tabula version you want and also download the corresponding
.zip.asc, then use the
gpg --verify command, such as:
$ gpg --verify tabula-jar-1.1.0-beta.1.zip.asc
You will want the output to contain
Good signature from "Mike Tigas <...>" somewhere in it.