Release History

master

0.11.0 (2023-02-26)

  • Replace PdfFileReader with PdfReader and pin PyPDF to >=3.0.0. #307 by Martin Thoma.

0.10.1 (2021-07-11)

  • Change extra requirements from cv to base. You can use pip install "camelot-py[base]" to install everything required to run camelot.

0.10.0 (2021-07-11)

Improvements

  • Add support for multiple image conversion backends. #198 and #253 by Vinayak Mehta.
  • Add markdown export format. #222 by Lucas Cimon.

Documentation


Bugfixes

Improvements

Documentation

  • #193 Add better checks to confirm proper installation of ghostscript. #196 by jimhall.

  • Update advanced.rst plotting examples. #119 by Jens Diemer.

0.8.2 (2020-07-27)

  • Revert the changes in 0.8.1.

0.8.1 (2020-07-21)

Bugfixes

  • #169 Fix import error caused by pdfminer.six==20200720. #171 by Vinayak Mehta.

0.8.0 (2020-05-24)

Improvements

  • Drop Python 2 support!
    • Remove Python 2.7 and 3.5 support.
    • Replace all instances of .format with f-strings.
    • Remove all __future__ imports.
    • Fix HTTP 403 forbidden exception in read_pdf(url) and remove Python 2 urllib support.
    • Fix test data.

Bugfixes

  • Fix library discovery on Windows. #32 by KOLANICH.

  • Fix calling convention of callback functions. #34 by KOLANICH.

0.7.3 (2019-07-07)

Improvements

  • Camelot now follows the Black code style! #1 and #3.

Bugfixes

  • Fix Click.HelpFormatter monkey-patch. #5 by Dimiter Naydenov.

  • Fix strip_text argument getting ignored. #4 by Dimiter Naydenov.

  • #25 edge_tol skipped in read_pdf. #26 by Vinayak Mehta.

  • Fix pytest deprecation warning. #2 by Vinayak Mehta.

  • #293 Split text ignores all text to the right of last cut. #294 by Vinayak Mehta.

  • #277 Sort TableList by order of tables in PDF. #283 by Sym Roe.

  • #312 table_regions throws ValueError when flavor='stream'. #332 by Vinayak Mehta.

0.7.2 (2019-01-10)

Bugfixes

  • #245 Fix AttributeError for encrypted files. #251 by Yatin Taluja.

0.7.1 (2019-01-06)

Bugfixes

  • Move ghostscript import to inside the function so Anaconda builds don't fail.

0.7.0 (2019-01-05)

Improvements

  • #240 Add support to analyze only certain page regions to look for tables. #243 by Vinayak Mehta.
    • You can use table_regions in read_pdf() to specify approximate page regions which may contain tables.
    • Kwarg line_size_scaling is now called line_scale.
  • #212 Add support to export as sqlite database. #244 by Vinayak Mehta.
  • #239 Raise warning if PDF is image-based. #240 by Vinayak Mehta.
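
As a rough illustration of the table_regions entry above, the sketch below formats a region and passes it to read_pdf(). The "x1,y1,x2,y2" string form (top-left corner, then bottom-right corner, in PDF coordinate space) and the Lattice-only use of line_scale are assumptions based on the Camelot documentation rather than on this changelog. region_to_str and read_tables_in_region are hypothetical helper names, and camelot is imported lazily so the module loads even where it is not installed.

```python
def region_to_str(x1, y1, x2, y2):
    """Format a page region as a comma-separated string.

    Assumption: (x1, y1) is the top-left corner and (x2, y2) the
    bottom-right corner, both in PDF coordinate space.
    """
    return ",".join(str(v) for v in (x1, y1, x2, y2))


def read_tables_in_region(pdf_path, region, line_scale=15):
    """Look for tables only inside `region` of the given PDF (Lattice flavor)."""
    import camelot  # lazy import; requires camelot-py to be installed

    return camelot.read_pdf(
        pdf_path,
        flavor="lattice",
        table_regions=[region_to_str(*region)],
        line_scale=line_scale,  # this kwarg was called line_size_scaling before 0.7.0
    )
```

A call such as read_tables_in_region("report.pdf", (170, 370, 560, 270)) would restrict detection to that approximate area; the file name and coordinates here are placeholders.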

Documentation

  • Remove mention of old mesh kwarg from docs. #241 by fte10kso.

Note: The Python wrapper to Ghostscript's C API is now vendored under the ext module, because the ghostscript package is unavailable on Anaconda. The code should be removed after we submit a recipe for it to conda-forge. With this release, the user doesn't need to ensure that the Ghostscript executable is available on the PATH variable.

0.6.0 (2018-12-24)

Improvements

  • #91 Add support to read from url. #236 by Vinayak Mehta.

  • #229, #230 and #233 New configuration parameters. #234 by Vinayak Mehta.

    • strip_text: To define characters that should be stripped from each string.
    • edge_tol: Tolerance parameter for extending textedges vertically.
    • resolution: Resolution used for PDF to PNG conversion.
    • Check out the advanced docs for usage details.
  • #170 Add option to pass pdfminer layout kwargs. #232 by Vinayak Mehta.
    • Keyword arguments for pdfminer.layout.LAParams can now be passed using layout_kwargs in read_pdf().
    • The margins keyword argument in read_pdf() is now deprecated.
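
A minimal sketch of how the parameters introduced in this release might be combined. It assumes that edge_tol applies only to the Stream flavor and resolution only to the Lattice flavor, and that detect_vertical is a valid pdfminer.layout.LAParams option; the specific values are illustrative. build_read_kwargs and read_with_options are hypothetical helper names, not part of Camelot.

```python
def build_read_kwargs(flavor="stream"):
    """Assemble read_pdf() keyword arguments for the given flavor."""
    kwargs = {
        "flavor": flavor,
        # Characters to strip from every extracted string.
        "strip_text": " .\n",
        # Forwarded to pdfminer.layout.LAParams (assumed option name).
        "layout_kwargs": {"detect_vertical": False},
    }
    if flavor == "stream":
        # Tolerance for extending text edges vertically (assumed Stream-only).
        kwargs["edge_tol"] = 500
    else:
        # Resolution for the PDF to PNG conversion (assumed Lattice-only).
        kwargs["resolution"] = 300
    return kwargs


def read_with_options(pdf_path, flavor="stream"):
    """Read tables from `pdf_path` using the options assembled above."""
    import camelot  # lazy import; requires camelot-py to be installed

    return camelot.read_pdf(pdf_path, **build_read_kwargs(flavor))
```

Keeping the flavor-specific options in separate branches avoids passing a Stream option to a Lattice run, or the reverse.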

0.5.0 (2018-12-13)

Improvements

  • #207 Add a plot type for Stream text edges and detected table areas. #224 by Vinayak Mehta.
  • #204 suppress_warnings is now called suppress_stdout. #225 by Vinayak Mehta.
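
Several keyword arguments are renamed across this history: table_area to table_areas (0.3.0), suppress_warnings to suppress_stdout (0.5.0), and line_size_scaling to line_scale (0.7.0). The sketch below shows one way older call sites could be updated. migrate_kwargs and RENAMED_KWARGS are hypothetical names, not part of Camelot, and the helper maps names only; it does not check whether the expected value shapes also changed.

```python
# Old keyword name -> new keyword name, as recorded in this changelog.
RENAMED_KWARGS = {
    "table_area": "table_areas",             # renamed in 0.3.0
    "suppress_warnings": "suppress_stdout",  # renamed in 0.5.0
    "line_size_scaling": "line_scale",       # renamed in 0.7.0
}


def migrate_kwargs(kwargs):
    """Return a copy of `kwargs` with renamed keywords mapped to their new names."""
    migrated = {}
    for name, value in kwargs.items():
        new_name = RENAMED_KWARGS.get(name, name)
        if new_name in migrated:
            # Both the old and the new spelling were supplied.
            raise ValueError(f"duplicate keyword after renaming: {new_name}")
        migrated[new_name] = value
    return migrated
```

The result could then be passed on as camelot.read_pdf(path, **migrate_kwargs(old_kwargs)), assuming the installed version accepts the new names.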

Bugfixes

Documentation

  • Add pdfplumber comparison and update Tabula (stream) comparison. Check out the wiki page.

0.4.1 (2018-12-05)

Bugfixes


Improvements

  • #102 Detect tables automatically when Stream is used. #206 Add implementation of Anssi Nurminen's table detection algorithm by Vinayak Mehta.

0.3.2 (2018-11-04)

Improvements

  • #186 Add _bbox attribute to table. #193 by Vinayak Mehta.
    • You can use table._bbox to get coordinates of the detected table.

0.3.1 (2018-11-02)

Improvements

  • Matplotlib is now an optional requirement. #190 by Vinayak Mehta.

    • You can install it using $ pip install camelot-py[plot].
  • #127 Add tests for plotting. Coverage is now at 87%! #179 by Suyash Behera.

0.3.0 (2018-10-28)

Improvements

  • #162 Add password keyword argument. #180 by rbares.
    • An encrypted PDF can now be decrypted by passing password='<PASSWORD>' to read_pdf or --password <PASSWORD> to the command-line interface. (Limited encryption algorithm support from PyPDF2.)
  • #139 Add suppress_warnings keyword argument. #155 by Jonathan Lloyd.
    • Warnings raised by Camelot can now be suppressed by passing suppress_warnings=True to read_pdf or --quiet to the command-line interface.
  • #154 The CLI can now be run using python -m. Try python -m camelot --help. #159 by Parth P Panchal.
  • #165 Rename table_area to table_areas. #171 by Parth P Panchal.

Bugfixes

  • Raise error if the ghostscript executable is not on the PATH variable. #166 by Vinayak Mehta.
  • Convert filename to lowercase to check for PDF extension. #169 by Vinicius Mesel.

Files

  • #114 Add Makefile and make codecov run only once. #132 by Vaibhav Mule.
  • Add .editorconfig. #151 by KOLANICH.
  • Downgrade numpy version from 1.15.2 to 1.13.3.
  • Add requirements.txt for readthedocs.

Documentation

  • Add "Using conda" section to installation instructions.

  • Add readthedocs badge.

0.2.3 (2018-10-08)

  • Remove hard dependencies on requirements versions.

0.2.2 (2018-10-08)

Bugfixes

  • Move opencv-python to extra_requires. #134 by Vinayak Mehta.

0.2.1 (2018-10-05)

Bugfixes

  • #121 Fix ghostscript subprocess call for Windows. #124 by Vinayak Mehta.

Improvements

  • #123 Make PEP8 compatible. #125 by Oshawk.
  • #110 Add more tests. Coverage is now at 84%!
    • Add tests for __repr__. #128 by Vaibhav Mule.
    • Add tests for CLI. #122 by Vaibhav Mule and #117 by Vinayak Mehta.
    • Add tests for errors/warnings. #113 by Vinayak Mehta.
    • Add tests for output formats and parser kwargs. #126 by Vinayak Mehta.
  • Add Python 3.5 and 3.7 support. #119 by Vinayak Mehta.
  • Add logging and warnings.

Documentation

Note: This release also bumps the version for numpy from 1.13.3 to 1.15.2 and adds a MANIFEST.in. Also, openpyxl==2.5.8 is a new requirement and pytest-cov==2.6.0 is a new dev requirement.

0.2.0 (2018-09-28)

Improvements

  • #81 Add Python 3.6 support. #109 by Vinayak Mehta.

0.1.2 (2018-09-25)

Improvements

  • #85 Add Travis and Codecov.

0.1.1 (2018-09-24)

Documentation

  • Add documentation fixes.

0.1.0 (2018-09-24)

  • Rebirth!