Permalink
Commits on Nov 23, 2011
  1. Extraneous ]

    knowtheory committed Nov 23, 2011
Commits on Nov 22, 2011
Commits on Nov 18, 2011
  1. Updating JODConverter versions. 3.0b4 has support for firing up Libre…

    …Office. (also removing JS comments from the top of conf file, as it blows up the json parser the JODConverter uses)
    knowtheory committed Nov 18, 2011
Commits on Oct 3, 2011
  1. Merge pull request #23 from minio-sk/portable-tests

    Make the tests portable
    jashkenas committed Oct 3, 2011
  2. Merge pull request #24 from minio-sk/feature-multiple_languages

    Feature multiple languages
    jashkenas committed Oct 3, 2011
Commits on Oct 1, 2011
  1. Add test for multiple language support

    This test does not require any additional tesseract language backends to be
    installed, but might fail if tesseract changes its error messages in the
    future.
    Michal Barla committed Oct 1, 2011
  2. Allow language parameter for tesseract text extraction

    Michal Barla committed Oct 1, 2011
  3. Make the tests portable:

    This patch addresses two problems with tests
     - Various tests rely on Dir.glob ordering. This is not reliable; this patch
       introduces assert_directory_contains to avoid Dir.glob ordering
       inconsistencies.
     - test_ocr_extraction relies on exact text match from tesseract. However, this
       differs with each version of tesseract. This patch instead checks that all
       required txt files exist and that they have reasonable size.
    kremso committed Oct 1, 2011
  4. Support libre office as office home param for java

    Michal Barla committed Oct 1, 2011
Commits on Sep 29, 2011
  1. Merge pull request #21 from simeonwillbanks/master

    File --mime-type option unrecognized on CentOS
    jashkenas committed Sep 29, 2011
Commits on Sep 28, 2011
  1. CentOS, and most likely other distros, do not have the --mime-type op…

    …tion for file. Here is the error: 'file: unrecognized option --mime-type'. The --mime option is more standard.
    simeonwillbanks committed Sep 28, 2011
Commits on Sep 15, 2011
  1. Merge pull request #13 from simeonwillbanks/master

    Inspect file mime-type
    jashkenas committed Sep 15, 2011
Commits on Sep 14, 2011
  1. Revert "removing Docsplit's default unsharp."

    This reverts commit e9153b7.
    jashkenas committed Sep 14, 2011
Commits on Sep 13, 2011
  1. Docsplit 0.6.0

    jashkenas committed Sep 13, 2011
  2. Issue #10, stop crying wolf.

    jashkenas committed Sep 13, 2011
Commits on Sep 9, 2011
  1. Merge pull request #19 from edtsech/add_brew_to_gh-pages

    Add `brew` command to installation part of gh-page.
    jashkenas committed Sep 9, 2011
  2. Add `brew` to installation.

    edtsech committed Sep 9, 2011
Commits on Sep 1, 2011
  1. At least on my version of Ubuntu (Natty Narwhal) the tesseract librar…

    …y is labeled as 'tesseract-ocr'
    palewire committed Sep 1, 2011
Commits on Aug 4, 2011
  1. Inspect file mime-type to determine if GraphicsMagick can convert fil…

    …e to PDF
    Simeon Willbanks committed Aug 4, 2011
Commits on Jul 25, 2011
  1. Merge pull request #12 from vrybas/11_escape_incoming_file_names

    11 escape incoming file names
    jashkenas committed Jul 25, 2011
Commits on Jul 22, 2011
Commits on May 16, 2011
Commits on May 13, 2011
  1. despeckle before OCR.

    jashkenas committed May 13, 2011
  2. Docsplit 0.5.2

    jashkenas committed May 13, 2011