Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[wip] added grobid with delft image #441

Closed
wants to merge 1,609 commits into from
This pull request is big! We’re only showing the most recent 250 commits.

Commits on Jul 25, 2018

  1. Copy the full SHA
    e9ae3f2 View commit details
    Browse the repository at this point in the history

Commits on Jul 26, 2018

  1. update documentation format

    lfoppiano committed Jul 26, 2018
    Copy the full SHA
    eb2881b View commit details
    Browse the repository at this point in the history
  2. Merge pull request kermitt2#331 from kermitt2/docu-fix

    Update documentation and fix build on readthedocs
    lfoppiano committed Jul 26, 2018
    Copy the full SHA
    fcb3480 View commit details
    Browse the repository at this point in the history
  3. Copy the full SHA
    f0762f6 View commit details
    Browse the repository at this point in the history

Commits on Aug 14, 2018

  1. Copy the full SHA
    af01d0b View commit details
    Browse the repository at this point in the history

Commits on Aug 16, 2018

  1. Copy the full SHA
    9c414cd View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    99ea296 View commit details
    Browse the repository at this point in the history
  3. Copy the full SHA
    3a276bc View commit details
    Browse the repository at this point in the history
  4. add bibtex entry

    kermitt2 committed Aug 16, 2018
    Copy the full SHA
    55183a4 View commit details
    Browse the repository at this point in the history
  5. styling bibtex entry

    kermitt2 committed Aug 16, 2018
    Copy the full SHA
    58be7f1 View commit details
    Browse the repository at this point in the history

Commits on Aug 22, 2018

  1. Copy the full SHA
    639cb4e View commit details
    Browse the repository at this point in the history
  2. Fix test

    kermitt2 committed Aug 22, 2018
    Copy the full SHA
    31d46be View commit details
    Browse the repository at this point in the history

Commits on Aug 23, 2018

  1. First iteration for pdfalto integration.

    Aazhar committed Aug 23, 2018
    Copy the full SHA
    76724b2 View commit details
    Browse the repository at this point in the history
  2. Add pdfalto bin for mac-64.

    Aazhar committed Aug 23, 2018
    Copy the full SHA
    c5b3f9c View commit details
    Browse the repository at this point in the history
  3. Use ICU library for diacritics handling.

    Aazhar committed Aug 23, 2018
    Copy the full SHA
    e68001f View commit details
    Browse the repository at this point in the history
  4. Add linux 64 pdfalto binary

    kermitt2 authored and Aazhar committed Aug 23, 2018
    Copy the full SHA
    d9652ee View commit details
    Browse the repository at this point in the history
  5. Update pdfalto mac-64 binary.

    Aazhar committed Aug 23, 2018
    Copy the full SHA
    a3d5e32 View commit details
    Browse the repository at this point in the history
  6. Update pdflato for mac 64.

    Aazhar committed Aug 23, 2018
    Copy the full SHA
    7ffac55 View commit details
    Browse the repository at this point in the history
  7. Add pdf metadata parser.

    Aazhar committed Aug 23, 2018
    Copy the full SHA
    8e9a463 View commit details
    Browse the repository at this point in the history
  8. Merge branch 'pdfalto_integration' of https://github.com/kermitt2/grobid

     into pdfalto_integration
    Aazhar committed Aug 23, 2018
    Copy the full SHA
    5e1e8e0 View commit details
    Browse the repository at this point in the history
  9. Merge branch 'master' into pdfalto_integration

    Aazhar committed Aug 23, 2018
    Copy the full SHA
    da9c491 View commit details
    Browse the repository at this point in the history

Commits on Aug 29, 2018

  1. Delete pdf metadata file.

    Aazhar committed Aug 29, 2018
    Copy the full SHA
    eb972ae View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    3beeee2 View commit details
    Browse the repository at this point in the history
  3. Remove ICU and charcter composition from parser (now support in pdfal…

    …to converter).
    Aazhar committed Aug 29, 2018
    Copy the full SHA
    05fc0e5 View commit details
    Browse the repository at this point in the history
  4. Add pdfalto options.

    Aazhar committed Aug 29, 2018
    Copy the full SHA
    0495347 View commit details
    Browse the repository at this point in the history
  5. Put back gradle files.

    Aazhar committed Aug 29, 2018
    Copy the full SHA
    e25f511 View commit details
    Browse the repository at this point in the history

Commits on Sep 3, 2018

  1. Fixing failing test

    lfoppiano committed Sep 3, 2018
    Copy the full SHA
    887ebf6 View commit details
    Browse the repository at this point in the history
  2. Update pdfalto executables.

    Aazhar committed Sep 3, 2018
    Copy the full SHA
    3e52181 View commit details
    Browse the repository at this point in the history

Commits on Sep 4, 2018

  1. Update pdfalto binaries using static icu.

    Aazhar committed Sep 4, 2018
    Copy the full SHA
    bf83c36 View commit details
    Browse the repository at this point in the history

Commits on Sep 6, 2018

  1. Fix after and before properties settings.

    Aazhar committed Sep 6, 2018
    Copy the full SHA
    7973eff View commit details
    Browse the repository at this point in the history
  2. Merge branch 'master' into pdfalto_integration

    -This merge is to fix issue with unit tests.
    Aazhar committed Sep 6, 2018
    Copy the full SHA
    b44f383 View commit details
    Browse the repository at this point in the history
  3. Update travis show stacktrace.

    Aazhar committed Sep 6, 2018
    Copy the full SHA
    b683c51 View commit details
    Browse the repository at this point in the history

Commits on Sep 7, 2018

  1. Add JAXB api dependency

    kermitt2 committed Sep 7, 2018
    Copy the full SHA
    20d0727 View commit details
    Browse the repository at this point in the history

Commits on Sep 10, 2018

  1. Update travis conf, use upgrade gcc.

    Aazhar committed Sep 10, 2018
    Copy the full SHA
    139d16b View commit details
    Browse the repository at this point in the history
  2. Add Grobid Factory reset method.

    * Static fields need to be cleared after each test class (otherwise any modification will impact all the rest of test cases)
    Aazhar committed Sep 10, 2018
    Copy the full SHA
    2c6c162 View commit details
    Browse the repository at this point in the history
  3. Update pdftoxml_server to pdfalto_server.

    Aazhar committed Sep 10, 2018
    Copy the full SHA
    0e028c2 View commit details
    Browse the repository at this point in the history

Commits on Sep 11, 2018

  1. Copy the full SHA
    5d68f95 View commit details
    Browse the repository at this point in the history
  2. Add before test class to init properties.

    Aazhar committed Sep 11, 2018
    Copy the full SHA
    59d26b9 View commit details
    Browse the repository at this point in the history

Commits on Sep 12, 2018

  1. Update pdfalto linux binary.

    Aazhar committed Sep 12, 2018
    Copy the full SHA
    61646c5 View commit details
    Browse the repository at this point in the history
  2. Use staticly linked pdfalto.

    Aazhar committed Sep 12, 2018
    Copy the full SHA
    cbd28ad View commit details
    Browse the repository at this point in the history
  3. typo.

    Aazhar committed Sep 12, 2018
    Copy the full SHA
    126e8ac View commit details
    Browse the repository at this point in the history
  4. Update binary file mode.

    Aazhar committed Sep 12, 2018
    Copy the full SHA
    36fe377 View commit details
    Browse the repository at this point in the history

Commits on Sep 19, 2018

  1. Update gradle version

    kermitt2 committed Sep 19, 2018
    Copy the full SHA
    ea843cc View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    4c3dfa0 View commit details
    Browse the repository at this point in the history
  3. correct dev version in doc

    kermitt2 committed Sep 19, 2018
    Copy the full SHA
    df5f92b View commit details
    Browse the repository at this point in the history

Commits on Sep 22, 2018

  1. Update end-to-end evaluation

    kermitt2 committed Sep 22, 2018
    Copy the full SHA
    688ff81 View commit details
    Browse the repository at this point in the history

Commits on Sep 25, 2018

  1. Copy the full SHA
    c199c59 View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    597d866 View commit details
    Browse the repository at this point in the history

Commits on Oct 3, 2018

  1. Updated dependencies

     - dropwizard (to latest version)
     - pdfbox (to latest minor version)
    lfoppiano committed Oct 3, 2018
    Copy the full SHA
    3f05357 View commit details
    Browse the repository at this point in the history
  2. Removing unused imports

    lfoppiano committed Oct 3, 2018
    Copy the full SHA
    b254a8c View commit details
    Browse the repository at this point in the history
  3. Merge pull request kermitt2#350 from kermitt2/updated-dependencies

    Updated dependencies
    kermitt2 committed Oct 3, 2018
    Copy the full SHA
    ba8e2e4 View commit details
    Browse the repository at this point in the history

Commits on Oct 8, 2018

  1. Copy the full SHA
    c5ef2c2 View commit details
    Browse the repository at this point in the history

Commits on Oct 14, 2018

  1. Copy the full SHA
    6fbfd41 View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    071948b View commit details
    Browse the repository at this point in the history
  3. Copy the full SHA
    286f04f View commit details
    Browse the repository at this point in the history

Commits on Oct 17, 2018

  1. Copy the full SHA
    56fc012 View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    df4f8d3 View commit details
    Browse the repository at this point in the history
  3. Updating libraries

    lfoppiano committed Oct 17, 2018
    Copy the full SHA
    a221103 View commit details
    Browse the repository at this point in the history
  4. Copy the full SHA
    a231c2b View commit details
    Browse the repository at this point in the history
  5. Copy the full SHA
    ce7d267 View commit details
    Browse the repository at this point in the history
  6. Copy the full SHA
    c4eb7d4 View commit details
    Browse the repository at this point in the history
  7. Copy the full SHA
    e1f95a9 View commit details
    Browse the repository at this point in the history
  8. Copy the full SHA
    eabb3cb View commit details
    Browse the repository at this point in the history
  9. minor typos in the doc

    kermitt2 committed Oct 17, 2018
    Copy the full SHA
    11e23a4 View commit details
    Browse the repository at this point in the history

Commits on Oct 21, 2018

  1. Copy the full SHA
    90ad311 View commit details
    Browse the repository at this point in the history

Commits on Oct 23, 2018

  1. put on hold output of collaboration in the header (no training data f…

    …or it for the moment)
    kermitt2 committed Oct 23, 2018
    Copy the full SHA
    1b2338a View commit details
    Browse the repository at this point in the history

Commits on Oct 29, 2018

  1. - fixing reference tokens (they were sharing the same List object)

    - fixing running crossref client
    - refactoring of the main page area detection
    - a bit more heuristics to detect figures
    detonator413 committed Oct 29, 2018
    Copy the full SHA
    38cb062 View commit details
    Browse the repository at this point in the history
  2. adapting to new PDFBox

    detonator413 committed Oct 29, 2018
    Copy the full SHA
    e6b0ae8 View commit details
    Browse the repository at this point in the history

Commits on Nov 9, 2018

  1. 3
    Copy the full SHA
    6736736 View commit details
    Browse the repository at this point in the history

Commits on Nov 16, 2018

  1. Copy the full SHA
    35bd124 View commit details
    Browse the repository at this point in the history

Commits on Nov 18, 2018

  1. Add new consolidation option

    kermitt2 committed Nov 18, 2018
    Copy the full SHA
    34b1ae5 View commit details
    Browse the repository at this point in the history
  2. fix tests

    kermitt2 committed Nov 18, 2018
    Copy the full SHA
    8a77d8e View commit details
    Browse the repository at this point in the history
  3. Merge pull request kermitt2#355 from kermitt2/standalone-figure-extra…

    …ction
    
    Figure extraction improvements
    kermitt2 committed Nov 18, 2018
    Copy the full SHA
    35c758f View commit details
    Browse the repository at this point in the history
  4. 6
    Copy the full SHA
    502d541 View commit details
    Browse the repository at this point in the history

Commits on Nov 19, 2018

  1. Copy the full SHA
    78082b0 View commit details
    Browse the repository at this point in the history

Commits on Nov 21, 2018

  1. Copy the full SHA
    d58c820 View commit details
    Browse the repository at this point in the history

Commits on Nov 24, 2018

  1. complete consolidation

    kermitt2 committed Nov 24, 2018
    Copy the full SHA
    f0e5bf4 View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    b05734f View commit details
    Browse the repository at this point in the history
  3. Copy the full SHA
    a4f5f25 View commit details
    Browse the repository at this point in the history

Commits on Nov 25, 2018

  1. Copy the full SHA
    b619513 View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    5c58af7 View commit details
    Browse the repository at this point in the history

Commits on Dec 6, 2018

  1. Copy the full SHA
    3e8ab20 View commit details
    Browse the repository at this point in the history

Commits on Dec 10, 2018

  1. Copy the full SHA
    8bd6a44 View commit details
    Browse the repository at this point in the history

Commits on Dec 11, 2018

  1. Update pdflato_server.

    Aazhar committed Dec 11, 2018
    Copy the full SHA
    06ac2b6 View commit details
    Browse the repository at this point in the history

Commits on Dec 13, 2018

  1. Add biblio-glutton client

    kermitt2 committed Dec 13, 2018
    Copy the full SHA
    38e23a1 View commit details
    Browse the repository at this point in the history
  2. Update segmentation.md

    brijml committed Dec 13, 2018
    Copy the full SHA
    0bed935 View commit details
    Browse the repository at this point in the history
  3. Copy the full SHA
    4caabaa View commit details
    Browse the repository at this point in the history
  4. Update Training-the-models-of-Grobid.md

    Added a link to a comment on an issue about the training parameters.
    brijml committed Dec 13, 2018
    Copy the full SHA
    7831cfa View commit details
    Browse the repository at this point in the history
  5. Copy the full SHA
    459a007 View commit details
    Browse the repository at this point in the history
  6. fix NPE for glutton

    kermitt2 committed Dec 13, 2018
    Copy the full SHA
    d301fb5 View commit details
    Browse the repository at this point in the history

Commits on Dec 14, 2018

  1. adjust glutton query rate

    kermitt2 committed Dec 14, 2018
    Copy the full SHA
    4d36466 View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    dce4651 View commit details
    Browse the repository at this point in the history
  3. Copy the full SHA
    a7ebb1b View commit details
    Browse the repository at this point in the history

Commits on Dec 17, 2018

  1. Merge pull request kermitt2#364 from brijml/patch-2

    Update Training-the-models-of-Grobid.md
    kermitt2 committed Dec 17, 2018
    Copy the full SHA
    f823152 View commit details
    Browse the repository at this point in the history
  2. Styling doc

    kermitt2 committed Dec 17, 2018
    Copy the full SHA
    3d1a9ca View commit details
    Browse the repository at this point in the history
  3. Merge pull request kermitt2#363 from brijml/patch-1

    Update segmentation.md
    kermitt2 committed Dec 17, 2018
    Copy the full SHA
    296f209 View commit details
    Browse the repository at this point in the history

Commits on Dec 22, 2018

  1. Add consolidation service property; various simplifactions; remove pr…

    …operty-based admin web services
    kermitt2 committed Dec 22, 2018
    Copy the full SHA
    5ae8acd View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    a83e499 View commit details
    Browse the repository at this point in the history
  3. Copy the full SHA
    d2d42ba View commit details
    Browse the repository at this point in the history

Commits on Dec 25, 2018

  1. First integration via JEP

    kermitt2 committed Dec 25, 2018
    Copy the full SHA
    6605dc3 View commit details
    Browse the repository at this point in the history

Commits on Dec 26, 2018

  1. Copy the full SHA
    ca1464f View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    d45ebe5 View commit details
    Browse the repository at this point in the history

Commits on Dec 27, 2018

  1. Copy the full SHA
    6ce9f8f View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    fad45a5 View commit details
    Browse the repository at this point in the history

Commits on Dec 29, 2018

  1. Copy the full SHA
    ab43d65 View commit details
    Browse the repository at this point in the history

Commits on Jan 4, 2019

  1. Copy the full SHA
    e45eca9 View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    4026d24 View commit details
    Browse the repository at this point in the history
  3. Copy the full SHA
    97f3112 View commit details
    Browse the repository at this point in the history
  4. renamed test method

    de-code committed Jan 4, 2019
    Copy the full SHA
    81d8501 View commit details
    Browse the repository at this point in the history

Commits on Jan 7, 2019

  1. Merge pull request kermitt2#369 from kermitt2/delft-integration

    DeLFT deep learning model integration
    kermitt2 committed Jan 7, 2019
    Copy the full SHA
    b5c1ef6 View commit details
    Browse the repository at this point in the history
  2. Merge pull request kermitt2#368 from de-code/use-available-threads-wh…

    …en-nb-threads-is-zero
    
    Use available threads when nb threads is zero
    kermitt2 committed Jan 7, 2019
    Copy the full SHA
    5e12b9b View commit details
    Browse the repository at this point in the history

Commits on Jan 10, 2019

  1. Copy the full SHA
    ee8fff1 View commit details
    Browse the repository at this point in the history

Commits on Jan 11, 2019

  1. Copy the full SHA
    0a87166 View commit details
    Browse the repository at this point in the history

Commits on Jan 13, 2019

  1. Copy the full SHA
    3f37b11 View commit details
    Browse the repository at this point in the history

Commits on Jan 15, 2019

  1. Add parameter to load models at service startup; remove service prope…

    …rties file; remove non parallel mode in service; code simplification
    kermitt2 committed Jan 15, 2019
    Copy the full SHA
    b62ff88 View commit details
    Browse the repository at this point in the history
  2. Update document image parsing.

    Aazhar committed Jan 15, 2019
    Copy the full SHA
    cb74558 View commit details
    Browse the repository at this point in the history

Commits on Jan 17, 2019

  1. Copy the full SHA
    94c0794 View commit details
    Browse the repository at this point in the history
  2. cleaning

    kermitt2 committed Jan 17, 2019
    Copy the full SHA
    a951a12 View commit details
    Browse the repository at this point in the history
  3. Merge pull request kermitt2#374 from kermitt2/citation-contexts

    Citation context improvement.
    kermitt2 committed Jan 17, 2019
    Copy the full SHA
    7848bce View commit details
    Browse the repository at this point in the history
  4. allowing changing project version when uploading archives (useful for…

    … fixing a version in between releases)
    detonator413 committed Jan 17, 2019
    1
    Copy the full SHA
    13503f6 View commit details
    Browse the repository at this point in the history

Commits on Jan 18, 2019

  1. Use reading order and update pdfalto.

    Aazhar committed Jan 18, 2019
    Copy the full SHA
    3154f29 View commit details
    Browse the repository at this point in the history
  2. Update MonographParser.java

    Comment out a test of the existence of a file which was not yet created.
    albamath committed Jan 18, 2019
    Copy the full SHA
    d631477 View commit details
    Browse the repository at this point in the history
  3. Fix pdfalto linux binary.

    Aazhar committed Jan 18, 2019
    Copy the full SHA
    8408e11 View commit details
    Browse the repository at this point in the history

Commits on Jan 19, 2019

  1. Update pdfalto.

    Aazhar committed Jan 19, 2019
    Copy the full SHA
    6335bf1 View commit details
    Browse the repository at this point in the history
  2. Add option for ocrization of (problematic) characters (so far replace…

    … with a placeholder).
    Aazhar committed Jan 19, 2019
    Copy the full SHA
    87b4225 View commit details
    Browse the repository at this point in the history

Commits on Jan 20, 2019

  1. Update pdflato (still staging).

    Aazhar committed Jan 20, 2019
    Copy the full SHA
    094238a View commit details
    Browse the repository at this point in the history

Commits on Jan 24, 2019

  1. Update pdfalto_server.

    Aazhar committed Jan 24, 2019
    Copy the full SHA
    ac295a9 View commit details
    Browse the repository at this point in the history

Commits on Jan 29, 2019

  1. Various improvements of citation context identification, correspondin…

    …g updates of training data and models
    kermitt2 committed Jan 29, 2019
    Copy the full SHA
    f6cce24 View commit details
    Browse the repository at this point in the history
  2. remove outdated tests

    kermitt2 committed Jan 29, 2019
    Copy the full SHA
    943e2b0 View commit details
    Browse the repository at this point in the history
  3. cleaning

    kermitt2 committed Jan 29, 2019
    Copy the full SHA
    660d725 View commit details
    Browse the repository at this point in the history

Commits on Feb 1, 2019

  1. Copy the full SHA
    5b36e62 View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    d3003f6 View commit details
    Browse the repository at this point in the history
  3. Update pdfalto bins and some renamings.

    Aazhar committed Feb 1, 2019
    Copy the full SHA
    01a11ad View commit details
    Browse the repository at this point in the history
  4. Fix missing updates.

    Aazhar committed Feb 1, 2019
    Copy the full SHA
    adceb8c View commit details
    Browse the repository at this point in the history
  5. Merge branch 'master' into pdfalto_integration

    Aazhar committed Feb 1, 2019
    Copy the full SHA
    a847d71 View commit details
    Browse the repository at this point in the history
  6. Merge branch 'master' into pdfalto_integration

    Aazhar committed Feb 1, 2019
    Copy the full SHA
    dc67949 View commit details
    Browse the repository at this point in the history
  7. Typo.

    Aazhar committed Feb 1, 2019
    Copy the full SHA
    6b7d72f View commit details
    Browse the repository at this point in the history
  8. Update modes.

    Aazhar committed Feb 1, 2019
    Copy the full SHA
    9654410 View commit details
    Browse the repository at this point in the history

Commits on Feb 2, 2019

  1. Update pdfalto bins.

    Aazhar committed Feb 2, 2019
    Copy the full SHA
    401190f View commit details
    Browse the repository at this point in the history

Commits on Feb 5, 2019

  1. Copy the full SHA
    cca88a6 View commit details
    Browse the repository at this point in the history
  2. update eval

    kermitt2 committed Feb 5, 2019
    Copy the full SHA
    51275b7 View commit details
    Browse the repository at this point in the history
  3. Merge pull request kermitt2#382 from kermitt2/better-citation-contexts

    Better citation contexts
    kermitt2 committed Feb 5, 2019
    Copy the full SHA
    4254d0e View commit details
    Browse the repository at this point in the history
  4. Update result for release

    kermitt2 committed Feb 5, 2019
    Copy the full SHA
    b1f532e View commit details
    Browse the repository at this point in the history
  5. Update doc for release

    kermitt2 committed Feb 5, 2019
    Copy the full SHA
    969415e View commit details
    Browse the repository at this point in the history
  6. Copy the full SHA
    4606dc7 View commit details
    Browse the repository at this point in the history
  7. Copy the full SHA
    0fcd1f0 View commit details
    Browse the repository at this point in the history
  8. Copy the full SHA
    f547dee View commit details
    Browse the repository at this point in the history
  9. Copy the full SHA
    4edff00 View commit details
    Browse the repository at this point in the history
  10. update doc for new release

    kermitt2 committed Feb 5, 2019
    Copy the full SHA
    7fdc0c6 View commit details
    Browse the repository at this point in the history

Commits on Feb 6, 2019

  1. Use the correct parsing method.

    Aazhar committed Feb 6, 2019
    Copy the full SHA
    5cca707 View commit details
    Browse the repository at this point in the history
  2. Update pdfalto bin.

    Aazhar committed Feb 6, 2019
    Copy the full SHA
    694fca6 View commit details
    Browse the repository at this point in the history

Commits on Feb 10, 2019

  1. doc styling

    kermitt2 committed Feb 10, 2019
    Copy the full SHA
    001d5cc View commit details
    Browse the repository at this point in the history
  2. Cleaning

    kermitt2 committed Feb 10, 2019
    Copy the full SHA
    1861501 View commit details
    Browse the repository at this point in the history
  3. change default config

    kermitt2 committed Feb 10, 2019
    Copy the full SHA
    1951d77 View commit details
    Browse the repository at this point in the history
  4. Merge pull request kermitt2#380 from kermitt2/pdfalto_integration

    [first iteration] PDFALTO integration
    kermitt2 committed Feb 10, 2019
    Copy the full SHA
    9048a02 View commit details
    Browse the repository at this point in the history
  5. Copy the full SHA
    a755694 View commit details
    Browse the repository at this point in the history
  6. Copy the full SHA
    9189534 View commit details
    Browse the repository at this point in the history
  7. Copy the full SHA
    af790f0 View commit details
    Browse the repository at this point in the history
  8. Copy the full SHA
    ef5fbdc View commit details
    Browse the repository at this point in the history

Commits on Feb 12, 2019

  1. Copy the full SHA
    69e5e3d View commit details
    Browse the repository at this point in the history
  2. review consolidation

    kermitt2 committed Feb 12, 2019
    Copy the full SHA
    1a55156 View commit details
    Browse the repository at this point in the history
  3. better glutton interfacing

    kermitt2 committed Feb 12, 2019
    Copy the full SHA
    6d487aa View commit details
    Browse the repository at this point in the history

Commits on Feb 13, 2019

  1. Copy the full SHA
    52ff6a3 View commit details
    Browse the repository at this point in the history

Commits on Feb 14, 2019

  1. Copy the full SHA
    9472e55 View commit details
    Browse the repository at this point in the history

Commits on Feb 15, 2019

  1. update lexicon

    Gerit Wagner committed Feb 15, 2019
    Copy the full SHA
    196f322 View commit details
    Browse the repository at this point in the history

Commits on Feb 16, 2019

  1. hotfix for pdf2xml renaming

    kermitt2 committed Feb 16, 2019
    Copy the full SHA
    dbbbe0f View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    9758d1b View commit details
    Browse the repository at this point in the history

Commits on Feb 18, 2019

  1. Copy the full SHA
    85df4ee View commit details
    Browse the repository at this point in the history
  2. Merge pull request kermitt2#396 from geritwagner/master

    update lexicon
    kermitt2 committed Feb 18, 2019
    Copy the full SHA
    c496e0a View commit details
    Browse the repository at this point in the history

Commits on Feb 20, 2019

  1. Copy the full SHA
    60f0b6e View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    0b92f5f View commit details
    Browse the repository at this point in the history
  3. [pdfalto] Fix svg coordinates.

    Aazhar committed Feb 20, 2019
    Copy the full SHA
    d304dd9 View commit details
    Browse the repository at this point in the history

Commits on Feb 22, 2019

  1. Copy the full SHA
    d04ed4d View commit details
    Browse the repository at this point in the history

Commits on Feb 26, 2019

  1. Use svg vectorial image extension.

    Aazhar committed Feb 26, 2019
    Copy the full SHA
    3aea34b View commit details
    Browse the repository at this point in the history
  2. Update Xqueries.

    Aazhar committed Feb 26, 2019
    Copy the full SHA
    8a79a7f View commit details
    Browse the repository at this point in the history
  3. Copy the full SHA
    ad298ba View commit details
    Browse the repository at this point in the history
  4. Copy the full SHA
    704894c View commit details
    Browse the repository at this point in the history
  5. Fix issue invalid utf8 sequence.

    Aazhar committed Feb 26, 2019
    Copy the full SHA
    fcd0ba3 View commit details
    Browse the repository at this point in the history

Commits on Feb 28, 2019

  1. Copy the full SHA
    533a178 View commit details
    Browse the repository at this point in the history

Commits on Mar 1, 2019

  1. Copy the full SHA
    acc71d0 View commit details
    Browse the repository at this point in the history

Commits on Mar 5, 2019

  1. [pdfalto] Fix issue with coordinates kermitt2#330.

    * Use numerical mapping when ocr is not activated.
    Aazhar committed Mar 5, 2019
    Copy the full SHA
    82395ac View commit details
    Browse the repository at this point in the history

Commits on Mar 12, 2019

  1. Copy the full SHA
    0439efd View commit details
    Browse the repository at this point in the history
  2. Renaming to pdfalto.

    Aazhar committed Mar 12, 2019
    Copy the full SHA
    b973846 View commit details
    Browse the repository at this point in the history
  3. Copy the full SHA
    ac4959b View commit details
    Browse the repository at this point in the history
  4. Update pdfalto sax parser tests.

    Aazhar committed Mar 12, 2019
    Copy the full SHA
    5c72b9c View commit details
    Browse the repository at this point in the history
  5. Remove old pdf2xml sax parser.

    Aazhar committed Mar 12, 2019
    Copy the full SHA
    4fa512d View commit details
    Browse the repository at this point in the history
  6. Copy the full SHA
    41fd74e View commit details
    Browse the repository at this point in the history

Commits on Mar 13, 2019

  1. Update unit test for pdfalto parsers.

    Aazhar committed Mar 13, 2019
    Copy the full SHA
    a55deaa View commit details
    Browse the repository at this point in the history

Commits on Mar 19, 2019

  1. Update annotation actions.

    Aazhar committed Mar 19, 2019
    Copy the full SHA
    0a9ea3f View commit details
    Browse the repository at this point in the history

Commits on Apr 1, 2019

  1. Copy the full SHA
    99d7083 View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    d8ecab7 View commit details
    Browse the repository at this point in the history

Commits on Apr 2, 2019

  1. Copy the full SHA
    828c1b9 View commit details
    Browse the repository at this point in the history
  2. Merge pull request kermitt2#414 from rgieseke/patch-1

    Add workaround for Java version to Troubleshooting
    kermitt2 committed Apr 2, 2019
    Copy the full SHA
    d5d713d View commit details
    Browse the repository at this point in the history

Commits on Apr 6, 2019

  1. Add missing win-64 binary

    kermitt2 committed Apr 6, 2019
    Copy the full SHA
    e24a7d4 View commit details
    Browse the repository at this point in the history

Commits on Apr 11, 2019

  1. Copy the full SHA
    1e2cc19 View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    b33428f View commit details
    Browse the repository at this point in the history

Commits on Apr 12, 2019

  1. 2
    Copy the full SHA
    f8322f3 View commit details
    Browse the repository at this point in the history
  2. Update Troubleshooting.md

    Aazhar committed Apr 12, 2019
    Copy the full SHA
    97bceac View commit details
    Browse the repository at this point in the history

Commits on Apr 14, 2019

  1. make it build in IntelliJ

    boumenot committed Apr 14, 2019
    Copy the full SHA
    22666b0 View commit details
    Browse the repository at this point in the history
  2. Merge pull request kermitt2#421 from boumenot/boumenot/playground

    make it build in IntelliJ
    kermitt2 committed Apr 14, 2019
    Copy the full SHA
    a35341b View commit details
    Browse the repository at this point in the history

Commits on Apr 15, 2019

  1. Copy the full SHA
    da0a8f1 View commit details
    Browse the repository at this point in the history

Commits on Apr 23, 2019

  1. test citation style

    kermitt2 committed Apr 23, 2019
    Copy the full SHA
    db23ff8 View commit details
    Browse the repository at this point in the history
  2. test variant citation style

    kermitt2 committed Apr 23, 2019
    Copy the full SHA
    346543d View commit details
    Browse the repository at this point in the history

Commits on Apr 24, 2019

  1. Copy the full SHA
    10a96a6 View commit details
    Browse the repository at this point in the history

Commits on Apr 25, 2019

  1. Clean pdftoxml exe and dlls.

    Aazhar committed Apr 25, 2019
    Copy the full SHA
    0feb144 View commit details
    Browse the repository at this point in the history
  2. Update pdfalto and add windows/cygwin dependencies.

    * cygwin1.dll is mandantory and cant be staticly compiled : http://cygwin.com/faq/faq.html#faq.programming.static-linking
    Aazhar committed Apr 25, 2019
    Copy the full SHA
    5d07f93 View commit details
    Browse the repository at this point in the history

Commits on Apr 26, 2019

  1. Revert "Clean pdftoxml exe and dlls."

    This reverts commit 0feb144.
    Aazhar committed Apr 26, 2019
    Copy the full SHA
    2285253 View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    fe8c8b8 View commit details
    Browse the repository at this point in the history
  3. Copy the full SHA
    4e1bb24 View commit details
    Browse the repository at this point in the history

Commits on May 4, 2019

  1. Copy the full SHA
    b4ff5b7 View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    ec23b2f View commit details
    Browse the repository at this point in the history
  3. Copy the full SHA
    dad047c View commit details
    Browse the repository at this point in the history

Commits on May 5, 2019

  1. Copy the full SHA
    ad47453 View commit details
    Browse the repository at this point in the history

Commits on May 7, 2019

  1. Copy the full SHA
    68f43e6 View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    262475b View commit details
    Browse the repository at this point in the history

Commits on May 12, 2019

  1. ignore new submodule

    kermitt2 committed May 12, 2019
    Copy the full SHA
    523e37c View commit details
    Browse the repository at this point in the history

Commits on May 15, 2019

  1. Copy the full SHA
    76fd65d View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    edea564 View commit details
    Browse the repository at this point in the history

Commits on May 16, 2019

  1. Copy the full SHA
    dd9a7df View commit details
    Browse the repository at this point in the history

Commits on May 17, 2019

  1. Copy the full SHA
    332e303 View commit details
    Browse the repository at this point in the history

Commits on May 20, 2019

  1. update date demo

    kermitt2 committed May 20, 2019
    Copy the full SHA
    8ef4ca5 View commit details
    Browse the repository at this point in the history

Commits on May 26, 2019

  1. Copy the full SHA
    4957316 View commit details
    Browse the repository at this point in the history

Commits on May 27, 2019

  1. Copy the full SHA
    a01aa93 View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    e01592b View commit details
    Browse the repository at this point in the history

Commits on May 28, 2019

  1. update tests

    lfoppiano committed May 28, 2019
    Copy the full SHA
    997cf87 View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    d747d4f View commit details
    Browse the repository at this point in the history
  3. Copy the full SHA
    4cd5173 View commit details
    Browse the repository at this point in the history
  4. Cleaning

    kermitt2 committed May 28, 2019
    Copy the full SHA
    49c6421 View commit details
    Browse the repository at this point in the history
  5. Copy the full SHA
    1fbfb19 View commit details
    Browse the repository at this point in the history
  6. Copy the full SHA
    f5238bf View commit details
    Browse the repository at this point in the history
  7. Update doc to new version

    kermitt2 committed May 28, 2019
    Copy the full SHA
    d446747 View commit details
    Browse the repository at this point in the history
  8. Merge pull request kermitt2#433 from kermitt2/0.5.4-fixes

    pdf.js missing when building with docker
    lfoppiano committed May 28, 2019
    Copy the full SHA
    bfcd747 View commit details
    Browse the repository at this point in the history
  9. Copy the full SHA
    51475f4 View commit details
    Browse the repository at this point in the history

Commits on May 31, 2019

  1. insominac fix of kermitt2#295

    kermitt2 committed May 31, 2019
    Copy the full SHA
    f93e4ab View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    65c07ab View commit details
    Browse the repository at this point in the history

Commits on Jun 6, 2019

  1. Copy the full SHA
    5d98df9 View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    c59cea3 View commit details
    Browse the repository at this point in the history

Commits on Jun 8, 2019

  1. Copy the full SHA
    18f18ed View commit details
    Browse the repository at this point in the history

Commits on Jun 9, 2019

  1. Copy the full SHA
    ae1ba53 View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    20d0aeb View commit details
    Browse the repository at this point in the history

Commits on Jun 10, 2019

  1. Merge pull request kermitt2#435 from kermitt2/gradle5

    migration to gradle 5
    kermitt2 committed Jun 10, 2019
    Copy the full SHA
    d372e6c View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    e3b8886 View commit details
    Browse the repository at this point in the history

Commits on Jun 14, 2019

  1. Swap w/h in coordinates documentation

    These were the wrong way round.
    bfirsh committed Jun 14, 2019
    Copy the full SHA
    34d59bb View commit details
    Browse the repository at this point in the history
  2. Merge pull request kermitt2#437 from bfirsh/patch-1

    Swap w/h in coordinates documentation
    kermitt2 committed Jun 14, 2019
    Copy the full SHA
    9eac968 View commit details
    Browse the repository at this point in the history

Commits on Jun 24, 2019

  1. added grobid with delft image

    de-code committed Jun 24, 2019
    Copy the full SHA
    712df80 View commit details
    Browse the repository at this point in the history

Commits on Jun 26, 2019

  1. link delft data directory

    de-code committed Jun 26, 2019
    Copy the full SHA
    694257b View commit details
    Browse the repository at this point in the history
  2. Copy the full SHA
    771ca1f View commit details
    Browse the repository at this point in the history
  3. Copy the full SHA
    c0c913d View commit details
    Browse the repository at this point in the history
  4. moved venv down

    de-code committed Jun 26, 2019
    Copy the full SHA
    506bec1 View commit details
    Browse the repository at this point in the history
  5. Copy the full SHA
    b71408c View commit details
    Browse the repository at this point in the history