Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Handle old Archivematica METS documents #53

Merged
merged 8 commits into from
Mar 8, 2019

Commits on Jan 17, 2019

  1. Accept missing mets:metsHdr

    jrwdunham authored and sevein committed Jan 17, 2019
    Configuration menu
    Copy the full SHA
    62955db View commit details
    Browse the repository at this point in the history

Commits on Mar 8, 2019

  1. Accept dirs with fptrs

    This commit updates `_parse_tree_structmap` so `FSEntry` objects are
    created also when the parser runs into direct `fptr` elements, e.g.:
    
        <div TYPE="directory" LABEL="objects">
            <fptr FILEID="001.csv-0c0576c3-e82e-4501-9aef-1f3d2d35aeda"/>
            <fptr FILEID="002.mpg-722de890-2a3c-4343-bb69-f2b29894d9a7"/>
            <fptr FILEID="003.mpg-423247e7-c27d-4b5a-a64a-b02833f2212d"/>
        </div>
    sevein committed Mar 8, 2019
    Configuration menu
    Copy the full SHA
    c6443dd View commit details
    Browse the repository at this point in the history
  2. premisrw: add PREMIS v2.1

    sevein committed Mar 8, 2019
    Configuration menu
    Copy the full SHA
    e7a584b View commit details
    Browse the repository at this point in the history
  3. Fix FSEntry path encoding issue

    In Py2, ``FSEntry.path`` uses binary. This commit updates the constructor so
    it uses the `utf-8` encoder when we're encoding text type. The default encoder
    is `ascii` which is problematic.
    
    In Py3, ``FSEntry.path`` is using Unicode and that needs to be fixed.
    sevein committed Mar 8, 2019
    Configuration menu
    Copy the full SHA
    329c8e1 View commit details
    Browse the repository at this point in the history
  4. Fix flake8 F632 warnings

    sevein committed Mar 8, 2019
    Configuration menu
    Copy the full SHA
    eac45cf View commit details
    Browse the repository at this point in the history
  5. Bump version to 0.3.0

    sevein committed Mar 8, 2019
    Configuration menu
    Copy the full SHA
    f7ff13f View commit details
    Browse the repository at this point in the history
  6. Use utf-8 codec in premisrw

    This avoids the following issue:
    
        > return ' '.join([str(x) for x in texts])
                           ^^^^^^
        E UnicodeEncodeError: 'ascii' codec can't encode characters in
          position 42-44: ordinal not in range(128)
    
    Only fixed in Py2. Py3 codepath will need more work overall.
    sevein committed Mar 8, 2019
    Configuration menu
    Copy the full SHA
    f06008d View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    26c6635 View commit details
    Browse the repository at this point in the history