Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Non-MARC archive.org import data issues #2220

Closed
hornc opened this issue Jul 17, 2019 · 2 comments
Closed

Non-MARC archive.org import data issues #2220

hornc opened this issue Jul 17, 2019 · 2 comments
Labels
Affects: Data Issues that affect book/author metadata or user/account data. [managed] Lead: @hornc Issues overseen by Charles (Staff: Data Engineering Lead) [managed] Module: Import Issues related to the configuration or use of importbot and other bulk import systems. [managed] Priority: 3 Issues that we can consider at our leisure. [managed] Type: Bug Something isn't working. [managed]
Projects

Comments

@hornc
Copy link
Collaborator

hornc commented Jul 17, 2019

Issues with No MARC import:
an example:
https://openlibrary.org/books/OL27042435M/Self-Study_and_Evaluation_Guide_Section_D-26_Residential_Living_Program

  • Subject not split on semi-colon
  • Publisher not populated
  • Pages not populated

relates to original features #1029 , code in #1058

@hornc hornc added the Type: Bug Something isn't working. [managed] label Jul 17, 2019
@hornc hornc self-assigned this Jul 17, 2019
@hornc hornc added the Module: Import Issues related to the configuration or use of importbot and other bulk import systems. [managed] label Jul 17, 2019
@hornc hornc added this to To do in Continuous Import Pipeline via automation Jul 17, 2019
@hornc hornc added the Affects: Data Issues that affect book/author metadata or user/account data. [managed] label Aug 16, 2019
@hornc hornc moved this from To do to Backlog in Continuous Import Pipeline Oct 2, 2019
@xayhewalo xayhewalo added this to Un-Triaged in Triage Oct 20, 2019
@xayhewalo xayhewalo added Priority: 3 Issues that we can consider at our leisure. [managed] State: Backlogged labels Nov 20, 2019
@xayhewalo xayhewalo moved this from Un-Triaged to Triaged in Triage Nov 20, 2019
@xayhewalo
Copy link
Collaborator

@hornc did this get fixed when #1058 got merged?

@mekarpeles mekarpeles added the Lead: @hornc Issues overseen by Charles (Staff: Data Engineering Lead) [managed] label Dec 18, 2019
@hornc
Copy link
Collaborator Author

hornc commented Feb 28, 2020

@guyjeangilles no, this issue relates to imports from archive.org without MARC records. That PR only applies to archive.org MARC items

@hornc hornc removed this from Backlog in Continuous Import Pipeline Mar 7, 2020
@hornc hornc removed their assignment Mar 7, 2020
@hornc hornc closed this as not planned Won't fix, can't repro, duplicate, stale Aug 18, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Affects: Data Issues that affect book/author metadata or user/account data. [managed] Lead: @hornc Issues overseen by Charles (Staff: Data Engineering Lead) [managed] Module: Import Issues related to the configuration or use of importbot and other bulk import systems. [managed] Priority: 3 Issues that we can consider at our leisure. [managed] Type: Bug Something isn't working. [managed]
Projects
No open projects
Triage
  
Triaged
Development

No branches or pull requests

3 participants