This repository has been archived by the owner on Jan 13, 2022. It is now read-only.
[Infrastructure] Fix Smithsonian NMNH related discrepancies #470
Comments
7 tasks
Numbers of missing metadata looks as follows after this implementation The numbers and percentages of missing creators:-
The numbers and percentages of missing descriptions in the meta data field:-
|
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
Current Situation
As explained in ticket #397 the NMNH (National museum national history) images from the Smithsonian API have discrepancies with regard to certain fields. The creator field is missing for all of the NMNH records and the description is missing for 99.6% of it.
Suggested Improvement
With the initial research conducted, it was observed that the creator field may be populated with the value corresponding to the
freetext -> name -> Collector
field in the JSON response. Further discussion is necessary to determine whether this is the appropriate field. The description can be taken from thefreetext -> notes -> Notes
field for some of the images in NMNH.Benefit
Making this change would improve the completeness of NMNH related data in Smithsonian. More than 95% of the Smithsonian data comes from NMNH and it is important to improve its completeness as much as possible.
Additional context
This issue is related to #397
The text was updated successfully, but these errors were encountered: