As a content admin, I want scholarship records exported to github so that there is a publicly accessible, versioned copy of project data available for researchers. #1100

rlskoeser · 2022-09-19T20:13:39Z

) * Rewrite source tabular export to use new exporter class ref #1100 #278 * Document methods on new SourceQuerySet class * Revise tests for admin csv export to not use mocks

kseniaryzhova · 2022-11-12T00:57:26Z

@rlskoeser so the footnote download did not have transcription content aside from the fact that transcriptions exist or not and the download was super quick (for all footnotes). Is the csv for footnotes supposed to have actual transcription content?

rlskoeser · 2022-11-14T14:51:31Z

@kseniaryzhova I should have probably given you links, just to make sure we were talking about the same things.

Were you testing the footnotes export from this page? https://test-geniza.cdh.princeton.edu/admin/footnotes/footnote/

It looks to me like it does include the text content of the transcription — happy to remove it if it's not useful / important to have here or to have in this format.

And the sources export should be tested from this page: https://test-geniza.cdh.princeton.edu/admin/footnotes/source/

kseniaryzhova · 2022-11-14T15:29:10Z

@rlskoeser I tried it again with the links you provided - sources look good, they have the URLs, etc. But I'm still not seeing the transcription content for footnotes.

rlskoeser · 2022-11-15T17:03:48Z

@kseniaryzhova so weird!

Would you try downloading the subset from this link and report back?
https://test-geniza.cdh.princeton.edu/admin/footnotes/footnote/?q=5454
select them and then use the action -> export selected to csv -> go

I'm getting None for content for the first one and transcription content for the second one. (We need to fix the None at least, but I'm still not sure if we should have the content here or not – if it continues to not work for you maybe we should drop it.)

I tried exporting all footnotes and I think it failed; it was definitely slow. So that slowness might be enough reason to drop the content from this export, since we have the content in the annotation backup repo already.

rlskoeser · 2022-11-15T19:19:43Z

discussed with @kseniaryzhova and finally figured out what's going on here:

Excel limits rows to one line by default, so it's hard to see if there's any content (if anything you probably only see an english language label like recto or verso)
Excel is not reading as UTF-8, so transcription content is garbage

changes needed for this to be acceptable:
— should not display None when there is no content
— should have byte order mark so that Excel will automatically read as unicode

kseniaryzhova · 2022-11-17T20:56:28Z

@rlskoeser works, closing!

rlskoeser mentioned this issue Sep 19, 2022

data exports #1098

Open

rlskoeser added this to the CDH/PGP end of grant year 2 milestone Sep 19, 2022

rlskoeser modified the milestones: Transcription migration + transcription editor, CDH/PGP end of grant year 2 Oct 4, 2022

rlskoeser self-assigned this Nov 8, 2022

rlskoeser added a commit that referenced this issue Nov 8, 2022

Rewrite source tabular export to use new exporter class

ca54449

ref #1100 #278

rlskoeser mentioned this issue Nov 8, 2022

Rewrite footnotes source tabular export to use new exporter class #1222

Merged

rlskoeser added the 🗜️ awaiting testing Implemented and ready to be tested label Nov 11, 2022

rlskoeser mentioned this issue Nov 15, 2022

Separate admin and public exports for sources and footnotes #1235

Merged

kseniaryzhova closed this as completed Nov 17, 2022

rlskoeser removed the 🗜️ awaiting testing Implemented and ready to be tested label Nov 21, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

As a content admin, I want scholarship records exported to github so that there is a publicly accessible, versioned copy of project data available for researchers. #1100

As a content admin, I want scholarship records exported to github so that there is a publicly accessible, versioned copy of project data available for researchers. #1100

rlskoeser commented Sep 19, 2022 •

edited by kseniaryzhova

kseniaryzhova commented Nov 12, 2022

rlskoeser commented Nov 14, 2022

kseniaryzhova commented Nov 14, 2022

rlskoeser commented Nov 15, 2022

rlskoeser commented Nov 15, 2022

kseniaryzhova commented Nov 17, 2022

As a content admin, I want scholarship records exported to github so that there is a publicly accessible, versioned copy of project data available for researchers. #1100

As a content admin, I want scholarship records exported to github so that there is a publicly accessible, versioned copy of project data available for researchers. #1100

Comments

rlskoeser commented Sep 19, 2022 • edited by kseniaryzhova

testing notes

retesting notes

dev notes

kseniaryzhova commented Nov 12, 2022

rlskoeser commented Nov 14, 2022

kseniaryzhova commented Nov 14, 2022

rlskoeser commented Nov 15, 2022

rlskoeser commented Nov 15, 2022

kseniaryzhova commented Nov 17, 2022

rlskoeser commented Sep 19, 2022 •

edited by kseniaryzhova