Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Transcript Search and Indexing #5708

Open
elynema opened this issue Mar 5, 2024 · 0 comments
Open

Transcript Search and Indexing #5708

elynema opened this issue Mar 5, 2024 · 0 comments
Labels

Comments

@elynema
Copy link
Contributor

elynema commented Mar 5, 2024

Adding metadata and document data to Solr index, in order to provide further facets and to index for search matches within transcript documents.

  • Show "found in" hints to indicate to user if search term(s) were found in:
    • transcript (also includes captions marked to be treated as transcripts)
    • metadata
    • section labels
    • structure
  • Two new facets: Has Captions and Has Transcripts
    • Also add "Has Supplemental Files" facet?
  • Values for "Has Captions" and "Has Transcripts": "Yes" and "No"
  • Ability to treat captions like transcript files
    • collection staff mark captions as "treat as transcript" in the Manage Files page
    • captions treated as transcripts appear as transcripts in the Transcript component
  • pass search on to media item page
  • test search relevance and weights to make sure title only matches ranked reasonably compared to transcript only matches

https://docs.google.com/document/d/1lj3IpNmND5Ccs-uizPszfXrJk-Z65rHS7JO0EBmjdsI/edit

@elynema elynema added the EPIC label Mar 5, 2024
@joncameron joncameron changed the title Transcript searching Search and Indexing Mar 13, 2024
@elynema elynema changed the title Search and Indexing Transcript Search and Indexing Mar 13, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant