If we continue to store the full transcript results, we need an optional per-part field, `suffix`, to store the punctuation that comes after that part.
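A minimal sketch of what a part with an optional `suffix` could look like. The current JSON structure is not shown here, so the field names `text`, `start`, `end` and `conf` and the helper `render_text` are assumptions for illustration only:

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Part:
    # Hypothetical field names; adjust to the real transcript structure.
    text: str
    start: float
    end: float
    conf: float
    suffix: Optional[str] = None  # punctuation that follows this part, if any


def render_text(parts: List[Part]) -> str:
    """Rebuild readable text by appending each part's suffix to its text."""
    return " ".join(p.text + (p.suffix or "") for p in parts)


parts = [
    Part("hello", 0.0, 0.4, 0.98, suffix=","),
    Part("world", 0.5, 0.9, 0.95, suffix="."),
]
print(render_text(parts))  # hello, world.
```

Keeping `suffix` optional means existing transcripts without punctuation stay valid.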
For Elasticsearch indexing we already convert this format into a text-only form for the Elasticsearch payload delimiter. The format is `foo|start,end,conf bar|start,end,conf`. This is much smaller, so we could possibly switch to this format for storage as well.
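As a rough illustration of the compact format, the following sketch encodes parts into `text|start,end,conf` tokens and decodes them again. The dict keys and both function names are hypothetical, and the sketch assumes that a part's text contains no whitespace:

```python
from typing import Dict, List


def encode_payload(parts: List[Dict]) -> str:
    """Join parts into space-separated 'text|start,end,conf' tokens."""
    return " ".join(
        f"{p['text']}|{p['start']},{p['end']},{p['conf']}" for p in parts
    )


def decode_payload(payload: str) -> List[Dict]:
    """Parse the compact form back into a list of part dicts."""
    parts = []
    for token in payload.split():
        # Split on the last '|' so a '|' inside the text does not break parsing.
        text, _, timing = token.rpartition("|")
        start, end, conf = (float(v) for v in timing.split(","))
        parts.append({"text": text, "start": start, "end": end, "conf": conf})
    return parts


parts = [
    {"text": "foo", "start": 0.0, "end": 0.4, "conf": 0.98},
    {"text": "bar", "start": 0.5, "end": 0.9, "conf": 0.87},
]
payload = encode_payload(parts)
print(payload)  # foo|0.0,0.4,0.98 bar|0.5,0.9,0.87
```

If this form were used for storage, a per-part `suffix` would need its own slot in the token, which is not covered here.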
We should also define an outer data model for the transcript to store meta information. Fields: `engine`, `model`, `modelVersion` (and `processingTime`, `averageConfidence`, `createdDate`, which can be autogenerated).
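One possible shape for such an outer model, as a sketch only. The attribute names follow the fields listed above so they serialize to JSON unchanged. The `parts` layout, the `conf` key and the example values are assumptions:

```python
from dataclasses import asdict, dataclass, field
from datetime import datetime, timezone
from typing import Dict, List, Optional


@dataclass
class Transcript:
    engine: str
    model: str
    modelVersion: str
    parts: List[Dict]  # hypothetical: each part carries at least a 'conf' value
    processingTime: Optional[float] = None  # seconds, set by the caller if known
    averageConfidence: Optional[float] = None
    createdDate: str = field(
        default_factory=lambda: datetime.now(timezone.utc).isoformat()
    )

    def __post_init__(self) -> None:
        # Autogenerate the average confidence when it was not supplied.
        if self.averageConfidence is None and self.parts:
            total = sum(p["conf"] for p in self.parts)
            self.averageConfidence = total / len(self.parts)


transcript = Transcript(
    engine="example-engine",  # placeholder values, not real engine names
    model="example-model",
    modelVersion="1.0",
    parts=[{"text": "foo", "conf": 0.5}, {"text": "bar", "conf": 1.0}],
)
print(transcript.averageConfidence)  # 0.75
```

`asdict(transcript)` yields a plain dict that can be passed to `json.dumps` for storage.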
Currently the transcript is saved as JSON with the following structure:
Missing things:
- `suffix`: an optional per-part field to store the punctuation that comes afterwards
- `engine`, `model`, `modelVersion` (and `processingTime`, `averageConfidence`, `createdDate`, which can be autogenerated)