-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OCR results in analysis results #1
Comments
yes, this is true; OCR is not included in the normal analysis; DanielS did extra OCR analysis for the UMONS scenario, but as far as I remember from the Potsdam meeting in January, he considers OCR too problematic for being integrated into the normal process |
OK. Text visible in a frame could of course be used by an editor to determine the label of an annotation. |
DanielS, e-mail on 24 July 2014:
Benefits of OCR is mainly:
|
Since RBBs curation of its video material [1] seems to indicate OCR could provide useful labels to offer to editors for annotation of persons in the news item would it be feasible (for testing) to add OCR into the analysis of the current scenario video [2]? [1] http://www.linkedtv.eu/wiki/index.php/File:Manual_curation_rbb_aktuell.xlsx |
I can run video OCR on the file offline if this is needed. A fully functional integration into the REST service and the such is not feasible(*). (*) for the time being and with my current time resources. |
Regarding the broken XML output of the OCR tool of @Daniel-Stein, I would like to remind the work of UMons in fixing it, see:
|
fyi, I started processing of the video you mentioned. I can fix the xml beforehand using the scripts I made for mediaeval. ETA tomorrow. Note though that we have not established any porting into EXMARaLDA, so indeed considering the OCR2SRT tool might be an option (I have not worked with it so far). |
file processed, fixed and uploaded. You can find the xml in the ftp, at /Data/RBB/OCR |
Thanks @Daniel-Stein ! Who is doing what now? This is not part of the EXB format, so this is not part of the workflow. We might want to run OCR2SRT for this specific file, and then NERD-ifying it to see what it gives. I believe the WP1 and WP5 leaders should step in to coordinate the work if there is something to do. |
|
Nothing has happened since and clearly it may not be possible to rely on @Daniel-Stein so how do we proceed on OCR in LinkedTV: |
In the TKK video chapter, Nelleke van der Krogt is not detected but her name is visible on screen - seems that OCR results are not included in the current analysis results?
The text was updated successfully, but these errors were encountered: