Match Inbox Events Against Schedule #3340

Merged
merged 3 commits into from
Feb 8, 2022

@lkiesow lkiesow commented Jan 10, 2022

Capture agents often record to filenames containing metadata which
allows us to identify an event by looking at a specific time in the
schedule of a certain location (capture agent).

If the `created` and `spatial` metadata can be identified, this patch
looks up the schedule and, if there is a unique match, uses all
available event data from there.

Data taken from the schedule, overwriting inbox data, are:

  • All metadata
  • Workflow definition
  • Workflow configuration
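
The lookup described above could be sketched roughly as follows. This is not the code from the patch: the function names, the dictionary layout of an event, and the sample workflow values are all assumptions made for illustration. It only shows the idea of accepting a schedule entry when exactly one event at the given location covers the creation time, and then letting the scheduled data overwrite the inbox data.

```python
from datetime import datetime


def find_unique_match(schedule, spatial, created):
    """Return the single scheduled event covering `created` at `spatial`, else None."""
    candidates = [
        event for event in schedule
        if event["agent"] == spatial and event["start"] <= created <= event["end"]
    ]
    # Zero or several candidates are ambiguous, so the inbox data is kept as is.
    return candidates[0] if len(candidates) == 1 else None


def apply_schedule(inbox_event, scheduled):
    """Overwrite inbox data with metadata and workflow settings from the schedule."""
    if scheduled is None:
        return dict(inbox_event)
    return {
        **inbox_event,
        # Assumption: inbox fields missing from the schedule are left untouched.
        "metadata": {**inbox_event["metadata"], **scheduled["metadata"]},
        "workflow_definition": scheduled["workflow_definition"],
        "workflow_configuration": scheduled["workflow_configuration"],
    }


# Illustrative sample data (hypothetical values).
schedule = [{
    "agent": "hs1",
    "start": datetime(2020, 2, 7, 10, 0),
    "end": datetime(2020, 2, 7, 12, 0),
    "metadata": {"title": "Introduction to Biology", "creator": "Jane Doe"},
    "workflow_definition": "schedule-and-upload",
    "workflow_configuration": {"straightToPublishing": "true"},
}]
inbox_event = {
    "metadata": {"title": "2020-02-07-hs1-recording", "spatial": "hs1"},
    "workflow_definition": "default",
    "workflow_configuration": {},
}
match = find_unique_match(schedule, "hs1", datetime(2020, 2, 7, 10, 30))
print(apply_schedule(inbox_event, match)["metadata"]["title"])
```

Returning `None` for anything but exactly one candidate mirrors the "unique match" condition: an ambiguous lookup leaves the inbox data unchanged.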

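Notes:

- This patch fixes opencast#3240.
- This is based on opencast#3327.
- You want opencast#3339 for this.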

Addition: Use ffprobe to get additional metadata

In addition to the metadata extraction from the file name, this patch
allows users to have ffprobe run on the media files to get some
additional metadata from the media container.

The extracted metadata are creation time and duration, which may help to
match scheduled events.
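
As a rough illustration of what such a probe might look like outside of Opencast, the sketch below calls `ffprobe` with JSON output and reads the container's `duration` and `creation_time` tag. The helper names are hypothetical, and whether a `creation_time` tag is present depends on the device that wrote the file, so both values are treated as optional.

```python
import json
import subprocess
from datetime import datetime


def run_ffprobe(path):
    """Return ffprobe's container-level information as a JSON string."""
    result = subprocess.run(
        ["ffprobe", "-v", "error", "-print_format", "json", "-show_format", path],
        capture_output=True, text=True, check=True,
    )
    return result.stdout


def parse_probe(output):
    """Extract duration (seconds) and creation time from ffprobe JSON output."""
    fmt = json.loads(output).get("format", {})
    duration = fmt.get("duration")
    created = fmt.get("tags", {}).get("creation_time")
    return {
        "duration": float(duration) if duration is not None else None,
        # ffprobe reports e.g. "2020-02-07T10:15:00.000000Z"; normalize the "Z" suffix.
        "created": datetime.fromisoformat(created.replace("Z", "+00:00")) if created else None,
    }


# Hand-written sample output, so the parsing can be tried without a media file.
sample = '{"format": {"duration": "5400.000000", "tags": {"creation_time": "2020-02-07T10:15:00.000000Z"}}}'
print(parse_probe(sample))
```

For a real file, `parse_probe(run_ffprobe("recording.mp4"))` would combine both steps, assuming `ffprobe` is on the `PATH`.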

Since the media file now has a duration, there is also an additional
option for filtering out events that barely conflict (imagine two
back-to-back recordings where the second one starts a minute early).
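
One way to think about this filter is as a minimum overlap between the recording and each scheduled event. The sketch below is only an illustration under that assumption: the `min_overlap` parameter, its two-minute default, and the event layout are hypothetical and are not the option introduced by the patch.

```python
from datetime import datetime, timedelta


def overlap(start_a, end_a, start_b, end_b):
    """Length of the intersection of two time intervals (zero if disjoint)."""
    return max(min(end_a, end_b) - max(start_a, start_b), timedelta(0))


def significant_matches(schedule, created, duration, min_overlap=timedelta(minutes=2)):
    """Keep only events that overlap the recording by at least `min_overlap`."""
    end = created + duration
    return [
        event for event in schedule
        if overlap(created, end, event["start"], event["end"]) >= min_overlap
    ]


# Two back-to-back events; the second one starts a minute early.
schedule = [
    {"title": "Biology", "start": datetime(2020, 2, 7, 10, 0), "end": datetime(2020, 2, 7, 11, 0)},
    {"title": "Chemistry", "start": datetime(2020, 2, 7, 10, 59), "end": datetime(2020, 2, 7, 12, 0)},
]
matches = significant_matches(schedule, datetime(2020, 2, 7, 10, 0), timedelta(hours=1))
print([event["title"] for event in matches])
```

Without the threshold, both events would conflict with the recording and no unique match could be found; with it, the one-minute overlap is ignored.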


@github-actions

This pull request has conflicts ☹
Please resolve those so we can review the pull request.
Thanks.

Opencast's inbox allows you to add either zipped media packages, which
may contain arbitrary metadata but are very complex, or simple files,
whose file names end up as the new event's title. It would be nice to be
able to extract a few more basic metadata fields from the filename.

File names like these are commonly generated by capture devices:

```
2020-02-07-hs1-Introduction to Biology.mp4
 |    |  |  |  |                     |
year  |  |  |  +----------+----------+
   month |  |             |
        day |           title
        location
```

To improve the situation, and to be as flexible as necessary, we would
like to allow for specifying regular expressions as matchers for a few
basic metadata.
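
To make the idea concrete, here is a minimal sketch of such a matcher for the example file name above. The pattern, the group names, and the `parse_filename` helper are assumptions for illustration only; they are not the configuration keys or expressions used by Opencast.

```python
import re
from datetime import date

# Hypothetical pattern for names such as "2020-02-07-hs1-Introduction to Biology.mp4".
PATTERN = re.compile(
    r"^(?P<year>\d{4})-(?P<month>\d{2})-(?P<day>\d{2})"  # recording date
    r"-(?P<location>[^-]+)"                               # location (capture agent)
    r"-(?P<title>.+)\.[^.]+$"                             # title, then file extension
)


def parse_filename(name):
    """Return created date, spatial and title, or None if the name does not match."""
    match = PATTERN.match(name)
    if match is None:
        return None
    try:
        created = date(int(match["year"]), int(match["month"]), int(match["day"]))
    except ValueError:
        # Digits matched, but they do not form a valid calendar date.
        return None
    return {"created": created, "spatial": match["location"], "title": match["title"]}


print(parse_filename("2020-02-07-hs1-Introduction to Biology.mp4"))
```

Files that do not follow the naming scheme simply yield `None`, so they could still fall back to using the plain file name as the title.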

This resolves opencast#3239

@gregorydlogan gregorydlogan left a comment


In general this looks ok, a couple of questions remain.

@gregorydlogan gregorydlogan merged commit a3b2a23 into opencast:r/11.x Feb 8, 2022