Hi guys,
I've been experimenting with different scene detection tools, mainly PySceneDetect and Google Video Intelligence. I noticed that what these tools call scene detection is really shot detection: they detect individual shots, not scenes. A scene, technically, is a sequence of shots defined by a setting (a place, time period, environment...), characters, events, dialogue, purpose and so on.
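To make the distinction concrete, here is a rough sketch of what I mean. It is only an illustration, not something either tool provides: the `Shot` and `Scene` classes, the `setting` label on each shot, and the `group_shots_into_scenes` helper are all made up by me, and the timestamps are invented sample values. It assumes each shot already carries a setting label, which is exactly the part I don't know how to get.

```python
from dataclasses import dataclass
from itertools import groupby
from typing import List


@dataclass
class Shot:
    start: float   # seconds
    end: float     # seconds
    setting: str   # hypothetical label; the tools above do not give me this


@dataclass
class Scene:
    shots: List[Shot]

    @property
    def start(self) -> float:
        return self.shots[0].start

    @property
    def end(self) -> float:
        return self.shots[-1].end


def group_shots_into_scenes(shots: List[Shot]) -> List[Scene]:
    """Merge runs of consecutive shots that share a setting into one scene."""
    # groupby only merges adjacent items with equal keys, so returning to an
    # earlier setting later in the video starts a new scene.
    return [Scene(list(run)) for _, run in groupby(shots, key=lambda s: s.setting)]


# Invented sample data standing in for the output of a shot detector.
shots = [
    Shot(0.0, 4.2, "kitchen"),
    Shot(4.2, 9.0, "kitchen"),
    Shot(9.0, 15.5, "street"),
    Shot(15.5, 18.0, "kitchen"),
]

scenes = group_shots_into_scenes(shots)
for i, scene in enumerate(scenes, start=1):
    print(f"Scene {i}: {scene.start:.1f}s to {scene.end:.1f}s, {len(scene.shots)} shot(s)")
```

So the tools give me the list of shots, and what I am after is the grouping step. A single setting label is of course a big simplification of what defines a scene.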
I would like to detect entire scenes. Is this possible? If so, how can this be done?