SpectData/MONAH

MONAH: Multi-Modal Narratives for Humans

Problem

Analyzing conversations in video format (visual + audio + text) requires substantial human expertise, and end-to-end deep learning methods are less interpretable.

Solution

Inspired by how the linguistics community analyzes conversations using the Jefferson transcription system, MONAH creates a multi-modal text narrative for dyadic (two-person) video-recorded conversations by weaving what is being said with *how* it is being said.
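
The weaving idea can be sketched in a few lines. The data structures below (utterance tuples and a `cues` dict) and the `weave` function are illustrative assumptions for this README, not MONAH's actual API; the real pipeline derives the cues from the audio and video streams.

```python
def weave(utterances, cues):
    """Weave prosodic cues into a verbatim transcript.

    utterances: list of (speaker, text) pairs from the transcript.
    cues: dict mapping utterance index -> how it was said (e.g. "loudly").
    """
    lines = []
    for i, (speaker, text) in enumerate(utterances):
        # Insert the "how" annotation, when one exists, before the verb.
        how = f" {cues[i]}" if i in cues else ""
        lines.append(f"{speaker}{how} says: {text}")
    return "\n".join(lines)


# Example: one annotated and one plain utterance.
narrative = weave([("Alice", "hello"), ("Bob", "hi")], {0: "loudly"})
```

The output reads as a plain-text narrative ("Alice loudly says: hello"), which is what makes the representation human-interpretable.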

ScreenCast

To add later

Required Inputs

Two videos, one per speaker. Works best when the camera faces the speaker directly rather than filming from an angle. A verbatim transcript from YouTube.

User Interface

A text-menu-based interface for easy configuration.

Supported modalities in the narratives

Output - MONAH Narrative

To add later

Dependencies (Technology Stack)

To add as we build this repo up.

Fine Narratives

Actions

Prosody

Coarse Narratives

Demographics

Semantics

  • Sentiment
  • Questions
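
The two semantic features above can be sketched minimally as follows. The word lists, thresholds, and function names are hypothetical placeholders for illustration, not MONAH's actual sentiment or question models.

```python
# Toy lexicons; a real system would use a trained sentiment model.
POSITIVE = {"good", "great", "happy"}
NEGATIVE = {"bad", "sad", "terrible"}


def sentiment(utterance):
    """Crude lexicon lookup: count positive minus negative words."""
    words = utterance.lower().split()
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def is_question(utterance):
    """Surface heuristic: treat utterances ending in '?' as questions."""
    return utterance.strip().endswith("?")
```

Each utterance's sentiment and question tags can then be woven into the coarse narrative alongside the transcript text.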

Mimicry

  • Dynamic Time Warping
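
Dynamic Time Warping aligns two time series that may unfold at different speeds, which is one way to quantify mimicry between the two speakers' signals. A minimal pure-Python sketch (the `dtw_distance` name and the absolute-difference cost are assumptions for this example):

```python
def dtw_distance(a, b):
    """Dynamic Time Warping distance between two 1-D sequences."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = DTW distance between a[:i] and b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])  # local cost of pairing a[i-1], b[j-1]
            cost[i][j] = d + min(cost[i - 1][j],      # insertion
                                 cost[i][j - 1],      # deletion
                                 cost[i - 1][j - 1])  # match
    return cost[n][m]
```

A low distance between, say, the two speakers' loudness contours suggests prosodic mimicry; in practice a library such as `dtaidistance` would replace this O(nm) reference implementation.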

Contributions

MONAH is meant to be a modular system that makes additions simple. Joshua to add an architectural diagram.

Pipeline (Intermediate Artifacts)

To add later

Continuous Integration

  • Joshua to add PyLint Python style tests
  • Joshua to add compulsory unit tests

Citation

If you find MONAH useful in any of your publications, we ask that you cite the following:

Features introduced in Paper 1 are in white; features introduced in Paper 2 are in blue.

  • Paper 1 (white features) Kim, J. Y., Kim, G. Y., & Yacef, K. (2019). Detecting depression in dyadic conversations with multimodal narratives and visualizations. In Australasian Joint Conference on Artificial Intelligence (pp. 303-314). Springer, Cham.
  • Paper 2 (blue features) Kim, J. Y., Yacef, K., Kim, G., Liu, C., Calvo, R., & Taylor, S. (2021, April). MONAH: Multi-Modal Narratives for Humans to analyze conversations. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (pp. 466-479).
