Making Short-Form Videos Accessible with Hierarchical Video Summaries #617

Open
Masa630 opened this issue Jun 13, 2024 · 0 comments
Labels
Captioning (research on assisting users by describing images in language), CHI (ACM CHI Conference on Human Factors in Computing Systems), Low Vision (papers proposing assistive systems for people with low vision)

Comments

Collaborator

Masa630 commented Jun 13, 2024

Links

Abstract

Short videos on platforms such as TikTok, Instagram Reels, and YouTube Shorts (i.e. short-form videos) have become a primary source of information and entertainment. Many short-form videos are inaccessible to blind and low vision (BLV) viewers due to their rapid visual changes, on-screen text, and music or meme-audio overlays. In our formative study, 7 BLV viewers who regularly watched short-form videos reported frequently skipping such inaccessible content. We present ShortScribe, a system that provides hierarchical visual summaries of short-form videos at three levels of detail to support BLV viewers in selecting and understanding short-form videos. ShortScribe allows BLV users to navigate between video descriptions based on their level of interest. To evaluate ShortScribe, we assessed description accuracy and conducted a user study with 10 BLV participants comparing ShortScribe to a baseline interface. When using ShortScribe, participants reported higher comprehension and provided more accurate summaries of video content.

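ShortScribe is a system that makes short videos, such as those on TikTok, more accessible to blind and low vision (BLV) viewers.
・Generates a three-level hierarchy of summaries (short descriptions / long descriptions / shot-by-shot descriptions) from key frames and OCR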
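
To make the three-level hierarchy concrete, here is a minimal sketch of how such summaries could be stored and navigated. It is not ShortScribe's actual implementation: the class names `Shot` and `HierarchicalSummary`, the field names, and the `describe` method are all hypothetical, and the example text is invented for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Shot:
    """One shot of the video (hypothetical structure, not from the paper)."""
    start_sec: float
    end_sec: float
    description: str           # visual description derived from a key frame
    on_screen_text: str = ""   # text recovered by OCR, if any


@dataclass
class HierarchicalSummary:
    """Three levels of detail: short, long, and shot-by-shot."""
    short: str
    long: str
    shots: List[Shot] = field(default_factory=list)

    def describe(self, level: int) -> str:
        """Return the description at a level (0=short, 1=long, 2=shot-by-shot).

        Out-of-range levels are clamped to the nearest valid level.
        """
        level = max(0, min(level, 2))
        if level == 0:
            return self.short
        if level == 1:
            return self.long
        lines = []
        for i, shot in enumerate(self.shots, start=1):
            line = (
                f"Shot {i} ({shot.start_sec:.1f}-{shot.end_sec:.1f}s): "
                f"{shot.description}"
            )
            if shot.on_screen_text:
                line += f' Text on screen: "{shot.on_screen_text}"'
            lines.append(line)
        return "\n".join(lines)


# Invented example data, for illustration only.
summary = HierarchicalSummary(
    short="A quick ramen recipe.",
    long="A person prepares instant ramen with an egg and green onions.",
    shots=[
        Shot(0.0, 2.5, "A person holds a bowl of noodles.", "Easy 5-minute ramen"),
        Shot(2.5, 6.0, "An egg is cracked into a pot."),
    ],
)
print(summary.describe(0))
print(summary.describe(2))
```

A viewer who is only deciding whether to keep watching would request level 0, then move to level 1 or 2 if the video turns out to be of interest.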

Formative Study
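・The audio track is sometimes unrelated to the video's content
・In short videos, screen reader speech overlaps with the video, and there is no time to describe the content shown in the video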

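Summary Evaluation
・Humans wrote detailed summaries, and the proposed method was evaluated by comparing its output against the human-written summaries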

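User Study
・10 BLV participants performed a video comprehension task and a video selection task (deciding whether a video interested them)
・Participants rated ShortScribe as helpful for understanding the videos
・Participants commented that selection took less time to complete and that cognitive load was reduced
・Improving accuracy remains future work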
