Skip to content

A service utilizing Amazon Transcribe, Amazon Comprehend and S3 in Amazon Web Services (AWS) to help users convert audio to text report with each speaker's name.

Notifications You must be signed in to change notification settings

mre500/jiang-jung-dian

Repository files navigation

Jiang-Jung-Dian (講重點)

It offers users a convenience way to convert recording audios to transcript with identified speakers. The converting part is done by utilizing Amazon Transcribe and Amazon Comprehend, while the identifying part is achieved by utilizing VGGVox speaker identification model. Here is the link of a simple deom vedio for AWS hackathon. It is demonstrated by using 2016 America presidential debate on youtube.

Prerequisites

  • Python >= 3.7
  • R >= 3.6.1

Setup

  • $ pip install -r requirements.txt to install all dependencies wirtten in python.
  • To install all dependencies wirtten in R:
    • $ install.packages("data.table")
    • $ install.packages("dplyr")
    • $ install.packages("shiny")
    • $ install.packages("DT")
    • $ install.packages("shinydashboard")
    • $ install.packages("stringr")

Execute

  1. First, run $ python ui.py, a GUI will pop up. \

  2. Then do enrollment:

    • Enter "speaker's name" in 使用者名稱
    • Click 開始錄音 to start enroll the speaker's voice.
    • Click 結束錄音 if one finishes recording.
    • Iterate over the first three processes if there are multiple speakers.
    • After all speakers are enrolled, click 開始辨識.
  3. Third, start recoding by clicking 會議錄音, and finish recording by clicking 結束會議.

  4. The result can be seen by cicking the generated shiny.bat file.

Processes behind the scene

Result

Contributors

Acknowledgments

Thanks Hack For Good hold by AWS for providing the AWS Services, and the open source of VGGVox. Special thanks to Chen, Stuart, the Solutions Architect at Amazon Web Services (AWS), for technical supports.

About

A service utilizing Amazon Transcribe, Amazon Comprehend and S3 in Amazon Web Services (AWS) to help users convert audio to text report with each speaker's name.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •