This repository has been archived by the owner on Dec 27, 2019. It is now read-only.

An automated TTS tool for tagged source videos; it's like a wedding of videos

yttrian/conjunx

conjunx

An automated TTS tool for tagged source videos

Made during the 24-hour UB Hacking 2019 hackathon

How it works

  1. A .cjxa file is created using a conjunx-compatible editor
  • These files are archives containing both source video files and transcripts
  2. A request is made to the render server with the .cjxa file and the desired dictation to be created
  3. The render server sends back the processed video file

.cjxt transcript file description

The first line of a transcript file is the name of the video it refers to. Each following line holds a word or phrase, a starting timestamp, and an ending timestamp, separated by commas. Timestamps are in seconds, with milliseconds after the decimal point.

For example:

Jeff Kaplan Responds to IGNs Overwatch Comments.mp4
so,4.833,5.051
a lot of,5.051,5.264
people,5.264,5.547
think,5.547,5.896
nobody,5.913,6.310
ever,6.332,6.611
reads,6.606,6.972
these,6.981,7.395
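
As an illustration of the format described above, here is a minimal Python sketch of a reader for it. The names `Cue` and `parse_cjxt` are hypothetical and not part of conjunx; the sketch assumes comma-separated fields and splits from the right so that a phrase containing a comma would still parse.

```python
from dataclasses import dataclass


@dataclass
class Cue:
    """One transcript line: a word/phrase and its time span in seconds."""
    phrase: str
    start: float
    end: float


def parse_cjxt(text):
    """Parse transcript text into (video_name, list of Cue).

    Hypothetical helper: assumes the first non-empty line is the video
    name and every later line is `phrase,start,end`.
    """
    lines = [ln for ln in text.splitlines() if ln.strip()]
    if not lines:
        raise ValueError("empty transcript")
    video_name = lines[0].strip()
    cues = []
    for ln in lines[1:]:
        # Split from the right: the last two fields are always timestamps.
        phrase, start, end = ln.rsplit(",", 2)
        cues.append(Cue(phrase.strip(), float(start), float(end)))
    return video_name, cues


sample = """Jeff Kaplan Responds to IGNs Overwatch Comments.mp4
so,4.833,5.051
a lot of,5.051,5.264
people,5.264,5.547
"""
video, cues = parse_cjxt(sample)
```

Here `video` holds the file name from the first line and `cues[1]` is the phrase "a lot of" with its start and end times as floats.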

Picking the optimal speech path

Let's say we started with sentences like:

  1. I really like apples
  2. I do not like bananas

And we wanted to say:

  • I really do not like apples

The optimal splice would be:

  • I really [1]
  • do not like [2]
  • apples [1]

Instead of possibly:

  • I [2]
  • really [1]
  • do not [2]
  • like apples [1]

The goal is to find the longest contiguous phrases possible, so the result needs as few splices as possible
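
One way to approach this is a greedy longest match: at each position in the desired sentence, take the longest run of words that appears contiguously in any source, then continue after it. The Python sketch below is an illustration under stated assumptions, not necessarily how conjunx implements it. The function names are hypothetical, sources are treated as plain word lists, and matching is case-sensitive.

```python
def longest_match(target, start, sources):
    """Find the longest run of target[start:] that occurs contiguously
    in any source. Returns (length, source_index); length is 0 if the
    word at `start` appears nowhere."""
    best_len, best_src = 0, None
    for src_idx, words in enumerate(sources):
        for offset in range(len(words)):
            n = 0
            while (start + n < len(target)
                   and offset + n < len(words)
                   and words[offset + n] == target[start + n]):
                n += 1
            # Strictly greater: ties keep the earliest source.
            if n > best_len:
                best_len, best_src = n, src_idx
    return best_len, best_src


def plan_splices(target, sources):
    """Split `target` into (phrase, 1-based source number) pieces,
    always taking the longest available phrase next."""
    plan = []
    pos = 0
    while pos < len(target):
        length, src_idx = longest_match(target, pos, sources)
        if length == 0:
            raise ValueError(f"word not found in any source: {target[pos]!r}")
        plan.append((" ".join(target[pos:pos + length]), src_idx + 1))
        pos += length
    return plan


sources = [
    "I really like apples".split(),
    "I do not like bananas".split(),
]
plan = plan_splices("I really do not like apples".split(), sources)
# plan == [("I really", 1), ("do not like", 2), ("apples", 1)]
```

Because any shorter piece of a contiguous phrase is itself available from the same source, always taking the longest match does not cost extra splices later. A real implementation would also need to carry the timestamps of each matched phrase so the video can be cut.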
