mooss/sininen

Sininen

Sininen's goal is to provide tools to perform natural language queries on text data. For now, it focuses on searching through subtitles extracted from YouTube channels.

Usage

Download subtitles from a channel

The download script needs the channel name. To find it starting from a video such as https://www.youtube.com/watch?v=aq4G-7v-_xI, click on the channel name (here Historia Civilis), which leads to the page https://www.youtube.com/channel/UCv_vLHiWVBh_FR9vbeuiY-A. Then click on the HOME tab, which changes the URL to https://www.youtube.com/c/HistoriaCivilis/featured. The channel name is the string after /c/, here HistoriaCivilis.
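
The "string after /c/" rule can be sketched in Go as follows. This is only an illustration: `channelName` is a hypothetical helper, not part of Sininen, and it assumes the URL has the /c/<name> shape described above.

```go
package main

import (
	"fmt"
	"net/url"
	"strings"
)

// channelName returns the path segment that follows "/c/" in a YouTube
// channel URL, or an error if the URL does not have that shape.
// Hypothetical helper for illustration; it is not part of Sininen.
func channelName(rawURL string) (string, error) {
	u, err := url.Parse(rawURL)
	if err != nil {
		return "", err
	}
	// "/c/HistoriaCivilis/featured" becomes ["c", "HistoriaCivilis", "featured"].
	segments := strings.Split(strings.Trim(u.Path, "/"), "/")
	if len(segments) < 2 || segments[0] != "c" || segments[1] == "" {
		return "", fmt.Errorf("no /c/<name> segment in %q", rawURL)
	}
	return segments[1], nil
}

func main() {
	name, err := channelName("https://www.youtube.com/c/HistoriaCivilis/featured")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(name) // prints "HistoriaCivilis"
}
```

A /channel/ URL such as the one above is rejected with an error, since it carries the channel ID and not the channel name.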

Download the subtitles for HistoriaCivilis with:

./download-channel-subtitles.sh HistoriaCivilis

Build YouTube CLI

go get
go build cli/search-yt.go

Search through channel subtitles

./search-yt HistoriaCivilis "Crossing the Rubicon"
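
To give a rough idea of what searching subtitle text for a phrase involves, here is a minimal Go sketch. It uses plain case-insensitive substring matching as a stand-in, which is not necessarily what search-yt does. The `searchLines` function and the sample lines are made up for illustration.

```go
package main

import (
	"fmt"
	"strings"
)

// searchLines returns the indices of the lines containing query,
// ignoring case. Hypothetical stand-in, not Sininen's search method.
func searchLines(lines []string, query string) []int {
	var hits []int
	needle := strings.ToLower(query)
	for i, line := range lines {
		if strings.Contains(strings.ToLower(line), needle) {
			hits = append(hits, i)
		}
	}
	return hits
}

func main() {
	// Made-up sample lines, not real subtitle data.
	lines := []string{
		"the senate met in the morning",
		"he was crossing the Rubicon with one legion",
		"the city prepared for his arrival",
	}
	for _, i := range searchLines(lines, "Crossing the Rubicon") {
		fmt.Printf("%d: %s\n", i, lines[i])
	}
}
```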

Requirements

The usage instructions above should work on a recent Linux distribution provided the following packages are installed and reasonably up-to-date:

  • Go
  • youtube-dl

Some adjustments might be needed to make it work on another OS.
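
One way to check these requirements before running the commands above is to look for the tools on the PATH. This is a minimal sketch, assuming the packages install executables named go and youtube-dl. `missingTools` is a hypothetical helper, not part of Sininen.

```go
package main

import (
	"fmt"
	"os/exec"
)

// missingTools returns the names from tools that cannot be found on PATH.
// Hypothetical helper for illustration; it is not part of Sininen.
func missingTools(tools []string) []string {
	var missing []string
	for _, tool := range tools {
		if _, err := exec.LookPath(tool); err != nil {
			missing = append(missing, tool)
		}
	}
	return missing
}

func main() {
	// Executable names are assumptions based on the requirements list.
	missing := missingTools([]string{"go", "youtube-dl"})
	if len(missing) == 0 {
		fmt.Println("all required tools found")
		return
	}
	fmt.Println("missing tools:", missing)
}
```

This only confirms that the executables exist. It does not check that they are reasonably up-to-date.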

About

Natural language search tool
