two-heart/igf-transcript-extractor

igf-transcript-extractor

This is a quick-and-dirty Python 3 script that depends on wget and downloads all the available transcripts from the IGF 2019.

It would be better to use the Python requests library. Sadly, the website blocks requests made from a Python script, and when I tried changing the headers I ran into a few bugs. Feel free to make a pull request to make this script more inclusive.
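For readers curious about the wget approach, here is a minimal sketch of calling wget from Python with `subprocess`. It is not the code from this repo: the function names `build_wget_command` and `download` are hypothetical and made up for illustration. Only the standard wget flags `-q` (quiet) and `-O` (output file) are used.

```python
import shutil
import subprocess
from pathlib import Path


def build_wget_command(url: str, dest: Path) -> list[str]:
    """Return the wget argument list that saves `url` to `dest` (hypothetical helper)."""
    # -q silences progress output; -O writes the response body to the given file.
    return ["wget", "-q", "-O", str(dest), url]


def download(url: str, dest: Path) -> bool:
    """Download `url` with wget and return True if wget exited successfully."""
    if shutil.which("wget") is None:
        raise RuntimeError("wget is not installed or not on PATH")
    # Make sure the target directory exists before wget writes to it.
    dest.parent.mkdir(parents=True, exist_ok=True)
    result = subprocess.run(build_wget_command(url, dest), check=False)
    return result.returncode == 0
```

Calling `download(url, Path("transcripts/session.html"))` with a real transcript URL would then save that page locally. The file name here is only a placeholder.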

Also, please share any kind of text analysis or usage of the gathered transcripts. I am interested in what they can be used for.

All of the Python code in this repo is licensed under Apache 2.0.

About

Obtain plaintext transcriptions from the IGF
