Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Get the original automatic captions file of a YouTube video, not the translated ones #18195

Closed
stcoats opened this issue Nov 15, 2018 · 0 comments
Closed
Labels

Comments

@stcoats
Copy link

@stcoats stcoats commented Nov 15, 2018

I am downloading only the captions files for videos from YouTube.

Using the flag --write-auto-sub results in the automatically translated English captions being downloaded, not the original automatic captions file. Example: This video is in Russian, and the automatically-generated captions are in Russian. The following command retrieves automatically-translated English captions.

$ youtube-dl https://www.youtube.com/watch?v=dy-ZSwW7dA0 --skip-download --write-auto-sub -v -o /home/cloud-user/Downloads/captions_file.vtt
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: [u'https://www.youtube.com/watch?v=dy-ZSwW7dA0', u'--skip-download', u'--write-auto-sub', u'-v', u'-o', u'/home/cloud-user/Downloads/captions_file.vtt']
[debug] Encodings: locale UTF-8, fs UTF-8, out UTF-8, pref UTF-8
[debug] youtube-dl version 2018.11.03
[debug] Python version 2.7.5 (CPython) - Linux-3.10.0-862.14.4.el7.x86_64-x86_64-with-centos-7.5.1804-Core
[debug] exe versions: none
[debug] Proxy map: {}
[youtube] dy-ZSwW7dA0: Downloading webpage
[youtube] dy-ZSwW7dA0: Downloading video info webpage
[youtube] dy-ZSwW7dA0: Looking for automatic captions
[youtube] dy-ZSwW7dA0: Downloading MPD manifest
[youtube] dy-ZSwW7dA0: Downloading MPD manifest
[debug] Default format spec: best/bestvideo+bestaudio
[info] Writing video subtitles to: /home/cloud-user/Downloads/captions_file.en.vtt

I can't define the language of the captions I want to download because I am working with a large number of videos and can't manually check the language of each video before I get its captions.

Is there a way to get the original captions? If not, this would be a good feature.

@dstftw dstftw closed this Nov 15, 2018
@dstftw dstftw added the duplicate label Nov 15, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Linked pull requests

Successfully merging a pull request may close this issue.

None yet
2 participants
You can’t perform that action at this time.