[DISCOVERY] Unable to download .scc subtitles! #19920
Comments
|
If it's anything like How The Universe Works (also affected by this, as Science Channel is owned by Discovery), there should also be a Closed Captions stream inside the video stream (I mean that literally; EIA-608 subtitles packed into the H.264 frames as a substream). You can extract them with FFmpeg:
If you absolutely need Scenarist subtitles, there's probably something out there that can convert SRT to .scc. |
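For reference, a minimal sketch of that kind of extraction, written as a small Python wrapper so the pieces are easy to see. It assumes the lavfi `movie` source with its `subcc` output pad (the filter named later in this thread); the function name and file names are hypothetical, and this is not necessarily the exact command from the comment above.

```python
import shlex
from typing import List


def build_subcc_command(video: str, subs_out: str) -> List[str]:
    """Build an FFmpeg command that dumps embedded EIA-608 captions to a file.

    Hypothetical helper for illustration only.
    """
    # The lavfi "movie" source can expose the captions carried inside the
    # video frames as an extra output pad named "subcc".
    # Assumption: `video` has no characters that need filtergraph escaping
    # (colons, backslashes, quotes, brackets).
    graph = f"movie={video}[out0+subcc]"
    # "-map 0:s" keeps only the subtitle stream produced by that pad.
    return ["ffmpeg", "-f", "lavfi", "-i", graph, "-map", "0:s", subs_out]


if __name__ == "__main__":
    cmd = build_subcc_command("episode.mp4", "episode.srt")
    # Print the command for inspection; pass `cmd` to subprocess.run()
    # to actually invoke FFmpeg.
    print(shlex.join(cmd))
```

The output format follows the extension of the output file name, so `.srt` here is just one choice.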
|
@Nii-90 Thank you very much, Nii, for the heads-up. Actually, to process it via FFmpeg it has to be an MPEG-TS file... But I found this software. Thanks again, I didn't know this detail! |
|
FFmpeg doesn't require it to be in MPEG-TS. At one time, yes, but not anymore (or the handling of this particular edge case has gotten better). Whenever I grab HTUW, it's definitely in ISO Base Media/MP4, but the EIA-608 subs are correctly extracted with the command above. Before, you could still do it, but the timings would be all wrong; not anymore, though. On other shows (Miracle Workers off of TBS, for example), ccextractor works correctly, but not for shows where I've had to reconstruct the chapter layout because ytdl doesn't preserve it (stuff from FOX and HTUW - presumably other Science Channel/Discovery shows too, because this is a basic problem with Uplynk-handled streams). On those files, the 'Closed Captions' indicator no longer appears in the h264 stream list, but it is still there and FFmpeg is required to handle it at that point because ccextractor errors out. |
|
To illustrate:
ccextractor didn't extract anything, probably because it didn't recognize one of the atoms that FFmpeg wrote. Meanwhile, FFmpeg on the same file:
I stopped it early, since the lavfi subcc filter is slow. It properly detects the file as MP4, and it extracts the subtitles. |
|
Huh, that's interesting! But I'm actually able to extract subs from HTUW correctly with ccextractor. I just ran the command and it extracted them flawlessly and converted them to .srt. Look:
But in this case I used the parameters that were recommended in the GUI. And that is the difference: you're using different parameters. Why? I played the episode and the characters are fine, and the sync seems fine as well. The most impressive part is the processing time: just 19 seconds, which is amazing! In practice I'm running a .bat script. I just didn't understand the chapters part you mentioned, though. What do you mean? Do the episodes have chapters, and why would you use them? |
|
The reason it succeeded in your example with HTUW is because you just told youtube-dl to download it, with hls-native (because I know the --hls-prefer-ffmpeg option doesn't like to operate well on Uplynk), plain as anything else. That's the exact same reason I gave for why Miracle Workers worked as well, although TBS doesn't use Uplynk and the chapter information is preserved in the file youtube-dl does download when you use the --add-metadata option. TTML output and no-BOM UTF-8 is just a personal choice - any format errors out the same way, because there's something about the concatenated file ccextractor doesn't like (my money's on that gmin box error). Yes, the videos do have chapters, as evidenced when you watch them through the online players (they exist at the spots where ads would be inserted, and these boundaries exist in the *.m3u8 files as well, which is how I can recover them). Partially it's for completeness that I want to keep them. And while I could certainly just download it twice, then throw away the individual segments, I at least tried to streamline the process so I used the minimum amount of bandwidth, leading to the problem that the stitched-together file can only have its subs extracted by FFmpeg. |
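The chapter recovery described above can be sketched roughly as follows. This assumes the ad-break boundaries show up as `#EXT-X-DISCONTINUITY` tags in the media playlist and that the playlist lists only the program segments; neither is confirmed in this thread, and the function names are hypothetical. The output uses FFmpeg's FFMETADATA chapter format.

```python
from typing import List, Tuple


def chapter_starts(playlist_text: str) -> Tuple[List[float], float]:
    """Return (chapter start times in seconds, total duration in seconds).

    Assumption: each #EXT-X-DISCONTINUITY tag marks a chapter boundary.
    """
    starts = [0.0]
    elapsed = 0.0
    for raw in playlist_text.splitlines():
        line = raw.strip()
        if line.startswith("#EXTINF:"):
            # "#EXTINF:<duration>,<optional title>"
            elapsed += float(line[len("#EXTINF:"):].split(",")[0])
        elif line == "#EXT-X-DISCONTINUITY" and elapsed > starts[-1]:
            # Skip boundaries that would create a zero-length chapter.
            starts.append(elapsed)
    return starts, elapsed


def to_ffmetadata(starts: List[float], total: float) -> str:
    """Render chapter boundaries as an FFMETADATA1 document (millisecond timebase)."""
    lines = [";FFMETADATA1"]
    ends = starts[1:] + [total]
    for number, (start, end) in enumerate(zip(starts, ends), 1):
        lines += [
            "[CHAPTER]",
            "TIMEBASE=1/1000",
            f"START={round(start * 1000)}",
            f"END={round(end * 1000)}",
            f"title=Chapter {number}",  # placeholder title
        ]
    return "\n".join(lines) + "\n"
```

The resulting text can be saved to a file and given to FFmpeg as an extra input when remuxing the stitched-together video, so the chapters end up in the final file without downloading the episode twice.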
|
Oh well, now I got it! Thank you for taking the time to explain, I really appreciate it. |
|
As a follow-up, through a bit of a run-around, I found out what lay at the heart of the problem with my fix for chapters not working with ccextractor: FFmpeg creates a junk track (or rather, likely redundant, iTunes-style chapter info) when putting chapter-laden metadata into the file, and this is what ccextractor chokes on. A bug report on FFmpeg's tracker from several years ago pointed out a solution, and now I can generate files without that junk (while retaining all the metadata, cover art/thumbnail, and chapter info), and ccextractor is fine with it. |
|
Well, I'm glad you made it! It's a shame that they didn't fix it, probably someone else is struggling with it somewhere. |
Please follow the guide below
Put an x into all the boxes [ ] relevant to your issue (like this: [x])
Make sure you are using the latest version: run youtube-dl --version and ensure your version is 2019.02.18. If it's not, read this FAQ entry and update. Issues with outdated version will be rejected.
Before submitting an issue make sure you have:
What is the purpose of your issue?
Description of your issue, suggested solution and other information
Unable to download .scc subtitles from the Discovery networks sites. Other subtitle formats seem to download fine; only the .scc ones fail to download.
Thank you very much for your time and attention.