Could someone kindly add a function to download CC subtitle from site "www.tagesschau.de"? #21427
Comments
I've found that it downloads ...which gets it from No clue where the JSON comes from, though. It seems to load some iframe, which in turn loads that JSON.
Thanks.
@snowmfx That should work as a workaround. I'd suggest you leave this issue open until it's implemented in the extractor. (I tried to add it, but it seems the current extractor isn't even using the JSON. I'll try again once I have the time.)
It is TTML, or rather the EBU-TT-D-Basic-DE profile used by the German public broadcasters. If renamed to
For the time being, you could have a look at Xidel, an HTML/XML/JSON parser (using CSS, XPath, XQuery, JSONiq and pattern templates).
"Follow" (
If you want to convert to *.srt (SubRip), then Xidel can do this too:
The problem with SubRip here is that all the speaker colors are discarded.
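For anyone without Xidel at hand, here is a rough Python sketch of the same TTML-to-SubRip idea. It is not what youtube-dl or Xidel actually do. The function names (`ttml_to_srt`, `ttml_time_to_srt`, `paragraph_text`) are made up for illustration, and it assumes the subtitle file uses the standard TTML namespace `http://www.w3.org/ns/ttml`, that cues are `p` elements with `begin`/`end` clock times of the form `HH:MM:SS.mmm`, and that no time offset needs to be subtracted. Like any plain SubRip conversion, it drops the speaker colors.

```python
import xml.etree.ElementTree as ET

# Assumption: cues live in the standard TTML namespace.
TTML_NS = "{http://www.w3.org/ns/ttml}"


def ttml_time_to_srt(value):
    """Convert a TTML clock time 'HH:MM:SS.mmm' to SubRip 'HH:MM:SS,mmm'."""
    hms, _, frac = value.partition(".")
    millis = (frac + "000")[:3]  # pad or truncate the fraction to 3 digits
    return f"{hms},{millis}"


def paragraph_text(p):
    """Flatten a <p> element to plain text; <br/> becomes a newline.

    Styling (and therefore the speaker color) is ignored.
    """
    parts = []

    def walk(node):
        if node.tag == TTML_NS + "br":
            parts.append("\n")
        if node.text:
            parts.append(node.text)
        for child in node:
            walk(child)
            if child.tail:
                parts.append(child.tail)

    walk(p)
    lines = [line.strip() for line in "".join(parts).split("\n")]
    return "\n".join(line for line in lines if line)


def ttml_to_srt(ttml_source):
    """Return SubRip text for every timed <p> cue in a TTML document."""
    root = ET.fromstring(ttml_source)
    cues = []
    for p in root.iter(TTML_NS + "p"):
        begin, end = p.get("begin"), p.get("end")
        if begin is None or end is None:
            continue  # skip cues without explicit timing
        number = len(cues) + 1
        cues.append(
            f"{number}\n"
            f"{ttml_time_to_srt(begin)} --> {ttml_time_to_srt(end)}\n"
            f"{paragraph_text(p)}\n"
        )
    return "\n".join(cues)
```

Calling `ttml_to_srt(open("subtitle.xml", encoding="utf-8").read())` on a downloaded file (the file name is just an example) would give text that can be saved as `*.srt`.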
Then we just have to update the extraction query.
When we come across Minified query:
Minified query (Windows):

Description
Some videos on the site "www.tagesschau.de" have subtitles,
e.g. https://www.tagesschau.de/multimedia/sendung/ts-31835.html .
But I can't download them with the latest youtube-dl.
Could someone please add support for this?