Join GitHub today
GitHub is home to over 50 million developers working together to host and review code, manage projects, and build software together.
Sign upGitHub is where the world builds software
Millions of developers and companies build, ship, and maintain their software on GitHub — the largest and most advanced development platform in the world.
Issues with News section on vod.tvp.pl #7799
Comments
|
It's not only news programs. Some other sections are also not recognized: e.g. http://vod.tvp.pl/24153473/tegie-chlopy |
|
Another not working example: http://vod.tvp.pl/13364434/nela-mala-reporterka . |
|
FYI, here's a script that does work (save it as bookmark, open the URL above and click on the bookmark): |
|
It seems that vod.tvp.pl is broken again:
Also, "pobieracz" linked above doesn't seem to work either :( |
|
They redesigned the website. As you can see, the URLs are different now. |
|
After some digging, I found that for example for https://vod.tvp.pl/video/jak-to-dziala,kompresja-danych,29644524 I can get the video from the URL of the
youtube-dl works just fine when given the above URL directly, so apparently only extraction of that URL is broken now. Going to |
|
Still cannot download from ie: https://vod.tvp.pl/video/miasto-skarbow,odc-1-cezanne,33724038 edit: [http @ 0000016821e90980] HTTP error 502 Bad Gateway So the "solution" was to use the -F option & chose -f http-xxxx stream, which worked FINE for download `W:\Media>youtube-dl -v https://vod.tvp.pl/sess/player/video/33724038 [debug] System config: [] |
|
The solution for the "news" section on TVP is somewhat trivial. @scerazy gave an example, so I'll flesh it out here a bit.
That's the link to Teleexpress on TVP. Parsing that link into youtube-dl gives an error. However, if you take the last numerical value after the comma (,) and place that into a direct-link player, the video can be seen by youtube-dl: https://vod.tvp.pl/sess/player/video/40998112 Therefore, the REGEX needs to be fixed for TVP VOD in general to look at the last numerical string in the URL, and to paste that link to the player module on TVP. If anything, that last string of digits seems to be a UUID for site-wide TVP content.
If the REGEX is fixed for TVP VOD logic to look at the last string of digits for TVP VOD, and to append that to the /sess/player/video/* link, then I think that would solve this problem for TVP VOD site-wide. @yan12125 ...? |
|
@the-researcher that did it. Thanks for reporting your findings. Can we please have this issue re-opened, so we can track the fix? |
|
I hate flagging people, but I almost have a fix for this issue. I just have no clue how to modify the source to add the fix. @hubertbanas @remitamine @jbuchbinder Here's the REGEX that seems to ALMOST work with unit testing in Python code: https://regex101.com/r/aubLoI/4 The error is that when the REGEX runs, it will work if the URL has more than 1 comma in the string. If there is only one comma in the URL string, then the REGEX example won't parse the string correctly. Somewhat frustrating that I can't get the single comma to work, so help would be appreciated. EDIT 1: Nevermind, the single comma link has a different link structure, i.e. it has |
|
@the-researcher |
|
@hubertbanas I have no clue how to create a pull request. |
|
Opened a new issue, and referenced it here. That will probably fix it...? |
I see no issues when getting TV Shows but was wondering if we should expect tvp.py to handle news section on vod.tvp.pl such as Wiadomosci, Teleexpress, Panorama?
Wiadomosci
Teleexpress
Panorama
Working TV Show example