Can't download US Senate committee hearings #13399
Comments
|
A bit tangentially, when I first ran this I got a redirect that didn't seem to make much sense to me:
but then when I reran with Oh wait, here we go:
bizarre! Anyhow, as a workaround, it seems that the
But changing
I don't know the semantic meaning behind this type and if the extractor should just stop special-casing the arch type, or maybe try to validate it and fall back to the hdcore/M3U8 type automatically. |
|
@remitamine, I noticed you added the geo-restricted label. What led you to conclude that senate.gov is geo-restricted? |
|
@johnhawkinson thank you for looking into this. |
|
i'm getting this:
adding |
|
@remitamine
How is it when you try to access the video normally (through a browser)?
|
The message that I posted is what I get in the browser when trying to access any of the US Senate URLs posted in this issue. |
|
Committee hearings hosted on senate.gov generally fall into two categories:
Videos in the second category are hosted on a different CDN from the first. The
Because the parameter is only a hint, "Live" content is served via Adobe HDS (and does not work on non-Flash devices), while "Archived" content appears to be available as a normal HTTP download.
FWIW, all Senate committees are effectively independent, and manage their own websites. The only thing that is shared is the
I do not believe that the Senate employs geo-blocking (but, given the current leadership, who the heck knows). |
|
I'd like to add that it's not only the committee hearings, but the actual Senate feed itself that cannot be downloaded: Example URL: Footage from the House (http://houselive.gov/) also can't be downloaded via youtube-dl, but it's less of an issue because they provide a download link right in the video player. The Senate does not. |
|
still an issue...
|
|
Using https://floor.senate.gov/videos/3125/player instead of https://floor.senate.gov/MediaPlayer.php?view_id=2&clip_id=3125 worked for me with the generic extractor. |
|
Thanks, @raleeper! That url works for me, as well. |
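For anyone scripting this workaround, here is a minimal sketch of the URL rewrite described above. It assumes the `clip_id` query parameter maps directly onto the `/videos/<clip_id>/player` path; that has only been confirmed in this thread for clip 3125 on floor.senate.gov. The function name `to_player_url` is made up for illustration and is not part of youtube-dl.

```python
from urllib.parse import urlparse, parse_qs


def to_player_url(url):
    """Rewrite a MediaPlayer.php URL into the /videos/<clip_id>/player form.

    Assumption: clip_id maps 1:1 onto the player path. URLs that do not
    look like MediaPlayer.php links are returned unchanged.
    """
    parts = urlparse(url)
    if not parts.path.endswith("/MediaPlayer.php"):
        return url
    clip_ids = parse_qs(parts.query).get("clip_id")
    if not clip_ids:
        return url
    return f"{parts.scheme}://{parts.netloc}/videos/{clip_ids[0]}/player"


if __name__ == "__main__":
    print(to_player_url(
        "https://floor.senate.gov/MediaPlayer.php?view_id=2&clip_id=3125"
    ))
```

The rewritten URL can then be passed to youtube-dl as usual, where the generic extractor may pick it up.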
|
The rendered page for URLs like |
|
I've updated PR #22181, making the URL regexes more general. This should make it easier to copy and paste the code to create additional extractors for other organizations using granicus.com. Maybe someone who's more familiar with youtube-dl code could make a still more general granicus mp4 extractor? |
|
The current version of youtube-dl is still unable to download the video at URLs like https://floor.senate.gov/MediaPlayer.php?view_id=2&clip_id=3125
|
What is the purpose of your issue?
If the purpose of this issue is a site support request please provide all kinds of example URLs support for which should be included (replace following example URLs by yours):
Description of your issue, suggested solution and other information
The senate.gov site has many "subcategories", better known as committees. The list of committees is at https://www.senate.gov/committees/committees_home.htm.
For example,
veterans.senate.gov
indian.senate.gov
Each committee's hearings are available on /hearings, for example, veterans.senate.gov/hearings.
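
To illustrate the URL layout described above, here is a small sketch that builds the hearings page URL from a committee hostname. The helper name `hearings_url` is hypothetical, and the `https` scheme is an assumption; only the `/hearings` path comes from the description.

```python
def hearings_url(committee_host):
    """Return the hearings page URL for a committee host.

    Accepts a bare hostname ("veterans.senate.gov") or a full URL.
    Assumption: every committee site serves its hearings under /hearings.
    """
    host = committee_host.strip()
    if "://" in host:
        host = host.split("://", 1)[1]  # drop any scheme
    host = host.split("/", 1)[0]        # drop any path
    return f"https://{host}/hearings"


if __name__ == "__main__":
    for name in ("veterans.senate.gov", "indian.senate.gov"):
        print(hearings_url(name))
```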