Join GitHub today
GitHub is home to over 50 million developers working together to host and review code, manage projects, and build software together.
Sign upGitHub is where the world builds software
Millions of developers and companies build, ship, and maintain their software on GitHub — the largest and most advanced development platform in the world.
C-SPAN videos start early and end early #13030
Comments
|
Update:
I was mislead. It does indeed now reach the end of the hearing ( |
|
There's also something troubling about the way The HTML player for C-SPAN pages presents a single continuous video. Youtube-dl should therefore present the same number of videos to the user, but instead it presents a series of broken-up videos of various lengths in a playlist. Also, sometimes these videos overlap in time (e.g. 5-minute length videos that overlap by 5 seconds, or 1-hour videos that overlap by 5 minutes), meaning they are not directly amenable to concatenation, and they do not produce a seamless playback experience. It's also a bit challenging to figure out what to download if you want a limited time-window of a long set of videos. I guess The videos in prior comments above were recent and they seemed to be changing. An older example that seems less likely to change is https://www.c-span.org/video/?424989-1/deputy-associate-attorneys-general-testify-confirmation-hearing from March 2017. There youtube-dl claims there is a 6-part playlist, but the first entry appears to be a 3hr34min long clip of the entirety of the hearing. The next is a 1-hour long clip that overlaps with the start of the hearing. So presumably something has gone wrong in I'm not sure how to succinctly summarize this, and also the JSON output seems to disagree with what I actually get with respect to durations. That is:
Which seems to suggest the first entry should be 12868 (3hr34min+some seconds) which seems wright. But then the next is 736 when really it should be 3600. But here's what I actually get:
not sure what to make of that. Oh, because it's not voluminous like the actual output, maybe
|
|
it's a known issue #10662. |
|
I read 10662 to be about starting early and ending late, giving you a superset of the But here we describe Hrmm. |
i'm not sure about, youtube-dl downloads all the parts served by c-span.
this is the same problem in refrenced issue, the real content is part from what is downloaded(the overlapping happend because the video is split into multi-parts), and i did explain in #10662 (comment) why it's happening and how to fix it, however the part that needs to be enabled depend on #8851. |
Sorry, this is confusing and I just made it worse. I should have said ending early (not starting late); I've corrected the above comment. The ending early was what I originally described here:
But unfortunately CSPAN has re-encoded this segment so there is no longer a viable test case. I'm pretty sure that the problem of ending early was not visible in the HTML player, although what I saw may have been skewed by the time it took me to watch the content, so I may have seen early endign during the 21-segment encoding but not during the 7-segment. Bah, what a mess. |
|
During the airing of a live show and in the hour or two after a C-Span video has aired for the first time, YT-dl downloads the show broken up into 5 min. segments. Shortly after that, the number of segments decreases to 2-4 for a one hour program. Eventually, perhaps 6-12 hours later, the whole video is contained in one or two segments. It has been like this for a long time and is not a problem. I have successfully joined a dozen or more 5 minute segments using a tiny little free app, mp4joiner, but usually I wait for C-Span to do the joining. One show I usually watch, the Sunday Q & A program, usually downloads as two one hour segments. One of them may begin a couple of minutes before the Q & A show, while the other may include some of the show that follows the Q & A. The point is that it is easy to view an entire program even if a file contains more than just the show you want to view or less. You have to be a little flexible. I am grateful that C-Span and YT-DL make it possible to download all C-Span shows, albeit in a somewhat unorthodox way. In past years, It was not always possible to download C-Span shows. |
|
I'm not really clear what @bonacker's intent is, but:
Well, no. We should note all the limitations and then file pull requests to implement changes to correct them. Do not accept mediocrity, but strive to make the software as good as we can! Separately:
|
|
My points were more philosophical than technical, but here is one semi-technical point I made: I pointed out that based on my extensive downloading of C-Span programs via YT-DL, I know that those five minute files are a short-lived phenomenon that only exist for an hour or so after a show first airs. If one has a little patience to wait a couple of hours or, more commonly, if one is downloading an archived show from a day, a week or three years ago, one will never encounter one of those five minute files. "Do not accept mediocrity, but strive to make the software as good as we can!" I am a huge fan of YT-DL and am enormously grateful for all the coders who maintain functioning extractors, but YT-Dl is a kluge, and taking that fact into account, IMO, there is nothing mediocre about the user experience of downloading C-Span programs via YT-DL. I get to see all the C-Span videos that I want to see in their entirety and spend an insignificant amount of extra time in the process due to those tiny glitches that so disturb the above poster. I'm not really clear what @bonacker's intent is” While it is technically a bug that YT-DL does not exactly 100% mimic the behavior of the player in which C-Span wants its viewers to stream its videos, in my view there are better ways to spend one's time, including developing extractors for new sites or fixing broken ones than expending time and effort to fix an extractor that is reliably delivering 99% of its mission. Coding perfection is a nice goal but less important than overall user experience, especially in a kluge program that surprises by working at all, given the lack of “cooperation” of its “partners,” i.e. YT, C-Span etc. I realize others, like the poster to whom I am replying, disagree. |
Downloading https://www.c-span.org/video/?427577-1/sally-yates-james-clapper-testify-russian-interference-2016-election&live after-the-fact (not live), I get 21 segments, but the first segment starts too soon (not a big problem) and the final segment ends too soon (a real issue).
Per the clock in the upper-right corner of the video,
Russian Interference in 2016 Election part 21-476924_21.mp4ends at5:24 pm ET. Reviewing it in the in-browser flash player, the video ends at5:43 pm ET, so approx 19-20 minutes later.In the time since I ran that (and then watched the video at realtime speed), it seems to have been repacked into 7 pieces instead of 21.
It also has lost the(update: see below)LIVEtag and the timestamp in the upper-right corner, so it's tougher to tell at what point it cuts off. But it definitely does not reach the end.Note that this is the same content that raised #13028 about livestream downloading, but the issues are not really related.