Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CBS.com's The Late Show with Steven Colbert — Videos cannot be downloaded from specific URL #13890

Open
Wowfunhappy opened this issue Aug 11, 2017 · 0 comments

Comments

@Wowfunhappy
Copy link

@Wowfunhappy Wowfunhappy commented Aug 11, 2017

Make sure you are using the latest version: run youtube-dl --version and ensure your version is 2017.08.09. If it's not, read this FAQ entry and update. Issues with outdated version will be rejected.

  • I've verified and I assure that I'm running youtube-dl 2017.08.09

Before submitting an issue make sure you have:

  • At least skimmed through the README, most notably the FAQ and BUGS sections
  • Searched the bugtracker for similar issues including closed ones

What is the purpose of your issue?

  • Bug report (encountered problems with youtube-dl)
  • Site support request (request for adding support for a new site)
  • Feature request (request for a new functionality)
  • Question
  • Other

If the purpose of this issue is a bug report, site support request or you are not completely sure provide the full verbose output as follows:

Add the -v flag to your command line you run youtube-dl with (youtube-dl -v <your command line>), copy the whole output and insert it here. It should look similar to one below (replace it with your log inserted between triple ```):

$ youtube-dl -v http://www.cbs.com/shows/the-late-show-with-stephen-colbert/video/
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: [u'-v', u'http://www.cbs.com/shows/the-late-show-with-stephen-colbert/video/']
[debug] Encodings: locale UTF-8, fs utf-8, out UTF-8, pref UTF-8
[debug] youtube-dl version 2017.08.09
[debug] Python version 2.7.10 - Darwin-16.7.0-x86_64-i386-64bit
[debug] exe versions: ffmpeg 3.2.2-tessus, ffprobe 3.2.2, rtmpdump 2.4
[debug] Proxy map: {}
[generic] video: Requesting header
WARNING: Falling back on generic information extractor.
[generic] video: Downloading webpage
[generic] video: Extracting information
ERROR: Unsupported URL: http://www.cbs.com/shows/the-late-show-with-stephen-colbert/video/
Traceback (most recent call last):
  File "/usr/local/bin/youtube-dl/youtube_dl/extractor/generic.py", line 2077, in _real_extract
    doc = compat_etree_fromstring(webpage.encode('utf-8'))
  File "/usr/local/bin/youtube-dl/youtube_dl/compat.py", line 2539, in compat_etree_fromstring
    doc = _XML(text, parser=etree.XMLParser(target=_TreeBuilder(element_factory=_element_factory)))
  File "/usr/local/bin/youtube-dl/youtube_dl/compat.py", line 2528, in _XML
    parser.feed(text)
  File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/xml/etree/ElementTree.py", line 1642, in feed
    self._raiseerror(v)
  File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/xml/etree/ElementTree.py", line 1506, in _raiseerror
    raise err
ParseError: not well-formed (invalid token): line 5, column 92
Traceback (most recent call last):
  File "/usr/local/bin/youtube-dl/youtube_dl/YoutubeDL.py", line 776, in extract_info
    ie_result = ie.extract(url)
  File "/usr/local/bin/youtube-dl/youtube_dl/extractor/common.py", line 433, in extract
    ie_result = self._real_extract(url)
  File "/usr/local/bin/youtube-dl/youtube_dl/extractor/generic.py", line 2944, in _real_extract
    raise UnsupportedError(url)
UnsupportedError: Unsupported URL: http://www.cbs.com/shows/the-late-show-with-stephen-colbert/video/

Description of your issue, suggested solution and other information

Youtube-dl supports downloading episodes of The Late Show with Steven Colbert from CBS.com. Examples of URLs that work include:

http://www.cbs.com/shows/the-late-show-with-stephen-colbert/video/LVgwSwAyjl4ArPOEDsDH4lGhZc7VUk5f/the-late-show-8-10-2017-millie-bobby-brown-jim-jefferies-zeshan-b-/
http://www.cbs.com/shows/the-late-show-with-stephen-colbert/video/2ojGQ6xFhvbsgv1v2SLgh2FoY0XhBKu_/the-late-show-8-9-2017-robert-pattinson-david-tennant-niecy-nash-/
http://www.cbs.com/shows/the-late-show-with-stephen-colbert/video/rN3FHAjaPOwAW_QhnKup6X4T2mLXlAUY/the-late-show-8-8-2017-christoph-waltz-chris-o-dowd-sean-evans-/

However, the URL http://www.cbs.com/shows/the-late-show-with-stephen-colbert/video/ does not work in youtube-dl. When loaded into a web browser, this URL plays the most recent episode of The Late Show. As far as I can tell, this URL is a unique page rather than a redirect, and thus youtube-dl should support it.

Is there any chance this can be fixed? I want to set up a script that automatically downloads the most recent episode every day. I think youtube-dl just needs to be told to use the right extractor.

I don't think this website will work outside of the United States. If there are any international devs who need access, I'd be happy to provide a VPN—please message me privately.

Thanks so much!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Linked pull requests

Successfully merging a pull request may close this issue.

None yet
1 participant
You can’t perform that action at this time.