Unable to extract CNN url #12972

Open
d-damien opened this issue May 2, 2017 · 5 comments
Comments

@d-damien d-damien commented May 2, 2017

Make sure you are using the latest version: run youtube-dl --version and ensure your version is 2017.05.01. If it's not, read this FAQ entry and update. Issues with outdated versions will be rejected.

  • I've verified and I assure that I'm running youtube-dl 2017.05.01

Before submitting an issue make sure you have:

  • At least skimmed through README and most notably FAQ and BUGS sections
  • Searched the bugtracker for similar issues including closed ones

What is the purpose of your issue?

  • Bug report (encountered problems with youtube-dl)
  • Site support request (request for adding support for a new site)
  • Feature request (request for a new functionality)
  • Question
  • Other

Log

youtube-dl --verbose http://edition.cnn.com/2010/WORLD/europe/12/11/brittany.language/
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: [u'--verbose', u'http://edition.cnn.com/2010/WORLD/europe/12/11/brittany.language/']
[debug] Encodings: locale UTF-8, fs UTF-8, out UTF-8, pref UTF-8
[debug] youtube-dl version 2017.05.01
[debug] Python version 2.7.6 - Linux-4.4.0-34-generic-x86_64-with-LinuxMint-17.1-rebecca
[debug] exe versions: avconv 9.20-6, avprobe 9.20-6
[debug] Proxy map: {}
[CNNArticle] brittany.language: Downloading webpage
ERROR: Unable to extract cnn url; please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; see  https://yt-dl.org/update  on how to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/YoutubeDL.py", line 760, in extract_info
    ie_result = ie.extract(url)
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/common.py", line 429, in extract
    ie_result = self._real_extract(url)
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/cnn.py", line 147, in _real_extract
    cnn_url = self._html_search_regex(r"video:\s*'([^']+)'", webpage, 'cnn url')
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/common.py", line 787, in _html_search_regex
    res = self._search_regex(pattern, string, name, default, fatal, flags, group)
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/common.py", line 778, in _search_regex
    raise RegexNotFoundError('Unable to extract %s' % _name)

Issue

CNN URLs won't work. Tested with other URLs too.

Contributor

@parmjitv parmjitv commented Jun 26, 2017

Hi d-damien,

Are you able to view this video on the CNN site in your browser? Nothing happens for me when I click on the Play button.

It seems CNN updated the generated HTML source for the edition video pages back in September 2011. Edition videos featured on pages published before that date will not be detected by the extractor, as you have noted. At any rate, these videos no longer appear to be available for viewing or download.
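To illustrate why older pages fail, here is a minimal sketch that applies the pattern shown in the traceback above (cnn.py, line 147) with plain `re.search`. The two page fragments are invented for illustration and are not taken from CNN's actual markup, and `find_cnn_url` is a hypothetical helper, not part of youtube-dl. The real extractor goes through `_html_search_regex`, which raises an error instead of returning None.

```python
import re

# Pattern copied from the traceback above (cnn.py, line 147).
CNN_URL_RE = r"video:\s*'([^']+)'"

# Hypothetical page fragments, invented for illustration only.
newer_page = "var config = { video: 'world/2012/01/01/example.cnn' };"
older_page = "<div id='player'><embed src='player.swf'></embed></div>"


def find_cnn_url(webpage):
    """Return the captured video path, or None when the pattern is absent."""
    match = re.search(CNN_URL_RE, webpage)
    return match.group(1) if match else None


print(find_cnn_url(newer_page))  # the captured path
print(find_cnn_url(older_page))  # None: nothing for the extractor to pick up
```

When the pattern is absent, as assumed for the older page layout, there is nothing to capture, which would correspond to the "Unable to extract cnn url" error in the log.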

Cheers,

Parmjit V.

Author

@d-damien d-damien commented Jul 4, 2017

As of today (sorry, I had forgotten I opened this issue), the video definitely plays in

Chromium 58.0.3029.110 Built on Ubuntu, running on LinuxMint 18.1 (64-bit)

although with a console error:

404 on http://us.cnn.com/video/data/3.0/video/international/relateds_12.xml

But it works :)


@Guiraud Guiraud commented Jul 14, 2017

Having the same issue:

youtube-dl --verbose http://money.cnn.com/mostly-human/silicon-valleys-secret/
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: ['--verbose', 'http://money.cnn.com/mostly-human/silicon-valleys-secret/']
[debug] Encodings: locale UTF-8, fs utf-8, out UTF-8, pref UTF-8
[debug] youtube-dl version 2017.05.18.1
[debug] Python version 3.5.3 - Linux-4.9.0-3-amd64-x86_64-with-debian-9.0
[debug] exe versions: ffmpeg 3.2.5-1, ffprobe 3.2.5-1, rtmpdump 2.4
[debug] Proxy map: {}
[generic] silicon-valleys-secret: Requesting header
WARNING: Falling back on generic information extractor.
[generic] silicon-valleys-secret: Downloading webpage
[generic] silicon-valleys-secret: Extracting information
ERROR: Unsupported URL: http://money.cnn.com/mostly-human/silicon-valleys-secret/
Traceback (most recent call last):
  File "/usr/lib/python3/dist-packages/youtube_dl/YoutubeDL.py", line 760, in extract_info
    ie_result = ie.extract(url)
  File "/usr/lib/python3/dist-packages/youtube_dl/extractor/common.py", line 433, in extract
    ie_result = self._real_extract(url)
  File "/usr/lib/python3/dist-packages/youtube_dl/extractor/generic.py", line 2795, in _real_extract
    raise UnsupportedError(url)
youtube_dl.utils.UnsupportedError: Unsupported URL: http://money.cnn.com/mostly-human/silicon-valleys-secret/

Contributor

@parmjitv parmjitv commented Jul 17, 2017

@d-damien - Really! Strange that the video does not play for me. I will try again using the specs you have outlined.

@Guiraud - This appears to be a separate issue with the extractor. Issue #13666 has been created to attend to this.
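One way to see that the two reports differ is to compare which extractor handled each URL. The sketch below reads shortened excerpts of the two logs posted in this thread and reports the last extractor tag and the first ERROR line. `summarize` is a hypothetical helper written for this comparison, not a youtube-dl function, and the first ERROR line is truncated here for brevity.

```python
import re

# Matches the "[name]" prefix of extractor progress lines, skipping "[debug]".
EXTRACTOR_RE = re.compile(r"^\[(?!debug\])([^\]]+)\]")


def summarize(log_text):
    """Return (last extractor tag seen, first ERROR line) from a verbose log."""
    extractor, error = None, None
    for line in log_text.splitlines():
        line = line.strip()
        match = EXTRACTOR_RE.match(line)
        if match:
            extractor = match.group(1)
        elif line.startswith("ERROR:") and error is None:
            error = line
    return extractor, error


# Shortened excerpts of the two logs in this thread.
first_log = """[debug] Proxy map: {}
[CNNArticle] brittany.language: Downloading webpage
ERROR: Unable to extract cnn url"""

second_log = """[debug] Proxy map: {}
[generic] silicon-valleys-secret: Requesting header
WARNING: Falling back on generic information extractor.
[generic] silicon-valleys-secret: Downloading webpage
ERROR: Unsupported URL: http://money.cnn.com/mostly-human/silicon-valleys-secret/"""

print(summarize(first_log))
print(summarize(second_log))
```

In the first log the URL reached the CNNArticle extractor and failed on its regex. In the second, no CNN extractor claimed the URL at all and youtube-dl fell back to the generic extractor.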

Many thanks, gentlemen!
