[brightcove] NYTimes.com video #2529

Closed
jordanf opened this issue Mar 9, 2014 · 5 comments

@jordanf jordanf commented Mar 9, 2014

Verbose output:

Jewford:Downloads jmf$ youtube-dl --verbose http://www.nytimes.com/video/us/100000002757071/the-men-of-atalissa.html
[debug] System config: []
[debug] User config: []
[debug] Command-line args: ['--verbose', 'http://www.nytimes.com/video/us/100000002757071/the-men-of-atalissa.html']
[debug] Encodings: locale 'UTF-8', fs 'utf-8', out 'UTF-8', pref: 'UTF-8'
[debug] youtube-dl version 2014.03.07.1
[debug] Python version 2.7.6 - Darwin-13.1.0-x86_64-i386-64bit
[debug] Proxy map: {}
[generic] the-men-of-atalissa: Requesting header
WARNING: Falling back on generic information extractor.
[generic] the-men-of-atalissa: Downloading webpage
[generic] the-men-of-atalissa: Extracting information
[generic] Brightcove video detected.
[download] Downloading playlist: The Men of Atalissa - Video - NYTimes.com
[generic] playlist The Men of Atalissa - Video - NYTimes.com: Collected 1 video ids (downloading 1 of them)
[download] Downloading video #1 of 1
[Brightcove] ref:100000002757071: Downloading webpage
[Brightcove] ref:100000002757071: Extracting information
ERROR: Unable to extract json; please report this issue on https://yt-dl.org/bug . Be sure to call youtube-dl with the --verbose flag and include its complete output. Make sure you are using the latest version; type youtube-dl -U to update.
Traceback (most recent call last):
  File "/opt/local/bin/youtube-dl/youtube_dl/YoutubeDL.py", line 500, in extract_info
    ie_result = ie.extract(url)
  File "/opt/local/bin/youtube-dl/youtube_dl/extractor/common.py", line 162, in extract
    return self._real_extract(url)
  File "/opt/local/bin/youtube-dl/youtube_dl/extractor/brightcove.py", line 170, in _real_extract
    videoPlayer[0], query_str, query, referer=referer)
  File "/opt/local/bin/youtube-dl/youtube_dl/extractor/brightcove.py", line 186, in _get_video_info
    info = self._search_regex(r'var experienceJSON = ({.*?});', webpage, 'json')
  File "/opt/local/bin/youtube-dl/youtube_dl/extractor/common.py", line 360, in _search_regex
    raise RegexNotFoundError(u'Unable to extract %s' % _name)
RegexNotFoundError: Unable to extract json; please report this issue on https://yt-dl.org/bug . Be sure to call youtube-dl with the --verbose flag and include its complete output. Make sure you are using the latest version; type youtube-dl -U to update.
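
For readers following the traceback: the failure comes from a regex search for `var experienceJSON = ...` that finds nothing in the downloaded page. The following is a minimal sketch of that failure mode, written for Python 3, not youtube-dl's actual code. Only the pattern is taken from the traceback; `search_regex`, `RegexNotFound`, and both page snippets are hypothetical names and data invented for illustration.

```python
import json
import re

# Pattern copied from the traceback above (brightcove.py, _get_video_info).
EXPERIENCE_JSON_RE = r'var experienceJSON = ({.*?});'


class RegexNotFound(Exception):
    """Hypothetical stand-in for youtube-dl's RegexNotFoundError."""


def search_regex(pattern, text, name):
    """Return group 1 of the first match, or raise if nothing matches."""
    match = re.search(pattern, text)
    if match is None:
        raise RegexNotFound('Unable to extract %s' % name)
    return match.group(1)


# Invented page snippets, for illustration only.
page_with_json = '<script>var experienceJSON = {"success": true};</script>'
page_without_json = '<script>var somethingElse = 1;</script>'

# When the variable is present, the captured group is valid JSON.
info = json.loads(search_regex(EXPERIENCE_JSON_RE, page_with_json, 'json'))
print(info)

# When the page does not define the variable, the search raises,
# which matches the shape of the error reported above.
try:
    search_regex(EXPERIENCE_JSON_RE, page_without_json, 'json')
except RegexNotFound as err:
    print('ERROR: %s' % err)
```

If this reading is right, the extractor broke because the page it fetched no longer contained the expected `experienceJSON` assignment, not because the JSON itself was malformed.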

@canavan canavan commented Mar 15, 2014

"Embedded" URLs of NYTimes videos fail as well. The embeds do use Brightcove, but the URL to the video is assembled by JavaScript.

http://graphics8.nytimes.com/bcvideo/1.0/iframe/embed.html?videoId=100000002747620&playerType=embed
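
Since the final video URL is built client-side, the only stable piece of information in an embed URL like the one above appears to be its `videoId` query parameter. As a rough sketch (Python 3, standard library only), this is how that identifier could be pulled out. The helper name `embed_video_id` is hypothetical, and how the ID is then turned into a media URL is not covered here because the thread does not say.

```python
from urllib.parse import parse_qs, urlparse


def embed_video_id(url):
    """Return the videoId query parameter of an embed URL, or None."""
    query = parse_qs(urlparse(url).query)
    values = query.get('videoId')
    return values[0] if values else None


embed_url = ('http://graphics8.nytimes.com/bcvideo/1.0/iframe/embed.html'
             '?videoId=100000002747620&playerType=embed')
print(embed_video_id(embed_url))
```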

$ http_proxy= youtube-dl --verbose 'http://graphics8.nytimes.com/bcvideo/1.0/iframe/embed.html?videoId=100000002747620&playerType=embed'
[debug] System config: []
[debug] User config: []
[debug] Command-line args: ['--verbose', 'http://graphics8.nytimes.com/bcvideo/1.0/iframe/embed.html?videoId=100000002747620&playerType=embed']
[debug] Encodings: locale 'ISO8859-1', fs 'ISO8859-1', out 'ISO8859-1', pref: 'ISO8859-1'
[debug] youtube-dl version 2014.03.12
[debug] Python version 2.7.6 - IRIX64-6.5-IP35-mips-n32bit-ELF
[debug] Proxy map: {}
[generic] embed: Requesting header
WARNING: Falling back on generic information extractor.
[generic] embed: Downloading webpage
[generic] embed: Extracting information
ERROR: Unsupported URL: http://graphics8.nytimes.com/bcvideo/1.0/iframe/embed.html?videoId=100000002747620&playerType=embed; please report this issue on https://yt-dl.org/bug . Be sure to call youtube-dl with the --verbose flag and include its complete output. Make sure you are using the latest version; type youtube-dl -U to update.
Traceback (most recent call last):
  File "/usr/nekoware/bin/youtube-dl/youtube_dl/extractor/generic.py", line 288, in _real_extract
    doc = parse_xml(webpage)
  File "/usr/nekoware/bin/youtube-dl/youtube_dl/utils.py", line 1281, in parse_xml
    return xml.etree.ElementTree.XML(s.encode('utf-8'), **kwargs)
  File "/usr/nekoware/lib/python2.7/xml/etree/ElementTree.py", line 1300, in XML
    parser.feed(text)
  File "/usr/nekoware/lib/python2.7/xml/etree/ElementTree.py", line 1642, in feed
    self._raiseerror(v)
  File "/usr/nekoware/lib/python2.7/xml/etree/ElementTree.py", line 1506, in _raiseerror
    raise err
ParseError: mismatched tag: line 1, column 378
Traceback (most recent call last):
  File "/usr/nekoware/bin/youtube-dl/youtube_dl/YoutubeDL.py", line 503, in extract_info
    ie_result = ie.extract(url)
  File "/usr/nekoware/bin/youtube-dl/youtube_dl/extractor/common.py", line 159, in extract
    return self._real_extract(url)
  File "/usr/nekoware/bin/youtube-dl/youtube_dl/extractor/generic.py", line 491, in _real_extract
    raise ExtractorError('Unsupported URL: %s' % url)
ExtractorError: Unsupported URL: http://graphics8.nytimes.com/bcvideo/1.0/iframe/embed.html?videoId=100000002747620&playerType=embed; please report this issue on https://yt-dl.org/bug . Be sure to call youtube-dl with the --verbose flag and include its complete output. Make sure you are using the latest version; type youtube-dl -U to update.
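
The first traceback above shows the generic extractor handing the page to `xml.etree.ElementTree.XML`, which rejects ordinary HTML. The sketch below (Python 3) reproduces only that parsing step; `try_parse_xml` and both sample documents are hypothetical, and whatever else the generic extractor attempts before giving up with "Unsupported URL" is not visible in the log and not modeled here.

```python
import xml.etree.ElementTree as ET


def try_parse_xml(text):
    """Parse text as XML; return the root element, or None if it is not XML."""
    try:
        return ET.XML(text.encode('utf-8'))
    except ET.ParseError:
        return None


# Invented inputs: typical HTML leaves <meta> unclosed, so it is not well-formed XML.
html_page = '<html><head><meta charset="utf-8"></head><body></body></html>'
xml_feed = '<rss><channel><title>example</title></channel></rss>'

print(try_parse_xml(html_page))      # None: the HTML page is rejected
print(try_parse_xml(xml_feed).tag)   # the well-formed document parses
```

Seen this way, the `ParseError` is likely a side effect of probing the page as XML, while the user-facing failure is the later `ExtractorError`.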

@naglis naglis (Collaborator) commented Sep 14, 2014

Can't reproduce @jordanf's problem with the latest youtube-dl (2014.09.14.2), although I get an HTTP Error 403: Forbidden when trying to download.

@netllama netllama commented Oct 12, 2014

$ youtube-dl --version
2014.10.12
$ youtube-dl 'http://www.nytimes.com/2014/10/12/magazine/fine-dining-for-second-graders.html'
[generic] fine-dining-for-second-graders: Requesting header
[redirect] Following redirect to http://www.nytimes.com/2014/10/12/magazine/fine-dining-for-second-graders.html?_r=0
[generic] fine-dining-for-second-graders: Requesting header
WARNING: Falling back on generic information extractor.
[generic] fine-dining-for-second-graders: Downloading webpage
[generic] fine-dining-for-second-graders: Extracting information
ERROR: Unsupported URL: http://www.nytimes.com/2014/10/12/magazine/fine-dining-for-second-graders.html?_r=0; please report this issue on https://yt-dl.org/bug . Be sure to call youtube-dl with the --verbose flag and include its complete output. Make sure you are using the latest version; type youtube-dl -U to update.

@netllama netllama commented Oct 12, 2014

$ youtube-dl --verbose 'http://www.nytimes.com/2014/10/12/magazine/fine-dining-for-second-graders.html'
[debug] System config: ['--prefer-free-formats']
[debug] User config: []
[debug] Command-line args: ['--verbose', 'http://www.nytimes.com/2014/10/12/magazine/fine-dining-for-second-graders.html']
[debug] Encodings: locale UTF-8, fs UTF-8, out UTF-8, pref UTF-8
[debug] youtube-dl version 2014.10.12
[debug] Python version 2.7.5 - Linux-3.15.10-200.fc20.x86_64-x86_64-with-fedora-20-Heisenbug
[debug] Proxy map: {}
[generic] fine-dining-for-second-graders: Requesting header
[redirect] Following redirect to http://www.nytimes.com/2014/10/12/magazine/fine-dining-for-second-graders.html?_r=0
[generic] fine-dining-for-second-graders: Requesting header
WARNING: Falling back on generic information extractor.
[generic] fine-dining-for-second-graders: Downloading webpage
[generic] fine-dining-for-second-graders: Extracting information
ERROR: Unsupported URL: http://www.nytimes.com/2014/10/12/magazine/fine-dining-for-second-graders.html?_r=0; please report this issue on https://yt-dl.org/bug . Be sure to call youtube-dl with the --verbose flag and include its complete output. Make sure you are using the latest version; type youtube-dl -U to update.
Traceback (most recent call last):
  File "/usr/bin/youtube-dl/youtube_dl/extractor/generic.py", line 540, in _real_extract
    doc = parse_xml(webpage)
  File "/usr/bin/youtube-dl/youtube_dl/utils.py", line 1550, in parse_xml
    tree = xml.etree.ElementTree.XML(s.encode('utf-8'), **kwargs)
  File "/usr/lib64/python2.7/xml/etree/ElementTree.py", line 1300, in XML
    parser.feed(text)
  File "/usr/lib64/python2.7/xml/etree/ElementTree.py", line 1642, in feed
    self._raiseerror(v)
  File "/usr/lib64/python2.7/xml/etree/ElementTree.py", line 1506, in _raiseerror
    raise err
ParseError: not well-formed (invalid token): line 2, column 285
Traceback (most recent call last):
  File "/usr/bin/youtube-dl/youtube_dl/YoutubeDL.py", line 524, in extract_info
    ie_result = ie.extract(url)
  File "/usr/bin/youtube-dl/youtube_dl/extractor/common.py", line 193, in extract
    return self._real_extract(url)
  File "/usr/bin/youtube-dl/youtube_dl/extractor/generic.py", line 911, in _real_extract
    raise ExtractorError('Unsupported URL: %s' % url)
ExtractorError: Unsupported URL: http://www.nytimes.com/2014/10/12/magazine/fine-dining-for-second-graders.html?_r=0; please report this issue on https://yt-dl.org/bug . Be sure to call youtube-dl with the --verbose flag and include its complete output. Make sure you are using the latest version; type youtube-dl -U to update.

@dstftw dstftw (Collaborator) commented Nov 14, 2015

All provided example URLs work fine with latest version. Closing.

@dstftw dstftw closed this Nov 14, 2015