Morningstar URL does not scrape #2729

Closed
jillh510 opened this issue Apr 9, 2014 · 2 comments

@jillh510 jillh510 commented Apr 9, 2014

Looks like it's not just a matter of fixing the regex to accept "Cover" as well as "cover" in the Morningstar URLs; the URL http://www.morningstar.com/cover/videoCenter.aspx?id=641059 fails in an identical way.
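
For reference, accepting both spellings in a URL pattern might look like the sketch below. The pattern and the helper name morningstar_video_id are hypothetical, written for illustration and not copied from youtube-dl's source; as noted above, matching the URL is probably not sufficient on its own to make extraction work.

```python
import re

# Hypothetical pattern for illustration only; not taken from youtube-dl.
# [cC] accepts both "Cover" and "cover" in the path.
MORNINGSTAR_URL_RE = re.compile(
    r'https?://(?:www\.)?morningstar\.com/[cC]over/videoCenter\.aspx'
    r'\?id=(?P<id>[0-9]+)'
)


def morningstar_video_id(url):
    """Return the numeric id from a Morningstar video URL, or None."""
    match = MORNINGSTAR_URL_RE.match(url)
    return match.group('id') if match else None
```

Both http://www.morningstar.com/Cover/videoCenter.aspx?id=641059 and the lowercase variant would match this pattern and yield the same id.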

python -m youtube_dl --skip-download --write-info-json -v http://www.morningstar.com/Cover/videoCenter.aspx?id=641059
[debug] System config: []
[debug] User config: []
[debug] Command-line args: ['--skip-download', '--write-info-json', '-v', 'http://www.morningstar.com/Cover/videoCenter.aspx?id=641059']
[debug] Encodings: locale UTF-8, fs utf-8, out UTF-8, pref UTF-8
[debug] youtube-dl version 2014.04.07.4
[debug] Python version 2.7.5 - Darwin-13.1.0-x86_64-i386-64bit
[debug] Proxy map: {}
[generic] videoCenter: Requesting header
WARNING: Falling back on generic information extractor.
[generic] videoCenter: Downloading webpage
[generic] videoCenter: Extracting information
ERROR: Unsupported URL: http://www.morningstar.com/Cover/videoCenter.aspx?id=641059; please report this issue on https://yt-dl.org/bug . Be sure to call youtube-dl with the --verbose flag and include its complete output. Make sure you are using the latest version; type youtube-dl -U to update.
Traceback (most recent call last):
File "/Users/jill/june/.virtualenv/lib/python2.7/site-packages/youtube_dl/extractor/generic.py", line 387, in _real_extract
doc = parse_xml(webpage)
File "/Users/jill/june/.virtualenv/lib/python2.7/site-packages/youtube_dl/utils.py", line 1377, in parse_xml
return xml.etree.ElementTree.XML(s.encode('utf-8'), **kwargs)
File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/xml/etree/ElementTree.py", line 1300, in XML
parser.feed(text)
File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/xml/etree/ElementTree.py", line 1642, in feed
self._raiseerror(v)
File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/xml/etree/ElementTree.py", line 1506, in _raiseerror
raise err
ParseError: not well-formed (invalid token): line 21, column 78
Traceback (most recent call last):
File "/Users/jill/june/.virtualenv/lib/python2.7/site-packages/youtube_dl/YoutubeDL.py", line 514, in extract_info
ie_result = ie.extract(url)
File "/Users/jill/june/.virtualenv/lib/python2.7/site-packages/youtube_dl/extractor/common.py", line 161, in extract
return self._real_extract(url)
File "/Users/jill/june/.virtualenv/lib/python2.7/site-packages/youtube_dl/extractor/generic.py", line 627, in _real_extract
raise ExtractorError('Unsupported URL: %s' % url)
ExtractorError: Unsupported URL: http://www.morningstar.com/Cover/videoCenter.aspx?id=641059; please report this issue on https://yt-dl.org/bug . Be sure to call youtube-dl with the --verbose flag and include its complete output. Make sure you are using the latest version; type youtube-dl -U to update.
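
The first traceback above comes from the generic extractor handing the page to xml.etree.ElementTree.XML, which raises ParseError when the input is not well-formed XML (an ordinary HTML page usually is not). A minimal sketch of that probe-and-fall-through behaviour, assuming a hypothetical helper try_parse_xml that is not part of youtube-dl:

```python
import xml.etree.ElementTree as ET


def try_parse_xml(text):
    """Return the parsed root element, or None if text is not well-formed XML.

    Hypothetical helper for illustration; youtube-dl's own parse_xml
    lets the ParseError propagate to its caller.
    """
    try:
        return ET.XML(text.encode('utf-8'))
    except ET.ParseError:
        return None


feed = try_parse_xml('<rss><channel/></rss>')
page = try_parse_xml('<html><body><p>a & b</body></html>')
```

Here feed is an element whose tag is rss, while page is None because the bare ampersand is an invalid token. The ParseError in the log is therefore expected for an HTML page; the error that matters is the final ExtractorError.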

@jaimeMF jaimeMF closed this in 6b7dee4 Apr 9, 2014
jaimeMF added a commit that referenced this issue Apr 9, 2014
Collaborator

@jaimeMF jaimeMF commented Apr 9, 2014

Thanks for the report.

Contributor

@phihag phihag commented Apr 21, 2014

Fixed in the current version of youtube-dl. Type pip install -U youtube-dl to update.
