Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Site Request: Huffingtonpost.CA #7046

Open
RingoTheDog opened this issue Oct 2, 2015 · 0 comments
Open

Site Request: Huffingtonpost.CA #7046

RingoTheDog opened this issue Oct 2, 2015 · 0 comments

Comments

@RingoTheDog
Copy link

@RingoTheDog RingoTheDog commented Oct 2, 2015

.....I am hoping this is just a matter of adding the .ca URL to the existing huffpost extractor.

URL: http://www.huffingtonpost.ca/2015/09/28/entrepreneur-enters-hands-free-hoverboard-market-engulfed-in-patent-war_n_8207232.html#_methods=onPlusOne2C_ready2C_close2C_open2C_resizeMe2C_renderstart2Concircled2Cdrefresh2Cerefresh2Conloadid=I0_1443544219927parent=http3A2F2Fwwwhuffingtonpostcapfname=rpctoken=47705081

Output:
c:\Transmogrifier>youtube-dl.py -v "http://www.huffingtonpost.ca/2015/09/28/entrepreneur-enters-hands-free-hoverboard-market-engulfed-in-patent-war_n_8207232.html#_methods=onPlusOne
open2C_resizeMe2C_renderstart2Concircled2Cdrefresh2Cerefresh2Conloadid=I0_1443544219927parent=http3A2F2Fwwwhuffingtonpostcapfname=rpctoken=47705081"
[debug] System config: []
[debug] User config: []
[debug] Command-line args: [u'-v', u'http://www.huffingtonpost.ca/2015/09/28/entrepreneur-enters-hands-free-hoverboard-market-engulfed-in-patent-war_n_8207232.html#_methods=onPlusOn
_open2C_resizeMe2C_renderstart2Concircled2Cdrefresh2Cerefresh2Conloadid=I0_1443544219927parent=http3A2F2Fwwwhuffingtonpostcapfname=rpctoken=47705081']
[debug] Encodings: locale cp1252, fs mbcs, out cp850, pref cp1252
[debug] youtube-dl version 2015.09.28
[debug] Python version 2.7.5 - Windows-7-6.1.7601-SP1
[debug] exe versions: ffmpeg N-71727-g46778ab, rtmpdump 2.4
[debug] Proxy map: {}
[generic] entrepreneur-enters-hands-free-hoverboard-market-engulfed-in-patent-war_n_8207232: Requesting header
WARNING: Falling back on generic information extractor.
[generic] entrepreneur-enters-hands-free-hoverboard-market-engulfed-in-patent-war_n_8207232: Downloading webpage
[generic] entrepreneur-enters-hands-free-hoverboard-market-engulfed-in-patent-war_n_8207232: Extracting information
ERROR: Unsupported URL: http://www.huffingtonpost.ca/2015/09/28/entrepreneur-enters-hands-free-hoverboard-market-engulfed-in-patent-war_n_8207232.html#_methods=onPlusOne2C_ready2C_c
eMe2C_renderstart2Concircled2Cdrefresh2Cerefresh2Conloadid=I0_1443544219927parent=http3A2F2Fwwwhuffingtonpostcapfname=rpctoken=47705081
Traceback (most recent call last):
File "C:\Transmogrifier\youtube-dl.py\youtube_dl\extractor\generic.py", line 1240, in _real_extract
doc = parse_xml(webpage)
File "C:\Transmogrifier\youtube-dl.py\youtube_dl\utils.py", line 1656, in parse_xml
tree = xml.etree.ElementTree.XML(s.encode('utf-8'), **kwargs)
File "C:\Python27\lib\xml\etree\ElementTree.py", line 1300, in XML
parser.feed(text)
File "C:\Python27\lib\xml\etree\ElementTree.py", line 1642, in feed
self._raiseerror(v)
File "C:\Python27\lib\xml\etree\ElementTree.py", line 1506, in _raiseerror
raise err
ParseError: undefined entity: line 47, column 0
Traceback (most recent call last):
File "C:\Transmogrifier\youtube-dl.py\youtube_dl\YoutubeDL.py", line 660, in extract_info
ie_result = ie.extract(url)
File "C:\Transmogrifier\youtube-dl.py\youtube_dl\extractor\common.py", line 288, in extract
return self._real_extract(url)
File "C:\Transmogrifier\youtube-dl.py\youtube_dl\extractor\generic.py", line 1838, in _real_extract
raise UnsupportedError(url)
UnsupportedError: Unsupported URL: http://www.huffingtonpost.ca/2015/09/28/entrepreneur-enters-hands-free-hoverboard-market-engulfed-in-patent-war_n_8207232.html#_methods=onPlusOne2
pen2C_resizeMe2C_renderstart2Concircled2Cdrefresh2Cerefresh2Conloadid=I0_1443544219927parent=http3A2F2Fwwwhuffingtonpostcapfname=rpctoken=47705081

....and now that I look at it, there seems to be a number international versions:

http://www.huffingtonpost.com.au/
http://www.huffpostarabi.com/
http://www.brasilpost.com.br/
http://www.huffingtonpost.ca/
http://www.huffingtonpost.de/
http://www.huffingtonpost.es/
http://www.huffingtonpost.fr/
http://www.huffingtonpost.gr/
http://www.huffingtonpost.in/
http://www.huffingtonpost.it/
http://www.huffingtonpost.jp/
http://www.huffingtonpost.kr/
http://www.huffpostmaghreb.com/
http://www.huffingtonpost.co.uk/
http://www.huffingtonpost.com/

Thanks
Ringo

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Linked pull requests

Successfully merging a pull request may close this issue.

None yet
3 participants
You can’t perform that action at this time.