Join GitHub today
GitHub is home to over 50 million developers working together to host and review code, manage projects, and build software together.
Sign upGitHub is where the world builds software
Millions of developers and companies build, ship, and maintain their software on GitHub — the largest and most advanced development platform in the world.
.....I am hoping this is just a matter of adding the .ca URL to the existing huffpost extractor.
URL: http://www.huffingtonpost.ca/2015/09/28/entrepreneur-enters-hands-free-hoverboard-market-engulfed-in-patent-war_n_8207232.html#_methods=onPlusOne2C_ready2C_close2C_open2C_resizeMe2C_renderstart2Concircled2Cdrefresh2Cerefresh2Conloadid=I0_1443544219927parent=http3A2F2Fwwwhuffingtonpostcapfname=rpctoken=47705081
Output:
c:\Transmogrifier>youtube-dl.py -v "http://www.huffingtonpost.ca/2015/09/28/entrepreneur-enters-hands-free-hoverboard-market-engulfed-in-patent-war_n_8207232.html#_methods=onPlusOne
open2C_resizeMe2C_renderstart2Concircled2Cdrefresh2Cerefresh2Conloadid=I0_1443544219927parent=http3A2F2Fwwwhuffingtonpostcapfname=rpctoken=47705081"
[debug] System config: []
[debug] User config: []
[debug] Command-line args: [u'-v', u'http://www.huffingtonpost.ca/2015/09/28/entrepreneur-enters-hands-free-hoverboard-market-engulfed-in-patent-war_n_8207232.html#_methods=onPlusOn
_open2C_resizeMe2C_renderstart2Concircled2Cdrefresh2Cerefresh2Conloadid=I0_1443544219927parent=http3A2F2Fwwwhuffingtonpostcapfname=rpctoken=47705081']
[debug] Encodings: locale cp1252, fs mbcs, out cp850, pref cp1252
[debug] youtube-dl version 2015.09.28
[debug] Python version 2.7.5 - Windows-7-6.1.7601-SP1
[debug] exe versions: ffmpeg N-71727-g46778ab, rtmpdump 2.4
[debug] Proxy map: {}
[generic] entrepreneur-enters-hands-free-hoverboard-market-engulfed-in-patent-war_n_8207232: Requesting header
WARNING: Falling back on generic information extractor.
[generic] entrepreneur-enters-hands-free-hoverboard-market-engulfed-in-patent-war_n_8207232: Downloading webpage
[generic] entrepreneur-enters-hands-free-hoverboard-market-engulfed-in-patent-war_n_8207232: Extracting information
ERROR: Unsupported URL: http://www.huffingtonpost.ca/2015/09/28/entrepreneur-enters-hands-free-hoverboard-market-engulfed-in-patent-war_n_8207232.html#_methods=onPlusOne2C_ready2C_c
eMe2C_renderstart2Concircled2Cdrefresh2Cerefresh2Conloadid=I0_1443544219927parent=http3A2F2Fwwwhuffingtonpostcapfname=rpctoken=47705081
Traceback (most recent call last):
File "C:\Transmogrifier\youtube-dl.py\youtube_dl\extractor\generic.py", line 1240, in _real_extract
doc = parse_xml(webpage)
File "C:\Transmogrifier\youtube-dl.py\youtube_dl\utils.py", line 1656, in parse_xml
tree = xml.etree.ElementTree.XML(s.encode('utf-8'), **kwargs)
File "C:\Python27\lib\xml\etree\ElementTree.py", line 1300, in XML
parser.feed(text)
File "C:\Python27\lib\xml\etree\ElementTree.py", line 1642, in feed
self._raiseerror(v)
File "C:\Python27\lib\xml\etree\ElementTree.py", line 1506, in _raiseerror
raise err
ParseError: undefined entity: line 47, column 0
Traceback (most recent call last):
File "C:\Transmogrifier\youtube-dl.py\youtube_dl\YoutubeDL.py", line 660, in extract_info
ie_result = ie.extract(url)
File "C:\Transmogrifier\youtube-dl.py\youtube_dl\extractor\common.py", line 288, in extract
return self._real_extract(url)
File "C:\Transmogrifier\youtube-dl.py\youtube_dl\extractor\generic.py", line 1838, in _real_extract
raise UnsupportedError(url)
UnsupportedError: Unsupported URL: http://www.huffingtonpost.ca/2015/09/28/entrepreneur-enters-hands-free-hoverboard-market-engulfed-in-patent-war_n_8207232.html#_methods=onPlusOne2
pen2C_resizeMe2C_renderstart2Concircled2Cdrefresh2Cerefresh2Conloadid=I0_1443544219927parent=http3A2F2Fwwwhuffingtonpostcapfname=rpctoken=47705081
....and now that I look at it, there seems to be a number international versions:
http://www.huffingtonpost.com.au/
http://www.huffpostarabi.com/
http://www.brasilpost.com.br/
http://www.huffingtonpost.ca/
http://www.huffingtonpost.de/
http://www.huffingtonpost.es/
http://www.huffingtonpost.fr/
http://www.huffingtonpost.gr/
http://www.huffingtonpost.in/
http://www.huffingtonpost.it/
http://www.huffingtonpost.jp/
http://www.huffingtonpost.kr/
http://www.huffpostmaghreb.com/
http://www.huffingtonpost.co.uk/
http://www.huffingtonpost.com/
Thanks
Ringo