Site Request: BuzzFeed #4259

Closed
RingoTheDog opened this issue Nov 20, 2014 · 8 comments

Comments

@RingoTheDog RingoTheDog commented Nov 20, 2014

http://www.buzzfeed.com/abagg/this-angry-ram-destroys-a-punching-bag-like-a-boss?utm_term=4ldqpia

Most clips are from YouTube.

youtube-dl.py -v http://www.buzzfeed.com/abagg/this-angry-ram-destroys-a-punching-bag-like-a-boss?utm_term=4ldqpia
[debug] System config: []
[debug] User config: []
[debug] Command-line args: ['-v', 'http://www.buzzfeed.com/abagg/this-angry-ram-destroys-a-punching-bag-like-a-boss?utm_term=4ldqpia']
[debug] Encodings: locale cp1252, fs mbcs, out cp850, pref cp1252
[debug] youtube-dl version 2014.11.20
[debug] Python version 2.7.5 - Windows-7-6.1.7601-SP1
[debug] exe versions: ffmpeg N-40824-, rtmpdump 2.4
[debug] Proxy map: {}
[generic] this-angry-ram-destroys-a-punching-bag-like-a-boss?utm_term=4ldqpia: Requesting header
WARNING: Falling back on generic information extractor.
[generic] this-angry-ram-destroys-a-punching-bag-like-a-boss?utm_term=4ldqpia: Downloading webpage
[generic] this-angry-ram-destroys-a-punching-bag-like-a-boss?utm_term=4ldqpia: Extracting information
ERROR: Unsupported URL: http://www.buzzfeed.com/abagg/this-angry-ram-destroys-a-punching-bag-like-a-boss?utm_term=4ldqpia; please report this issue on https://yt-dl.org/bug . Be sure to call youtube-dl with the --verbose flag and include its complete output. Make sure you are using the latest version; type youtube-dl -U to update.
Traceback (most recent call last):
  File "C:\Transmogrifier\youtube-dl.py\youtube_dl\extractor\generic.py", line 609, in _real_extract
    doc = parse_xml(webpage)
  File "C:\Transmogrifier\youtube-dl.py\youtube_dl\utils.py", line 1332, in parse_xml
    tree = xml.etree.ElementTree.XML(s.encode('utf-8'), **kwargs)
  File "C:\Python27\lib\xml\etree\ElementTree.py", line 1300, in XML
    parser.feed(text)
  File "C:\Python27\lib\xml\etree\ElementTree.py", line 1642, in feed
    self._raiseerror(v)
  File "C:\Python27\lib\xml\etree\ElementTree.py", line 1506, in _raiseerror
    raise err
ParseError: not well-formed (invalid token): line 29, column 86
Traceback (most recent call last):
  File "C:\Transmogrifier\youtube-dl.py\youtube_dl\YoutubeDL.py", line 536, in extract_info
    ie_result = ie.extract(url)
  File "C:\Transmogrifier\youtube-dl.py\youtube_dl\extractor\common.py", line 199, in extract
    return self._real_extract(url)
  File "C:\Transmogrifier\youtube-dl.py\youtube_dl\extractor\generic.py", line 996, in _real_extract
    raise ExtractorError('Unsupported URL: %s' % url)
ExtractorError: Unsupported URL: http://www.buzzfeed.com/abagg/this-angry-ram-destroys-a-punching-bag-like-a-boss?utm_term=4ldqpia; please report this issue on https://yt-dl.org/bug . Be sure to call youtube-dl with the --verbose flag and include its complete output. Make sure you are using the latest version; type youtube-dl -U to update.

(JFYI...RSS Feed can be found here: http://www.buzzfeed.com/videos.xml)

Thanks
Ringo
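
For context on the first traceback in the log above: the generic extractor calls parse_xml on the downloaded page, and an ordinary HTML page is usually not well-formed XML, so the ParseError itself is expected rather than the root problem. A minimal sketch of that failure mode using only the standard library follows. The sample markup and the looks_like_xml helper are made up for illustration and are not part of youtube-dl.

```python
import xml.etree.ElementTree as ET

# Hypothetical HTML fragment: the unquoted attribute values are legal in
# HTML but make the text invalid as XML.
SAMPLE_HTML = '<html><body><embed src=http://example.com/v width=860></body></html>'


def looks_like_xml(text):
    """Return True if text parses as well-formed XML, False otherwise."""
    try:
        ET.fromstring(text.encode('utf-8'))
        return True
    except ET.ParseError:
        return False


print(looks_like_xml(SAMPLE_HTML))                    # False
print(looks_like_xml('<rss><channel/></rss>'))        # True
```

A feed such as the videos.xml mentioned above would be expected to take the True branch, while article pages fall through to the "Unsupported URL" error.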

@lofidevops lofidevops commented Nov 26, 2014

Slightly different error message with 2014.11.25.1 on Ubuntu 14.10 (not sure if this helps or just confirms it is indeed a site request and not a generic extractor issue):

youtube-dl --verbose "http://www.buzzfeed.com/abagg/this-angry-ram-destroys-a-punching-bag-like-a-boss?utm_term=4ldqpia"
[debug] System config: []
[debug] User config: []
[debug] Command-line args: ['--verbose', 'http://www.buzzfeed.com/abagg/this-angry-ram-destroys-a-punching-bag-like-a-boss?utm_term=4ldqpia']
[debug] Encodings: locale UTF-8, fs utf-8, out UTF-8, pref UTF-8
[debug] youtube-dl version 2014.11.25.1
[debug] Python version 3.4.2 - Linux-3.16.0-25-generic-x86_64-with-Ubuntu-14.10-utopic
[debug] exe versions: avconv 11-6, avprobe 11-6, rtmpdump 2.4
[debug] Proxy map: {}
[generic] this-angry-ram-destroys-a-punching-bag-like-a-boss?utm_term=4ldqpia: Requesting header
WARNING: Falling back on generic information extractor.
[generic] this-angry-ram-destroys-a-punching-bag-like-a-boss?utm_term=4ldqpia: Downloading webpage
[generic] this-angry-ram-destroys-a-punching-bag-like-a-boss?utm_term=4ldqpia: Extracting information
ERROR: Unsupported URL: http://www.buzzfeed.com/abagg/this-angry-ram-destroys-a-punching-bag-like-a-boss?utm_term=4ldqpia; please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; see  https://yt-dl.org/update  on how to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.
Traceback (most recent call last):
  File "/home/user/.local/lib/python3.4/site-packages/youtube_dl/YoutubeDL.py", line 553, in extract_info
    ie_result = ie.extract(url)
  File "/home/user/.local/lib/python3.4/site-packages/youtube_dl/extractor/common.py", line 240, in extract
    return self._real_extract(url)
  File "/home/user/.local/lib/python3.4/site-packages/youtube_dl/extractor/generic.py", line 1010, in _real_extract
    raise ExtractorError('Unsupported URL: %s' % url)
youtube_dl.utils.ExtractorError: Unsupported URL: http://www.buzzfeed.com/abagg/this-angry-ram-destroys-a-punching-bag-like-a-boss?utm_term=4ldqpia; please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; see  https://yt-dl.org/update  on how to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.
Contributor

@phihag phihag commented Nov 26, 2014

This is specific to Buzzfeed - the site doesn't contain any actual video but just animated gifs with a textual reference to the YouTube video. Would you expect youtube-dl to download the gif files or the YouTube video?

Author

@RingoTheDog RingoTheDog commented Nov 26, 2014

No, youtube-dl should not download the animated GIFs, but it should find the embedded YouTube clip.

Of possible relevance: when I mouse over the YouTube video on BuzzFeed, a series of icons pops up on the left (for Pinterest, Facebook, email, etc.).

[image: buzzfeed]

Here is the DIV with the embedded link from that page (embed id="embed_" src=)

div class="video-embed-videopost video-embed-area bf_dom" id="video_buzz_element_" rel:thumb="http://img.youtube.com/vi/aVCR29aE_OQ/2.jpg" rel:bf_bucket_data="{"video" : {"size": "videopost", "width":"860", "height":"517", "url":"http://www.youtube.com/v/aVCR29aE_OQ", "id":""}}"

embed id="embed_" src="http://www.youtube.com/v/aVCR29aE_OQ?version=3&hl=en&fs=1&enablejsapi=1&playerapiid=embed_&autoplay=0&showinfo=1" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="860" height="517" bgcolor="#000" wmode="opaque" class="ytvideo"

/div
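
The markup quoted above suggests one way the clip could be located: search the page source for an embed element whose src points at youtube.com/v/ and take the video ID from it. The sketch below is only an illustration of that idea, not the code from the actual fix. The PAGE string is abbreviated from the quoted snippet with angle brackets restored, and EMBED_RE and find_youtube_ids are hypothetical names. It also assumes IDs are 11 characters long.

```python
import re

# Abbreviated from the snippet quoted in this comment (angle brackets restored).
PAGE = '''<div class="video-embed-videopost video-embed-area bf_dom" rel:thumb="http://img.youtube.com/vi/aVCR29aE_OQ/2.jpg">
<embed id="embed_" src="http://www.youtube.com/v/aVCR29aE_OQ?version=3&hl=en&fs=1" type="application/x-shockwave-flash" class="ytvideo">
</div>'''

# Captures the ID from an <embed> whose src is a youtube.com/v/<id> URL.
# Assumption: video IDs are exactly 11 characters from [0-9A-Za-z_-].
EMBED_RE = re.compile(
    r'<embed[^>]+src=["\'](?:https?:)?//(?:www\.)?youtube\.com/v/([0-9A-Za-z_-]{11})'
)


def find_youtube_ids(html):
    """Return the unique YouTube video IDs found in html, in document order."""
    seen = []
    for video_id in EMBED_RE.findall(html):
        if video_id not in seen:
            seen.append(video_id)
    return seen


print(find_youtube_ids(PAGE))  # ['aVCR29aE_OQ']
```

Each ID could then be handed to the existing YouTube support as http://www.youtube.com/watch?v=ID, which works on its own from the command line.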

Contributor

@phihag phihag commented Nov 26, 2014

Oh, I'm sorry, the website renders completely differently for me:
[image: ytdl_screenshot]

I'll look into the source.

Author

@RingoTheDog RingoTheDog commented Nov 26, 2014

ha ha ha ha ha.... grumpy cat

...No problem ...and THANK YOU VERY MUCH!

@phihag phihag closed this in e232f78 Nov 26, 2014
Contributor

@phihag phihag commented Nov 26, 2014

Thank you for the report. I believe this issue to be fixed in youtube-dl 2014.11.26.4 and newer. See our FAQ if you are having trouble updating. I am not sure that I have captured everything on Buzzfeed though, so I'd appreciate feedback (and more example URLs) about the new extractor.

Author

@RingoTheDog RingoTheDog commented Nov 27, 2014

Nicely done. That ram video works, but here is one that does not:
http://www.buzzfeed.com/sheridanwatson/look-at-this-cute-dog-omg?utm_term=4ldqpia

youtube-dl.py -v http://www.buzzfeed.com/sheridanwatson/look-at-this-cute-dog-omg?utm_term=4ldqpia
[debug] System config: []
[debug] User config: []
[debug] Command-line args: ['-v', 'http://www.buzzfeed.com/sheridanwatson/look-at-this-cute-dog-omg?utm_term=4ldqpia']
[debug] Encodings: locale cp1252, fs mbcs, out cp850, pref cp1252
[debug] youtube-dl version 2014.11.26.4
[debug] Python version 2.7.5 - Windows-7-6.1.7601-SP1
[debug] exe versions: ffmpeg N-40824-, rtmpdump 2.4
[debug] Proxy map: {}
[BuzzFeed] look-at-this-cute-dog-omg: Downloading webpage
[download] Downloading playlist: You Need To Stop What You're Doing And Watching This Dog Walk On A Treadmill
[BuzzFeed] playlist You Need To Stop What You're Doing And Watching This Dog Walk On A Treadmill: Collected 0 video ids (downloading 0 of them)

thanks
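
Two details in the log above are worth spelling out. The new extractor labels the page look-at-this-cute-dog-omg, that is, the last path segment of the URL without the utm_term query string, and it finishes without an error even though it collected 0 video ids. The sketch below shows both points under stated assumptions: display_id and require_entries are hypothetical helpers written for this illustration, not functions from youtube-dl, and raising on an empty result is only one possible way to surface the problem.

```python
from urllib.parse import urlparse


def display_id(url):
    """Return the last path segment of url, ignoring query string and fragment."""
    path = urlparse(url).path.rstrip('/')
    return path.rsplit('/', 1)[-1]


def require_entries(title, video_ids):
    """Raise ValueError instead of silently accepting an empty playlist."""
    if not video_ids:
        raise ValueError('No videos found for playlist %r' % title)
    return video_ids


url = 'http://www.buzzfeed.com/sheridanwatson/look-at-this-cute-dog-omg?utm_term=4ldqpia'
print(display_id(url))  # look-at-this-cute-dog-omg
```

With a check like this, a page whose embeds are not recognised would produce a visible error rather than an empty download.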

Contributor

@phihag phihag commented Nov 27, 2014

Ok, added support for that type of video too in youtube-dl 2014.11.27.
