Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some TMZ.com videos are not downloaded #24603

Open
diegorodriguezv opened this issue Apr 3, 2020 · 0 comments · May be fixed by #24687
Open

Some TMZ.com videos are not downloaded #24603

diegorodriguezv opened this issue Apr 3, 2020 · 0 comments · May be fixed by #24687

Comments

@diegorodriguezv
Copy link

@diegorodriguezv diegorodriguezv commented Apr 3, 2020

Checklist

  • I'm reporting a broken site support
  • I've verified that I'm running youtube-dl version 2020.03.24
  • I've checked that all provided URLs are alive and playable in a browser
  • I've checked that all URLs and arguments with special characters are properly quoted or escaped
  • I've searched the bugtracker for similar issues including closed ones

Description with Verbose log

Hey! Thanks for this amazing piece of software.
This bug report combines two issues about TMZ.com.

  1. Around november 2019 TMZ started adding videos with a new URL pattern. These videos do not download correctly.
    https://www.tmz.com/videos/071119-chris-morgan-women-4590005-0-zcsejvcr/
pi@telediego:~/Videos $ youtube-dl -v https://www.tmz.com/videos/071119-chris-morgan-women-4590005-0-zcsejvcr/
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: [u'-v', u'https://www.tmz.com/videos/071119-chris-morgan-women-4590005-0-zcsejvcr/']
[debug] Encodings: locale UTF-8, fs UTF-8, out UTF-8, pref UTF-8
[debug] youtube-dl version 2020.03.24
[debug] Python version 2.7.9 (CPython) - Linux-4.19.108-v7+-armv7l-with-debian-8.0
[debug] exe versions: avconv 11.12-6, avprobe 11.12-6, ffmpeg N-51907-ge27a35e045-static, ffprobe N-51907-ge27a35e045-static, phantomjs ., rtmpdump 2.4
[debug] Proxy map: {}
[Kaltura] 071119_chris_morgan_women_4590005_0_zcsejvcr: Downloading video info JSON
ERROR: An extractor error has occurred. (caused by KeyError(u'dataUrl',)); please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; see  https://yt-dl.org/update  on how to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/common.py", line 530, in extract
    ie_result = self._real_extract(url)
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/kaltura.py", line 291, in _real_extract
    data_url = info['dataUrl']
KeyError: u'dataUrl'
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/YoutubeDL.py", line 797, in extract_info
    ie_result = ie.extract(url)
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/common.py", line 543, in extract
    raise ExtractorError('An extractor error has occurred.', cause=e)
ExtractorError: An extractor error has occurred. (caused by KeyError(u'dataUrl',)); please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; see  https://yt-dl.org/update  on how to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.

  1. Apparently there is a regression. Some URLs that were reported in previous issues closed as solved are not working anymore.

http://www.tmz.com/2015/04/19/bobby-brown-bobbi-kristina-awake-video-concert/ reported in #5477

pi@telediego:~/Videos $ youtube-dl -v http://www.tmz.com/2015/04/19/bobby-brown-bobbi-kristina-awake-video-concert/
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: [u'-v', u'http://www.tmz.com/2015/04/19/bobby-brown-bobbi-kristina-awake-video-concert/']
[debug] Encodings: locale UTF-8, fs UTF-8, out UTF-8, pref UTF-8
[debug] youtube-dl version 2020.03.24
[debug] Python version 2.7.9 (CPython) - Linux-4.19.108-v7+-armv7l-with-debian-8.0
[debug] exe versions: avconv 11.12-6, avprobe 11.12-6, ffmpeg N-51907-ge27a35e045-static, ffprobe N-51907-ge27a35e045-static, phantomjs ., rtmpdump 2.4
[debug] Proxy map: {}
[TMZArticle] bobby-brown-bobbi-kristina-awake-video-concert: Downloading webpage
ERROR: Unable to extract embedded video info; please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; see  https://yt-dl.org/update  on how to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/YoutubeDL.py", line 797, in extract_info
    ie_result = ie.extract(url)
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/common.py", line 530, in extract
    ie_result = self._real_extract(url)
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/tmz.py", line 52, in _real_extract
    r'tmzVideoEmbed\(({.+?})\);', webpage, 'embedded video info'),
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/common.py", line 1014, in _html_search_regex
    res = self._search_regex(pattern, string, name, default, fatal, flags, group)
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/common.py", line 1005, in _search_regex
    raise RegexNotFoundError('Unable to extract %s' % _name)
RegexNotFoundError: Unable to extract embedded video info; please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; see  https://yt-dl.org/update  on how to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.

http://www.tmz.com/2015/09/19/patti-labelle-concert-fan-stripping-kicked-out-nicki-minaj/ reported in #6898

pi@telediego:~/Videos $ youtube-dl -v http://www.tmz.com/2015/09/19/patti-labelle-concert-fan-stripping-kicked-out-nicki-minaj/
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: [u'-v', u'http://www.tmz.com/2015/09/19/patti-labelle-concert-fan-stripping-kicked-out-nicki-minaj/']
[debug] Encodings: locale UTF-8, fs UTF-8, out UTF-8, pref UTF-8
[debug] youtube-dl version 2020.03.24
[debug] Python version 2.7.9 (CPython) - Linux-4.19.108-v7+-armv7l-with-debian-8.0
[debug] exe versions: avconv 11.12-6, avprobe 11.12-6, ffmpeg N-51907-ge27a35e045-static, ffprobe N-51907-ge27a35e045-static, phantomjs ., rtmpdump 2.4
[debug] Proxy map: {}
[TMZArticle] patti-labelle-concert-fan-stripping-kicked-out-nicki-minaj: Downloading webpage
ERROR: Unable to extract embedded video info; please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; see  https://yt-dl.org/update  on how to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/YoutubeDL.py", line 797, in extract_info
    ie_result = ie.extract(url)
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/common.py", line 530, in extract
    ie_result = self._real_extract(url)
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/tmz.py", line 52, in _real_extract
    r'tmzVideoEmbed\(({.+?})\);', webpage, 'embedded video info'),
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/common.py", line 1014, in _html_search_regex
    res = self._search_regex(pattern, string, name, default, fatal, flags, group)
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/common.py", line 1005, in _search_regex
    raise RegexNotFoundError('Unable to extract %s' % _name)
RegexNotFoundError: Unable to extract embedded video info; please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; see  https://yt-dl.org/update  on how to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.

http://www.tmz.com/2016/01/28/adam-silver-sting-drake-blake-griffin/ also reported in #6898

pi@telediego:~/Videos $ youtube-dl -v http://www.tmz.com/2016/01/28/adam-silver-sting-drake-blake-griffin/ 
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: [u'-v', u'http://www.tmz.com/2016/01/28/adam-silver-sting-drake-blake-griffin/']
[debug] Encodings: locale UTF-8, fs UTF-8, out UTF-8, pref UTF-8
[debug] youtube-dl version 2020.03.24
[debug] Python version 2.7.9 (CPython) - Linux-4.19.108-v7+-armv7l-with-debian-8.0
[debug] exe versions: avconv 11.12-6, avprobe 11.12-6, ffmpeg N-51907-ge27a35e045-static, ffprobe N-51907-ge27a35e045-static, phantomjs ., rtmpdump 2.4
[debug] Proxy map: {}
[TMZArticle] adam-silver-sting-drake-blake-griffin: Downloading webpage
ERROR: Unable to extract embedded video info; please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; see  https://yt-dl.org/update  on how to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/YoutubeDL.py", line 797, in extract_info
    ie_result = ie.extract(url)
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/common.py", line 530, in extract
    ie_result = self._real_extract(url)
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/tmz.py", line 52, in _real_extract
    r'tmzVideoEmbed\(({.+?})\);', webpage, 'embedded video info'),
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/common.py", line 1014, in _html_search_regex
    res = self._search_regex(pattern, string, name, default, fatal, flags, group)
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/common.py", line 1005, in _search_regex
    raise RegexNotFoundError('Unable to extract %s' % _name)
RegexNotFoundError: Unable to extract embedded video info; please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; see  https://yt-dl.org/update  on how to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.

http://www.tmz.com/2016/10/27/donald-trump-star-vandal-arrested-james-otis/ reported in #11052

pi@telediego:~/Videos $ youtube-dl -v http://www.tmz.com/2016/10/27/donald-trump-star-vandal-arrested-james-otis/ 
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: [u'-v', u'http://www.tmz.com/2016/10/27/donald-trump-star-vandal-arrested-james-otis/']
[debug] Encodings: locale UTF-8, fs UTF-8, out UTF-8, pref UTF-8
[debug] youtube-dl version 2020.03.24
[debug] Python version 2.7.9 (CPython) - Linux-4.19.108-v7+-armv7l-with-debian-8.0
[debug] exe versions: avconv 11.12-6, avprobe 11.12-6, ffmpeg N-51907-ge27a35e045-static, ffprobe N-51907-ge27a35e045-static, phantomjs ., rtmpdump 2.4
[debug] Proxy map: {}
[TMZArticle] donald-trump-star-vandal-arrested-james-otis: Downloading webpage
ERROR: Unable to extract embedded video info; please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; see  https://yt-dl.org/update  on how to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/YoutubeDL.py", line 797, in extract_info
    ie_result = ie.extract(url)
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/common.py", line 530, in extract
    ie_result = self._real_extract(url)
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/tmz.py", line 52, in _real_extract
    r'tmzVideoEmbed\(({.+?})\);', webpage, 'embedded video info'),
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/common.py", line 1014, in _html_search_regex
    res = self._search_regex(pattern, string, name, default, fatal, flags, group)
  File "/usr/local/lib/python2.7/dist-packages/youtube_dl/extractor/common.py", line 1005, in _search_regex
    raise RegexNotFoundError('Unable to extract %s' % _name)
RegexNotFoundError: Unable to extract embedded video info; please report this issue on https://yt-dl.org/bug . Make sure you are using the latest version; see  https://yt-dl.org/update  on how to update. Be sure to call youtube-dl with the --verbose flag and include its complete output.

Here are examples that do work. Some of them are the same videos as reported above but within a different URL.
http://www.tmz.com/videos/0_6snoelag/
http://www.tmz.com/videos/0_jerz7s3l/
https://www.tmz.com/videos/0-zzwdhycc/
http://www.tmz.com/videos/0_a2lkk7ba/
http://www.tmz.com/videos/0_lbsyncxu/

Thanks in advance for your help.

@diegorodriguezv diegorodriguezv linked a pull request that will close this issue Apr 8, 2020
5 of 9 tasks complete
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Linked pull requests

Successfully merging a pull request may close this issue.

1 participant
You can’t perform that action at this time.