Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Sohu] sohu.py exists, but getting Unsupported URL error #28944

Open
5 tasks done
someziggyman opened this issue May 2, 2021 · 0 comments
Open
5 tasks done

[Sohu] sohu.py exists, but getting Unsupported URL error #28944

someziggyman opened this issue May 2, 2021 · 0 comments

Comments

@someziggyman
Copy link

Checklist

  • I'm reporting a broken site support
  • I've verified that I'm running youtube-dl version 2021.04.26
  • I've checked that all provided URLs are alive and playable in a browser
  • I've checked that all URLs and arguments with special characters are properly quoted or escaped
  • I've searched the bugtracker for similar issues including closed ones

Verbose log

youtube-dl -v -F https://tv.sohu.com/v/dXMvNTAyMjA5MTMvNjg1NjIyNTYuc2h0bWw=.html?src=pl
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: [u'-v', u'-F', u'https://tv.sohu.com/v/dXMvNTAyMjA5MTMvNjg1NjIyNTYuc2h0bWw=.html?src=pl']
[debug] Encodings: locale UTF-8, fs utf-8, out UTF-8, pref UTF-8
[debug] youtube-dl version 2021.04.26
[debug] Python version 2.7.16 (CPython) - Darwin-20.4.0-x86_64-i386-64bit
[debug] exe versions: none
[debug] Proxy map: {}
[generic] dXMvNTAyMjA5MTMvNjg1NjIyNTYuc2h0bWw=: Requesting header
WARNING: Falling back on generic information extractor.
[generic] dXMvNTAyMjA5MTMvNjg1NjIyNTYuc2h0bWw=: Downloading webpage
[generic] dXMvNTAyMjA5MTMvNjg1NjIyNTYuc2h0bWw=: Extracting information
ERROR: Unsupported URL: https://tv.sohu.com/v/dXMvNTAyMjA5MTMvNjg1NjIyNTYuc2h0bWw=.html?src=pl
Traceback (most recent call last):
  File "/usr/local/bin/youtube-dl/youtube_dl/extractor/generic.py", line 2497, in _real_extract
    doc = compat_etree_fromstring(webpage.encode('utf-8'))
  File "/usr/local/bin/youtube-dl/youtube_dl/compat.py", line 2571, in compat_etree_fromstring
    doc = _XML(text, parser=etree.XMLParser(target=_TreeBuilder(element_factory=_element_factory)))
  File "/usr/local/bin/youtube-dl/youtube_dl/compat.py", line 2560, in _XML
    parser.feed(text)
  File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/xml/etree/ElementTree.py", line 1659, in feed
    self._raiseerror(v)
  File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/xml/etree/ElementTree.py", line 1523, in _raiseerror
    raise err
ParseError: not well-formed (invalid token): line 45, column 136
Traceback (most recent call last):
  File "/usr/local/bin/youtube-dl/youtube_dl/YoutubeDL.py", line 806, in wrapper
    return func(self, *args, **kwargs)
  File "/usr/local/bin/youtube-dl/youtube_dl/YoutubeDL.py", line 827, in __extract_info
    ie_result = ie.extract(url)
  File "/usr/local/bin/youtube-dl/youtube_dl/extractor/common.py", line 534, in extract
    ie_result = self._real_extract(url)
  File "/usr/local/bin/youtube-dl/youtube_dl/extractor/generic.py", line 3506, in _real_extract
    raise UnsupportedError(url)
UnsupportedError: Unsupported URL: https://tv.sohu.com/v/dXMvNTAyMjA5MTMvNjg1NjIyNTYuc2h0bWw=.html?src=pl

Description

Even though Sohu.py is present, for some reason ytdl fails to extract data with ERROR: Unsupported URL.
Or I'm I confusing sohu.py with some other website?

Test link:
https://tv.sohu.com/v/dXMvNTAyMjA5MTMvNjg1NjIyNTYuc2h0bWw=.html

@someziggyman someziggyman changed the title [Sohu] soho.py exists, but getting Unsupported URL error [Sohu] sohu.py exists, but getting Unsupported URL error May 2, 2021
foodcourt2021 added a commit to foodcourt2021/youtube-dl that referenced this issue May 19, 2021
foodcourt2021 added a commit to foodcourt2021/youtube-dl that referenced this issue May 20, 2021
foodcourt2021 added a commit to foodcourt2021/youtube-dl that referenced this issue May 20, 2021
gaming-hacker added a commit to gaming-hacker/youtube-dl that referenced this issue Sep 11, 2021
* commit '69a40d3eb06316ac944aea2a0d28d2ce15d447cd':
  [sohu] fix extractor conflict
  [sohu] Add extractor for playlist
  Added a multipart video testcase for sohu.py
  [sohu] Fix extraction (closes ytdl-org#18542, closes ytdl-org#28944)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant