New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[youtube] Redefine uploader metadata #6384
[youtube] Redefine uploader metadata #6384
Conversation
…channel-map # Conflicts: # yt_dlp/extractor/youtube.py
cc @dirkf |
I compared the extractor test-cases. Where yt-dlp specifies uploader metadata here, the yt-dl test-cases in the latest git master generally match if present. Here are the non-matching ones. YoutubeIE: --- yt-dl
+++ yt-dlp
...
'url': 'OtqTfy26tG0',
...
- 'uploader': 'The Cinematic Orchestra - Topic',
+ 'uploader': 'The Cinematic Orchestra',
... I didn't change this result, which is Sergey's from 0cf09c2, though the current yt-dlp result looks more plausible. I think these remaining differences are all results that I changed in ytdl-org/youtube-dl@f7ce98a. YoutubePlaylistIE: --- yt-dl
+++ yt-dlp
...
'url': 'PLBB231211A4F62143',
...
- 'uploader_id': '@WickmanVT',
+ 'uploader_id': 'UCKSpbfbl5kRQpTdL7kMc-1Q',
...
'url': 'https://www.youtube.com/embed/videoseries?list=PL6IaIsEjSbf96XFRuNccS_RuEXwNdsoEu',
...
'id': 'PL6IaIsEjSbf96XFRuNccS_RuEXwNdsoEu',
...
- 'uploader_id': '@milan5503',
+ 'uploader_id': 'UCEI1-PVPcYXjB73Hfelbmaw',
...
'url': 'http://www.youtube.com/embed/_xDOZElKyNU?list=PLsyOSbh5bs16vubvKePAQ1x3PhKavfBIl',
...
- 'uploader_id': '@music_king',
+ 'uploader_id': 'UC21nz3_MesPLqtDqwdvnoxA',
... YoutubeYtBeIE: --- yt-dl
+++ yt-dlp
...
'url': 'https://youtu.be/yeWKywCrFtk?list=PL2qgrgXsNUG5ig9cat4ohreBjYLAPC0J5',
...
- 'uploader_id': '@backuspagemuseum',
+ 'uploader_id': 'backuspagemuseum',
... |
Co-authored-by: pukkandan <pukkandan.ytdlp@gmail.com>
Co-authored-by: pukkandan <pukkandan.ytdlp@gmail.com>
Co-authored-by: pukkandan <pukkandan.ytdlp@gmail.com>
Co-authored-by: pukkandan <pukkandan.ytdlp@gmail.com>
TODO:
|
Reviewing these tests for a new yt-dl extractor version, I find a contradiction between the IB3lcPjvWLA: Currently the yt-dl extractor doesn't test these values because it just sets The second test is commented [Separately, what are these The first value is here (similar to where yt-dlp gets it) traverse_obj(initial_data, (
'contents', 'twoColumnWatchNextResults', 'results', 'results',
'contents', ..., lambda k, _: k.endswith('SecondaryInfoRenderer'),
('owner', 'videoOwner'), 'videoOwnerRenderer', 'title'),
get_all=False) and here traverse_obj(initial_data, (
'engagementPanels', ..., 'engagementPanelSectionListRenderer',
'content', 'structuredDescriptionContentRenderer', 'items', ...,
'videoDescriptionHeaderRenderer', 'channel'),
get_all=False) Everywhere else, eg What should be the correct resolution of this I expect a similar issue applies to tab extraction of video data for playlists. The |
An issue should be raised for this. The topic suffix is correct unless YouTube has changed the channel. It's a topic channel which are quite messy to deal with. |
IMPORTANT: PRs without the template will be CLOSED
Description of your pull request and other information
uploader
metadata for videos, tabs, etc. is/was inconsistent and not well defined. With handles and the channel UCID url now being the two primary urls to a channel, this updates the metadata to reflect that.New mapping will be strictly:
Note this is slightly different than ytdl-org/youtube-dl@f7ce98a
Template
Before submitting a pull request make sure you have:
In order to be accepted and merged into yt-dlp each piece of code must be in public domain or released under Unlicense. Check all of the following options that apply:
What is the purpose of your pull request?