[c13cl] Adding support for www.13.cl and rudo.video #8664

nicodato · 2023-11-26T23:52:46Z

IMPORTANT: PRs without the template will be CLOSED

Description of your pull request and other information

Adds extractor for www.13.cl and rudo.video.
13.cl is a TV channel from Chile, and it uses rudo.video.

With this PR, yt-dlp now supports (georestricted to Chile):
https://www.13.cl/en-vivo
https://www.13.cl/en-vivo-2
https://rudo.video/live/c13
https://rudo.video/live/t13-13cl
https://rudo.video/live/bbtv

Template

Before submitting a pull request make sure you have:

At least skimmed through contributing guidelines including yt-dlp coding conventions
Searched the bugtracker for similar pull requests
Checked the code with flake8 and ran relevant tests

In order to be accepted and merged into yt-dlp each piece of code must be in public domain or released under Unlicense. Check all of the following options that apply:

I am the original author of this code and I am willing to release it under Unlicense
I am not the original author of this code but it is in public domain or released under Unlicense (provide reliable evidence)

What is the purpose of your pull request?

Fix or improvement to an extractor (Make sure to add/update tests)
New extractor (Piracy websites will not be accepted)
Core bug fix/improvement
New feature (It is strongly recommended to open an issue first)

seproDev

Please also revert the changes to supportedsites.md. The file will get updated with the next release.

yt_dlp/extractor/rudovideo.py

yt_dlp/extractor/c13cl.py

yt_dlp/extractor/rudovideo.py

* implementing PR suggestions * removing c13cl extractor in favor of RudoVideo with _EMBED_REGEX * supporting VODs and Podcasts from rudo.video * supporting embeded youtube * adding tests

nicodato · 2023-12-02T21:12:26Z

Hello @seproDev , I have just implemented your suggestions.
I added support for VODs and podcast. Some interesting data:
Both https://rudo.video/podcast/cz2wrUy8l0o and https://rudo.video/vod/cz2wrUy8l0o gives you the same site (a podcast). Same thing happens with a real podcast. It looks like doesn't work with /live/ URLs.
And I found this content https://rudo.video/vod/czfvKUuULV8 with in reality is an embedded youtube

yt_dlp/extractor/rudovideo.py

nicodato · 2023-12-09T14:45:05Z

No problem :) Thanks for your advice @seproDev
I have just split the class into two clasess, RudoVideoIE and RudoVideoLiveIE. Also, I used those _og_search_* methods.

nicodato · 2023-12-10T23:25:59Z

@seproDev I have used @pukkandan 's commit and modified just a little. The line to obtain the m3u8_url must first check for the streamURL variable, and only if it fails, search for the source tag.
This is because with https://rudo.video/live/bbtv (this is not geo-restrictred) the source tag has a generic m3u8 that doesn't work. That site also has the streamURL variable and that's the url that works. So using one regex to search for either streamURL or source tag failed. It was returning the generic m3u8 from the source instead of the streamURL variable.
IIRC podcasts have the m3u8 in the soruce tag.

Everything else should be just like pukkandan's suggestion

yt_dlp/extractor/rudovideo.py

bashonly · 2023-12-12T00:20:46Z

yt_dlp/extractor/rudovideo.py

+            m3u8_url = update_url_query(m3u8_url, {
+                'auth-token': traverse_obj(access_token, ('data', 'authToken'))
+            })


should this be fatal?

Suggested change

m3u8_url = update_url_query(m3u8_url, {

'auth-token': traverse_obj(access_token, ('data', 'authToken'))

})

m3u8_url = update_url_query(m3u8_url, {

'auth-token': access_token['data']['authToken'],

})

I would do this instead

if token_array: token_url = traverse_obj(token_array, (..., {url_or_none}), get_all=False) if not token_url: raise ExtractorError('Invalid access token array') access_token = self._download_json( token_url, video_id, note='Downloading access token')['data']['authToken'] m3u8_url = update_url_query(m3u8_url, {'auth-token': access_token})

Edit:

My first PR instead of using authToken and auth-token, it used the token_array elements.

are you saying the request to download token is unnecessary?

thanks @bashonly , I have just pushed your suggestion
(somehow I removed my previous comment by mistake)

are you saying the request to download token is unnecessary?

No. We need to download the token.
I meant that the array has something like this:

["https://example.com/tokenapi", ..., "authToken", "auth-token", ...]

So previously, I was downloading the token, and then using the access_token and the token_array elements to construct the query string this:

yt-dlp/yt_dlp/extractor/rudovideo.py

Lines 22 to 26 in 00edf40

access_token_webpage = self._download_webpage(token_array[0], video_id)

access_token = self._parse_json(access_token_webpage, video_id)

if "data" not in access_token or token_array[3] not in access_token.get("data"):

raise ExtractorError('Couldnt get access token', video_id=video_id)

query_string = token_array[5] + traverse_obj(access_token, ("data", token_array[3]))

ah alright. LGTM then

yt_dlp/extractor/rudovideo.py

…PI response doesn't contain the data.authToken value

seproDev · 2023-12-17T15:40:03Z

yt_dlp/extractor/rudovideo.py

+            access_token = self._download_json(
+                token_url, video_id, note='Downloading access token')['data']['authToken']
+            m3u8_url = update_url_query(m3u8_url, {'auth-token': access_token})
+


Podcasts such as https://rudo.video/podcast/b42ZUznHX0 are sometimes served as direct mp3 files, which currently break the extractor. A simple solution would be to check by extension:

Suggested change

if determine_ext(media_url) == 'm3u8':

formats = self._extract_m3u8_formats(media_url, video_id, live=is_live)

else:

formats = [{'url': media_url}]

I'd also rename the variable to media_url.

if the extension is mp3, we may also want to add 'vcodec': 'none' to the format dict

yt_dlp/extractor/rudovideo.py

nicodato · 2023-12-21T18:42:38Z

@seproDev I pushed your suggestions, adding support for the MP3 podcast, including a test

Authored by: nicodato

nicolasdato added 3 commits November 26, 2023 19:50

[rudovideo] Add rudo.video extractor

00edf40

[13.cl] Add https://www.13.cl extractor

10b2873

fix linter error

b2d9837

seproDev added the site-request Request to support a new website label Nov 27, 2023

garret1317 added the geo-blocked Content is geo-blocked label Nov 27, 2023

seproDev requested changes Nov 30, 2023

View reviewed changes

seproDev added the pending-fixes PR has had changes requested label Nov 30, 2023

nicodato added 3 commits November 30, 2023 22:47

revert supportedsites.md

9df466e

[rudovideo] improvements to rudovideo extractor:

cfce954

* implementing PR suggestions * removing c13cl extractor in favor of RudoVideo with _EMBED_REGEX * supporting VODs and Podcasts from rudo.video * supporting embeded youtube * adding tests

Merge branch 'master' into c13cl

0560d30

seproDev requested changes Dec 5, 2023

View reviewed changes

yt_dlp/extractor/rudovideo.py Outdated Show resolved Hide resolved

yt_dlp/extractor/rudovideo.py Outdated Show resolved Hide resolved

bashonly self-requested a review December 6, 2023 18:42

nicodato added 2 commits December 9, 2023 11:40

[rudovideo] split it into RudoVideoIE and RudoVideoLiveIE

cf87ca6

Merge remote-tracking branch 'origin/master' into c13cl

6314563

[rudovideo] join RudoVideoLiveIE and RudoVideoIE in one extractor

6c5983a

pukkandan reviewed Dec 11, 2023

View reviewed changes

yt_dlp/extractor/rudovideo.py Outdated Show resolved Hide resolved

Update yt_dlp/extractor/rudovideo.py

d0a6427

pukkandan approved these changes Dec 11, 2023

View reviewed changes

pukkandan assigned seproDev Dec 11, 2023

pukkandan removed the pending-fixes PR has had changes requested label Dec 11, 2023

bashonly approved these changes Dec 12, 2023

View reviewed changes

nicodato commented Dec 12, 2023

View reviewed changes

yt_dlp/extractor/rudovideo.py Outdated Show resolved Hide resolved

nicodato added 2 commits December 12, 2023 11:47

[rudovideo] improving the access_token. now it's fatal if the token A…

66582ba

…PI response doesn't contain the data.authToken value

fix flake8

b01da23

seproDev requested changes Dec 17, 2023

View reviewed changes

nicodato added 2 commits December 21, 2023 15:30

Merge branch 'master' into c13cl

878c6bc

[rudovideo] adding support for MP3 podcasts

9c8175c

[rudovideo] forgot to rename a variable

6511c09

seproDev approved these changes Dec 21, 2023

View reviewed changes

Add vcodec none to mp3 files

52cd9fd

seproDev merged commit 0d531c3 into yt-dlp:master Dec 22, 2023
15 checks passed

nicodato deleted the c13cl branch December 26, 2023 17:37

aalsuwaidi pushed a commit to aalsuwaidi/yt-dlp that referenced this pull request Apr 21, 2024

[ie/RudoVideo] Add extractor (yt-dlp#8664)

ec3a51c

Authored by: nicodato

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[c13cl] Adding support for www.13.cl and rudo.video #8664

[c13cl] Adding support for www.13.cl and rudo.video #8664

nicodato commented Nov 26, 2023

seproDev left a comment

nicodato commented Dec 2, 2023 •

edited

nicodato commented Dec 9, 2023

nicodato commented Dec 10, 2023

bashonly Dec 12, 2023

bashonly Dec 12, 2023 •

edited

nicodato Dec 12, 2023

nicodato Dec 12, 2023 •

edited

bashonly Dec 12, 2023

seproDev Dec 17, 2023 •

edited

bashonly Dec 21, 2023

nicodato commented Dec 21, 2023

	access_token_webpage = self._download_webpage(token_array[0], video_id)
	access_token = self._parse_json(access_token_webpage, video_id)
	if "data" not in access_token or token_array[3] not in access_token.get("data"):
	raise ExtractorError('Couldnt get access token', video_id=video_id)
	query_string = token_array[5] + traverse_obj(access_token, ("data", token_array[3]))

+        if determine_ext(media_url) == 'm3u8':
+            formats = self._extract_m3u8_formats(media_url, video_id, live=is_live)
+        else:
+            formats = [{'url': media_url}]

[c13cl] Adding support for www.13.cl and rudo.video #8664

[c13cl] Adding support for www.13.cl and rudo.video #8664

Conversation

nicodato commented Nov 26, 2023

Description of your pull request and other information

Before submitting a pull request make sure you have:

In order to be accepted and merged into yt-dlp each piece of code must be in public domain or released under Unlicense. Check all of the following options that apply:

What is the purpose of your pull request?

seproDev left a comment

Choose a reason for hiding this comment

nicodato commented Dec 2, 2023 • edited

nicodato commented Dec 9, 2023

nicodato commented Dec 10, 2023

bashonly Dec 12, 2023

Choose a reason for hiding this comment

bashonly Dec 12, 2023 • edited

Choose a reason for hiding this comment

nicodato Dec 12, 2023

Choose a reason for hiding this comment

nicodato Dec 12, 2023 • edited

Choose a reason for hiding this comment

bashonly Dec 12, 2023

Choose a reason for hiding this comment

seproDev Dec 17, 2023 • edited

Choose a reason for hiding this comment

bashonly Dec 21, 2023

Choose a reason for hiding this comment

nicodato commented Dec 21, 2023

nicodato commented Dec 2, 2023 •

edited

bashonly Dec 12, 2023 •

edited

nicodato Dec 12, 2023 •

edited

seproDev Dec 17, 2023 •

edited