[dnb] Add new extractor #18725

user706 · 2019-01-02T22:19:25Z

Please follow the guide below

You will be asked some questions, please read them carefully and answer honestly
Put an x into all the boxes [ ] relevant to your pull request (like that [x])
Use Preview tab to see how your pull request will actually look like

Before submitting a pull request make sure you have:

At least skimmed through adding new extractor tutorial and youtube-dl coding conventions sections
Searched the bugtracker for similar pull requests
Checked the code with flake8

In order to be accepted and merged into youtube-dl each piece of code must be in public domain or released under Unlicense. Check one of the following options:

I am the original author of this code and I am willing to release it under Unlicense
I am not the original author of this code but it is in public domain or released under Unlicense (provide reliable evidence)

What is the purpose of your pull request?

Bug fix
Improvement
New extractor
New feature

Description of your pull request and other information

New extractor for e.g. https://portal.dnb.de/audioplayer/do/show/1077188552

user706 · 2019-01-02T22:33:22Z

Strange: the download only succeeds on the second attempt. The first run fails with HTTP 404.

Example:

$ python3 -m youtube_dl https://portal.dnb.de/audioplayer/do/show/107679887X
[DNB] 107679887X: Downloading webpage
[download] Downloading playlist: None
[DNB] playlist None: Collected 1 video ids (downloading 1 of them)
[download] Downloading video 1 of 1
ERROR: unable to download video data: HTTP Error 404: Not Found

$ python3 -m youtube_dl https://portal.dnb.de/audioplayer/do/show/107679887X
[DNB] 107679887X: Downloading webpage
[download] Downloading playlist: None
[DNB] playlist None: Collected 1 video ids (downloading 1 of them)
[download] Downloading video 1 of 1
[download] Destination: Gavotte und Bourée aus der 'Französischen Suite Nr. 5' [Elektronische Ressource] _ Johann Sebastian Bach-107679887X.mp3
[download] 100% of 4.48MiB in 00:01
[download] Finished downloading playlist: None

Here are some urls to test with:
https://portal.dnb.de/audioplayer/do/show/1077188552
https://portal.dnb.de/audioplayer/do/show/1077187920
https://portal.dnb.de/audioplayer/do/show/107745936X
https://portal.dnb.de/audioplayer/do/show/1077188145
https://portal.dnb.de/audioplayer/do/show/1076798888
https://portal.dnb.de/audioplayer/do/show/107679887X
https://portal.dnb.de/audioplayer/do/show/1076798861

dstftw · 2019-01-03T11:16:55Z

youtube_dl/extractor/dnb.py

+
+
+class DNBIE(InfoExtractor):
+    _VALID_URL = r'https?://(?:portal\.dnb\.de/audioplayer/do/show/|d-nb\.info/)(?P<id>\w+)[/&]?'


[/&]? does not make any sense at the end.
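A quick way to check this remark, as a sketch only: it assumes the pattern is applied with `re.match` (anchored at the start, not at the end), and the `d-nb.info` URL below is a made-up sample built from the pattern, not a tested link. Under that assumption an optional trailing character class changes neither whether a URL matches nor which id is captured.

```python
import re

# Pattern from the diff, and the same pattern with the trailing [/&]? dropped.
WITH_TAIL = r'https?://(?:portal\.dnb\.de/audioplayer/do/show/|d-nb\.info/)(?P<id>\w+)[/&]?'
WITHOUT_TAIL = r'https?://(?:portal\.dnb\.de/audioplayer/do/show/|d-nb\.info/)(?P<id>\w+)'

URLS = [
    'https://portal.dnb.de/audioplayer/do/show/107679887X',
    'https://portal.dnb.de/audioplayer/do/show/1077188552/',
    'https://d-nb.info/1077188552',  # hypothetical sample URL
]


def extract_id(pattern, url):
    """Return the captured id, or None if the URL does not match."""
    m = re.match(pattern, url)
    return m.group('id') if m else None


ids_with = [extract_id(WITH_TAIL, u) for u in URLS]
ids_without = [extract_id(WITHOUT_TAIL, u) for u in URLS]

# Both patterns accept the same URLs and capture the same ids.
print(ids_with == ids_without)
```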

dstftw · 2019-01-03T11:17:20Z

youtube_dl/extractor/dnb.py

+    }]
+
+    @staticmethod
+    def update_and_return_dic(info_dict, update_info):


dstftw · 2019-01-03T11:17:42Z

youtube_dl/extractor/dnb.py

+                'id': obj.get('idn'),
+                'title': obj.get('title'),
+                'author': obj.get('author'),
+                'url': 'https://portal.dnb.de/' + obj.get('media_url'),


dstftw · 2019-01-03T11:17:55Z

youtube_dl/extractor/dnb.py

+        for obj in objs:
+            thumbnail = obj.get('cover_url')
+            if thumbnail:
+                thumbnail = 'https://portal.dnb.de/' + thumbnail


dstftw · 2019-01-03T11:19:18Z

youtube_dl/extractor/dnb.py

+
+            info_dict = {
+                'id': obj.get('idn'),
+                'title': obj.get('title'),


Mandatory. Read coding conventions.
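One possible reading of this remark, sketched outside the extractor: `obj.get('title')` silently yields `None` when the key is missing, so a mandatory field should be read in a way that fails loudly, while optional fields can stay tolerant. `build_entry` is a hypothetical helper written for illustration; the key names are taken from the diff.

```python
def build_entry(obj):
    """Build a result dict from one media object (hypothetical helper)."""
    entry = {
        # Mandatory fields: plain indexing raises KeyError if they are missing,
        # instead of quietly producing a result with id or title set to None.
        'id': obj['idn'],
        'title': obj['title'],
    }
    # Optional field: absence is acceptable, so .get() is fine here.
    author = obj.get('author')
    if author:
        entry['author'] = author
    return entry


print(build_entry({'idn': '107679887X', 'title': 'Gavotte', 'author': 'J. S. Bach'}))
```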

dstftw · 2019-01-03T11:20:07Z

youtube_dl/extractor/dnb.py

+        webpage = self._download_webpage(url, video_id)
+
+        m = re.search(r'fdnbpl.media\s*=\s*(\[.*\]);', webpage)
+        objs = json.loads(m.group(1))


_search_regex, _parse_json. Again: read coding conventions.
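The problem with the bare `re.search` plus `json.loads` pair is that a missing match makes `m` equal to `None`, so `m.group(1)` fails with an unhelpful `AttributeError`. The sketch below is a standalone stand-in that only imitates the fail-with-a-clear-message behaviour; it is not the helpers themselves. `extract_media_list` and the sample page are made up for illustration. Inside the extractor, the equivalent would presumably be along the lines of `self._parse_json(self._search_regex(pattern, webpage, 'media list'), video_id)`.

```python
import json
import re


def extract_media_list(webpage):
    """Find and decode the fdnbpl.media array (hypothetical stand-in helper)."""
    # The dot is escaped here; in the diff's pattern it matched any character.
    m = re.search(r'fdnbpl\.media\s*=\s*(\[.*\]);', webpage)
    if m is None:
        raise ValueError('Unable to find media list in webpage')
    try:
        return json.loads(m.group(1))
    except ValueError as e:  # json.JSONDecodeError subclasses ValueError
        raise ValueError('Unable to parse media list: %s' % e)


# Made-up page fragment, for illustration only.
sample_page = '<script>fdnbpl.media = [{"idn": "107679887X", "title": "Gavotte"}];</script>'
print(extract_media_list(sample_page))
```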

user706 added 3 commits January 2, 2019 23:17

[dnb] Add new extractor

b491341

[dnb] fix id (can also have letters such as "X")

db2672e

[dnb] remove debug print

2444cad

dstftw requested changes Jan 3, 2019

dstftw added the pending-fixes label Jan 3, 2019

dstftw force-pushed the master branch from d99bab0 to e118a87 Compare January 23, 2019 18:41

dstftw force-pushed the master branch 2 times, most recently from 7b956a1 to 5e26784 Compare September 13, 2020 13:51

cypheron mentioned this pull request Feb 3, 2021

Evaluation / overview of new proposed extractors / sites #28054

Open

dirkf force-pushed the master branch from 01bf89e to 4c6fba3 Compare August 26, 2022 07:51

dirkf closed this Aug 1, 2023

dirkf added the defunct PR source branch is not accessible label Oct 2, 2023
