[scientology] Add new extractor #16184

sguan28 · 2018-04-14T07:18:51Z

Please follow the guide below

You will be asked some questions, please read them carefully and answer honestly
Put an x into all the boxes [ ] relevant to your pull request (like that [x])
Use Preview tab to see how your pull request will actually look like

Before submitting a pull request make sure you have:

At least skimmed through adding new extractor tutorial and youtube-dl coding conventions sections
Searched the bugtracker for similar pull requests
Checked the code with flake8

In order to be accepted and merged into youtube-dl each piece of code must be in public domain or released under Unlicense. Check one of the following options:

I am the original author of this code and I am willing to release it under Unlicense
I am not the original author of this code but it is in public domain or released under Unlicense (provide reliable evidence)

What is the purpose of your pull request?

Bug fix
Improvement
New extractor
New feature

Description of your pull request and other information

Added a new extractor for scientology.tv for this issue

dstftw · 2018-04-14T09:36:25Z

youtube_dl/extractor/scientology.py

+        webpage = self._download_webpage(url, video_id)
+
+        title = self._html_search_regex(r'<title>(.+?)</title>', webpage, 'title').strip()
+        description = self._html_search_regex(r'<meta name="description" content="(.+?)" />', webpage, 'description').strip()


_html_search_meta.

dstftw · 2018-04-14T09:37:01Z

youtube_dl/extractor/scientology.py

+        webpage = self._download_webpage(url, video_id)
+
+        title = self._html_search_regex(r'<title>(.+?)</title>', webpage, 'title').strip()
+        description = self._html_search_regex(r'<meta name="description" content="(.+?)" />', webpage, 'description').strip()


Bother to read coding conventions on optional meta fields.

dstftw · 2018-04-14T09:37:03Z

youtube_dl/extractor/scientology.py

+
+        title = self._html_search_regex(r'<title>(.+?)</title>', webpage, 'title').strip()
+        description = self._html_search_regex(r'<meta name="description" content="(.+?)" />', webpage, 'description').strip()
+        description = re.sub("[^a-zA-Z0-9.,_\s-]+", " ", description)


dstftw · 2018-04-14T09:37:34Z

youtube_dl/extractor/scientology.py

+
+        # changing address for extration url
+        extract_ext = re.search(r'<episode-video>(.*?)</episode-video>', webpage).group(0)
+        extract_ext = extract_ext.replace('<episode-video>', '').replace('</episode-video>', '')


_search_regex.

dstftw · 2018-04-14T09:37:42Z

youtube_dl/extractor/scientology.py

+        # changing address for extration url
+        extract_ext = re.search(r'<episode-video>(.*?)</episode-video>', webpage).group(0)
+        extract_ext = extract_ext.replace('<episode-video>', '').replace('</episode-video>', '')
+        url = url[:url.find('/', 10)] + extract_ext


scientology.py Add new extractor

b0434ad

sguan28 changed the title ~~scientology.py Add new extractor~~ [scientology] Add new extractor Apr 14, 2018

dstftw requested changes Apr 14, 2018

View reviewed changes

dstftw added the pending-fixes label Apr 14, 2018

dstftw force-pushed the master branch from d99bab0 to e118a87 Compare January 23, 2019 18:41

dstftw force-pushed the master branch 2 times, most recently from 7b956a1 to 5e26784 Compare September 13, 2020 13:51

cypheron mentioned this pull request Feb 3, 2021

Evaluation / overview of new proposed extractors / sites #28054

Open

dirkf force-pushed the master branch from 01bf89e to 4c6fba3 Compare August 26, 2022 07:51

dirkf closed this Aug 1, 2023

dirkf added the defunct PR source branch is not accessible label Oct 2, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[scientology] Add new extractor #16184

[scientology] Add new extractor #16184

sguan28 commented Apr 14, 2018

dstftw Apr 14, 2018

dstftw Apr 14, 2018

dstftw Apr 14, 2018

dstftw Apr 14, 2018

dstftw Apr 14, 2018

[scientology] Add new extractor #16184

[scientology] Add new extractor #16184

Conversation

sguan28 commented Apr 14, 2018

Please follow the guide below

Before submitting a pull request make sure you have:

In order to be accepted and merged into youtube-dl each piece of code must be in public domain or released under Unlicense. Check one of the following options:

What is the purpose of your pull request?

Description of your pull request and other information

dstftw Apr 14, 2018

Choose a reason for hiding this comment

dstftw Apr 14, 2018

Choose a reason for hiding this comment

dstftw Apr 14, 2018

Choose a reason for hiding this comment

dstftw Apr 14, 2018

Choose a reason for hiding this comment

dstftw Apr 14, 2018

Choose a reason for hiding this comment