Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue with URLs containing apostrophes converted to %27 in HTML #17374

Closed
digitalhybrid opened this issue Aug 29, 2018 · 2 comments
Closed

Issue with URLs containing apostrophes converted to %27 in HTML #17374

digitalhybrid opened this issue Aug 29, 2018 · 2 comments

Comments

@digitalhybrid
Copy link

@digitalhybrid digitalhybrid commented Aug 29, 2018

Please follow the guide below

  • You will be asked some questions and requested to provide some information, please read them carefully and answer honestly
  • Put an x into all the boxes [ ] relevant to your issue (like this: [x])
  • Use the Preview tab to see what your issue will actually look like

Make sure you are using the latest version: run youtube-dl --version and ensure your version is 2018.08.28. If it's not, read this FAQ entry and update. Issues with outdated version will be rejected.

  • I've verified and I assure that I'm running youtube-dl 2018.08.28

Before submitting an issue make sure you have:

  • At least skimmed through the README, most notably the FAQ and BUGS sections
  • Searched the bugtracker for similar issues including closed ones
  • Checked that provided video/audio/playlist URLs (if any) are alive and playable in a browser

What is the purpose of your issue?

  • Bug report (encountered problems with youtube-dl)
  • Site support request (request for adding support for a new site)
  • Feature request (request for a new functionality)
  • Question
  • Other

The following sections concretize particular purposed issues, you can erase any section (the contents between triple ---) not applicable to your issue


If the purpose of this issue is a bug report, site support request or you are not completely sure provide the full verbose output as follows:

Add the -v flag to your command line you run youtube-dl with (youtube-dl -v <your command line>), copy the whole output and insert it here. It should look similar to one below (replace it with your log inserted between triple ```):

[debug] System config: []
[debug] User config: []
[debug] Command-line args: [u'-v', u'http://www.youtube.com/watch?v=BaW_jenozKcj']
[debug] Encodings: locale cp1251, fs mbcs, out cp866, pref cp1251
[debug] youtube-dl version 2018.08.28
[debug] Python version 2.7.11 - Windows-2003Server-5.2.3790-SP2
[debug] exe versions: ffmpeg N-75573-g1d0487f, ffprobe N-75573-g1d0487f, rtmpdump 2.4
[debug] Proxy map: {}
...
<end of log>

If the purpose of this issue is a site support request please provide all kinds of example URLs support for which should be included (replace following example URLs by yours):

Note that youtube-dl does not support sites dedicated to copyright infringement. In order for site support request to be accepted all provided example URLs should not violate any copyrights.


Description of your issue, suggested solution and other information

Explanation of your issue in arbitrary form goes here. Please make sure the description is worded well enough to be understood. Provide as much context and examples as possible.
If work on your issue requires account credentials please provide them or explain how one can obtain them.

Hello guys,

Long time user, but I'm having an issue when trying to download files containing an apostrophe in the hyperlink. Here are two examples that refuse to download ...

https://www.nbc.com/up-all-night/video/mr.-bob%27s-toddler-kaleidoscope/n2179
https://www.nbc.com/up-all-night/video/day-after-valentine%27s-day/n2189

The log file says the following for each of them ...

[08/29/18 19:44:50] WARNING: Failed to download m3u8 information: HTTP Error 403: Forbidden
[08/29/18 19:46:13] ERROR: list index out of range
[08/29/18 19:46:13] ERROR: list index out of range

All other URLs from these series work without issue. It's only these two that are a problem, and it seems the common denominator is the HTML apostrophe (%27) that's causing the problem.

Any chance of a fix, please?

@digitalhybrid
Copy link
Author

@digitalhybrid digitalhybrid commented Aug 29, 2018

Actually, a fix may not be absolutely necessary, but it would certainly make life easier.

I decided to try one other thing, and that was to change %27 in the hyperlink to be an actual apostrophe (') and youtube-dl finally decided to download the files.

@kartik-karz
Copy link

@kartik-karz kartik-karz commented Aug 29, 2018

@dstftw The problem with this is regarding the utf8 conversion for the url and I'm not sure if using this would have any unintended side effects, I would like to work on this for my first request here to just get to know the code and would like to know your opinion on this one
from urllib.parse import unquote as uq
url = uq(url,'utf-8')

@dstftw dstftw closed this in d0c5fab Sep 8, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Linked pull requests

Successfully merging a pull request may close this issue.

None yet
2 participants
You can’t perform that action at this time.