[ArchiveOrg] Video with (encoded) question mark in file name cannot be downloaded with 404 error #9173
Closed
11 tasks done
Labels
patch-available
There is patch available that should fix this issue. Someone needs to make a PR with it
site-bug
Issue with a specific website
DO NOT REMOVE OR SKIP THE ISSUE TEMPLATE
Checklist
Region
Europe/Czechia
Provide a description that is worded well enough to be understood
As an example the Mothy Python's "Whither Canada?" cannot be downloaded (for free without an account) from Internet Archive (http://archive.org) when using the page URL ("details" in URL) but can be downloaded with direct link ("download" in URL).
Problematic URLs are:
All above URLs reports "
ERROR: unable to download video data: HTTP Error 404: Not Found
"All above URLs tested both quoted and unquoted and from Command shell and PowerShell (Windows)
Trying to escape the question mark ends with message that no video is found: "
[archive.org] Playlist Monty Python's Flying Circus: The Complete Series 1 to 4 [BLU-RAY] [SD]: Downloading 0 items
"The problem is probably in the last line
[debug] Invoking http downloader on "https://archive.org/download/mpfc-remastered_20210305_1553/01. Whither Canada?.mkv"
which indicates that the question mark in the URL was decoded from%3F
to?
(or vice-versa was not correctly encoded when read from the downloaded page) which then converts the extension.mkv
into QUERY parameter and server cannot find the file without the extension.The file can be correctly downloaded via yt-dlp when manually searched for the file on server (Download options - Matroska - "01. Whither Canada?.mkv"):
All other parts can be downloaded from archive.org without a problem, e.g.
Provide verbose output that clearly demonstrates the problem
yt-dlp -vU <your command line>
)'verbose': True
toYoutubeDL
params instead[debug] Command-line config
) and insert it belowComplete Verbose Output
The text was updated successfully, but these errors were encountered: