Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Archive.org torrents stucks and not reach 100% #19747

Open
Jacksoft opened this issue Oct 21, 2023 · 9 comments
Open

Archive.org torrents stucks and not reach 100% #19747

Jacksoft opened this issue Oct 21, 2023 · 9 comments
Labels
Waiting info Waiting for participants to supply more info/fulfill template requirements

Comments

@Jacksoft
Copy link

Jacksoft commented Oct 21, 2023

qBittorrent & operating system versions

qBittorrent: 4.5.5 x64
Operating System: Windows 10 LTSC 2019 x64 (10.0.17763)
Qt: 5.15.10
libtorrent: 1.2.19.0

What is the problem?

Near all, but not all, Archive.org torrents stucks most of time at ~99%. Most of time are just two useless files (.xml and .sql) in the root, but in other scenarios are the data itself.
This problem was issued on two more PCs with the same torrents and an already up-to-date qbittorent client for Windows x64.

In alternative I've used Tixati and uTorrent to check if the problem was the torrent itself and not the client, but with tixati everything reach 100%.
No particular settings were touched, just the download/upload limit speeds.

archive1qbit
archive2qbit
archive3tixati

Steps to reproduce

Go to archive.org and search for any stuff (Recently I looked for PSP Media Go 11) and get it as Torrent.
Open it by double click, add it to qBittorrent and just wait.

Additional context

No response

Log(s) & preferences file(s)

(Issued reproduced on Windows 7 x64 VM with a freshly installed qBittorent just to collect new logs for privacy purposes)

qbittorrent_logs.zip

@xavier2k6
Copy link
Member

@Jacksoft Does it also happen with libtorrent 2.0.9 based qBittorrent build?

@xavier2k6 xavier2k6 added the Waiting info Waiting for participants to supply more info/fulfill template requirements label Oct 21, 2023
@stalkerok
Copy link
Contributor

stalkerok commented Oct 21, 2023

There were no problems.
magnet:?xt=urn:btih:fee51d5165497f25d8ba0b9cea0a87b639a830ab&dn=psp-media-go-11&tr=http%3a%2f%2fbt1.archive.org%3a6969%2fannounce
https://archive.org/download/psp-media-go-11/psp-media-go-11_archive.torrent

@Jacksoft
Copy link
Author

@xavier2k6 I've installed qB 4.5.5 lt20 qt5 and it worked fine. It looked faster too!
So the issue could be a sort of incompatibility with the old version of libtorrent?

@stalkerok
Copy link
Contributor

There are no issue, I downloaded and you downloaded, most likely, partially from me. You don't connect to anyone, that's the whole issue.

@LazyPajen
Copy link

This site uses all of Qbits possibilities of dl
"normal" torrent, DHT and WEB-DL
I downloaded another filer from there.
"they" change their servers often and the servers seems to be on an rotating scheme at least to me.
Had to refresh the .torrent file at least 3 times

@Vort
Copy link
Contributor

Vort commented Oct 23, 2023

I expect it may happen because of torrent file changing.
This is what such torrents have in their description:

This content hosted at the Internet Archive at [link]
Files may have changed, which prevents torrents from downloading correctly or completely; please check for an updated torrent at [link]
Note: retrieval usually requires a client that supports webseeding (GetRight style).
Note: many Internet Archive torrents contain a 'pad file' directory. This directory and the files within it may be erased once retrieval completes.
Note: the file [name] contains metadata about this torrent's contents.

Initially files are downloaded using web seeds. This is where problem may happen.
Later "real" seeds may hide problem with web seeds.

@ymgve
Copy link

ymgve commented Oct 23, 2023

I expect it may happen because of torrent file changing. This is what such torrents have in their description:

This content hosted at the Internet Archive at [link]
Files may have changed, which prevents torrents from downloading correctly or completely; please check for an updated torrent at [link]
Note: retrieval usually requires a client that supports webseeding (GetRight style).
Note: many Internet Archive torrents contain a 'pad file' directory. This directory and the files within it may be erased once retrieval completes.
Note: the file [name] contains metadata about this torrent's contents.

Initially files are downloaded using web seeds. This is where problem may happen. Later "real" seeds may hide problem with web seeds.

It's not because of torrent file changes. Re-downloading the torrent file from archive.org gives the exact same file with the same infohash.

@coldacid
Copy link

Happening with v4.6.2 as well.

Qt: 6.4.3
Libtorrent: 2.0.9.0
Boost: 1.83.0
OpenSSL: 1.1.1w
zlib: 1.3

@coldacid
Copy link

After a little bit of investigation, it is the torrent files that are at fault, but not because they've changed. In fact the problem is because they don't change even though the web seed URLs have changed on archive.org's end (both the subdomain and the bucket number in the URL path). Internet Archive's torrent file generator seems to have a penchant for throwing in invalid URLs as well that are comprised solely of the path part of the URL, but it looks like qBittorrent (or more likely libtorrent) works around that bit of brain-damage these days. However, it doesn't seem to handle HTTP redirects, and just keeps trying and failing to hit the old web seed URLs.

Fortunately this can be worked around: For any Internet Archive item {itemname} with a web seed URL matching https://ia{svrnum}.{cc}.archive.org/{bucket}/items/ just go to https://ia{svrnum}.{cc}.archive.org/{bucket}/items/{itemname}/ (trailing slash is important!) in a web browser. You'll get redirected to the new web source for the item's files, which you can then use to replace the old one on the HTTP Sources panel of the torrent details in qBittorrent (remember to remove the item name from the URL first).

e.g. item MSDNARC's torrent file says that http://ia601500.us.archive.org/33/items/ is the web seed location. I went to http://ia601500.us.archive.org/33/items/MSDNARC/ and was redirected to https://ia804503.us.archive.org/35/items/MSDNARC/ instead; I then replaced that original web seed location with https://ia804503.us.archive.org/35/items/ and was able to finish downloading the item via torrent.

Related: #15193, #17884

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Waiting info Waiting for participants to supply more info/fulfill template requirements
Projects
None yet
Development

No branches or pull requests

7 participants