Join GitHub today
GitHub is home to over 20 million developers working together to host and review code, manage projects, and build software together.
Don't require a full archive mirror #5
Comments
|
Hi!
To mitigate the last issue, one could move the "select interesting packages" step to the archive software, and have it return a list of |
iainlane
commented
Dec 2, 2015
|
It seems annoying to have to maintain an extra place (dak, Launchpad) to tell us which packages are interesting. It would be possible to remember which versions you've looked at, so that you don't miss any new packages, but the problem that Contents can lag behind the archive remains. |
|
A potential solution could be mirroring packages based on previous processing of the Contents file, and make a missing-deb-package not an error in case some special |
cjwatson
commented
Dec 9, 2015
|
We could also perhaps rescan anything newer than Contents - fuzz-factor when Contents information becomes available. An alternative option comes to mind. One way or another, it sounds like we'll end up fetching a large fraction of the archive, but we only use any given file once because the DEP-11 generator remembers which packages it's already examined. Would it make sense to have an option to stream packages on demand rather than generating them? That wouldn't change the network requirements, and the first run would of course take a very long time, but it would radically reduce the disk storage requirements which might simplify things in the sort of environment we'll be deploying to. |
cjwatson
commented
Dec 9, 2015
|
Regarding putting the interesting test in the archive software, while at some level this makes sense it also introduces quite a lot of coupling that I think would get annoying as time goes on, so I think I agree with @iainlane there. The current design has the virtue of having practically no coupling with the archive software at all; in fact as far as I can tell it requires no changes to Launchpad itself, only to some Ubuntu-specific publishing scripts. |
iainlane commentedDec 1, 2015
We just talked about deploying appstream-dep11 into production for Ubuntu. In our environment we don't have access to a mountable full mirror of the archive, so we will need to create one locally. That is quite a lot of space if you multiply by all arches and (future) releases.
Is it possible to instead rely on the Contents files to tell us which packages to be interested in, and only download those packages - so that we don't need a full mirror?
Would be happy to work on this if it's going to be feasible.