Automated Repository Testing #2731

Closed · fearphage opened this issue May 15, 2019 · 16 comments

It's not uncommon for the (Debian?) repository to be broken (#2608).

Is there an automated (CI-able) way to confirm this is working so the owners don't have to find out from user-initiated bug reports?

dagood (Member) commented May 15, 2019

As far as I know, this has only happened twice, and only on the Ubuntu 14.04 repository. I'm asking the repository owners about this in my mail about the recent instance, though. Other Microsoft products depend on these feeds, so it shouldn't be on .NET Core to set up some sort of health check.

dagood (Member) commented May 15, 2019

/cc @leecow

fearphage (Author) commented:

> As far as I know, this has only happened twice, and only on the Ubuntu 14.04 repository.

That's one time too many in my opinion. That's the purpose of (regression) tests: when you find a bug, you write a test case for it, and then, at a minimum, you'll know the next time it breaks.

dagood (Member) commented May 15, 2019

Agreed; I was just contrasting that with "not uncommon".

dagood (Member) commented Jul 29, 2019

The repo owners have set up some monitoring and alerting to catch this. My impression is that the system is too complex for pre-publish testing to actually catch these things, so this is probably the best we can hope for right now (the owners automatically knowing there's a problem rather than us having to ping them). There are continuing conversations about how to improve the service.

dagood closed this as completed Jul 29, 2019

fearphage (Author) commented Aug 21, 2019

> The repo owners have set up some monitoring and alerting to catch this.

It may be ineffective, since the issue is back yet again. It seems like running an Ubuntu Docker image, executing sudo apt-get update after the deploy, and ensuring a successful (0) exit code would suffice; a sketch of that kind of check follows.
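A minimal sketch of such a post-publish check, assuming Docker is available on the CI host; the image tag and feed-setup command here are illustrative placeholders, not the team's actual pipeline:

```python
#!/usr/bin/env python3
"""Post-publish smoke test sketch: do the documented repo setup inside a clean
Ubuntu container, run `apt-get update`, and fail on a non-zero exit code."""
import subprocess
import sys

UBUNTU_IMAGE = "ubuntu:14.04"  # hypothetical: run one pass per supported distro/version

# Placeholder for whatever the published install instructions tell users to run
# (adding the package feed and its signing key) before `apt-get update`.
SETUP_COMMANDS = "echo 'add the package feed and signing key here'"

def main() -> int:
    script = f"{SETUP_COMMANDS} && apt-get update"
    # `bash -ec` makes the container exit non-zero as soon as any step fails.
    result = subprocess.run(
        ["docker", "run", "--rm", UBUNTU_IMAGE, "bash", "-ec", script],
        check=False,
    )
    if result.returncode != 0:
        print("apt-get update failed; the published feed looks broken", file=sys.stderr)
    return result.returncode

if __name__ == "__main__":
    sys.exit(main())
```

Run once per supported distro image right after each publish, something like this could turn this class of breakage into a failed pipeline step instead of a user-filed bug report.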

Note: This doesn't feel fixed to me, but I'm unable to reopen the issue.

dagood (Member) commented Aug 21, 2019

I'll be interested to hear from them whether the monitoring caught this. I don't know what response time we expect them to have when it is caught automatically.

dagood (Member) commented Aug 21, 2019

Rolling back to last known good on error would be ideal, of course. 😕

herebebeasties commented:

> My impression is that the system is too complex for pre-publish testing to actually catch these things

If your PGP-signed InRelease file is missing, or has an earlier timestamp than your Release file, it's clearly not going to work. I have a complete lack of context around this process so may be wrong, but that seems easy enough to test/monitor (a sketch is below). 😕
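A minimal monitoring sketch along those lines, assuming a hypothetical dist URL (the real feed layout may differ); it only checks that InRelease exists and that its Date field is at least as new as the one in Release:

```python
#!/usr/bin/env python3
"""Monitoring sketch: verify that a dist's InRelease file exists and is not
older than its Release file. The DIST_URL below is a hypothetical placeholder."""
from email.utils import parsedate_to_datetime
from urllib.request import urlopen
import sys

# Hypothetical feed; point this at the dists/<suite> directory being monitored.
DIST_URL = "https://packages.example.com/repos/example-ubuntu-trusty-prod/dists/trusty"

def fetch(name: str) -> str:
    # urlopen raises HTTPError on 404/5xx, so a missing InRelease fails loudly.
    with urlopen(f"{DIST_URL}/{name}") as resp:
        return resp.read().decode("utf-8", errors="replace")

def release_date(text: str):
    # Release and InRelease both carry a "Date:" field in RFC 2822 form.
    for line in text.splitlines():
        if line.startswith("Date:"):
            return parsedate_to_datetime(line.split(":", 1)[1].strip())
    raise ValueError("no Date: field found")

def main() -> int:
    release = release_date(fetch("Release"))
    inrelease = release_date(fetch("InRelease"))
    if inrelease < release:
        print(f"InRelease ({inrelease}) is older than Release ({release})", file=sys.stderr)
        return 1
    print("InRelease present and at least as new as Release")
    return 0

if __name__ == "__main__":
    sys.exit(main())
```

Run on a schedule (or immediately after each publish), a check like this would flag the stale or missing InRelease case before users hit signature errors.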

herebebeasties commented:

Can we get this re-opened, or another ticket made? It's an improvement if you have monitoring to catch when this happens, but the actual root cause clearly needs addressing.

dagood (Member) commented Sep 4, 2019

Does an open issue help you in some way? (Linking to it from somewhere, etc.) I'm not opposed to having one, but there's no work we (.NET Core) can do since we rely on a shared Microsoft resource to operate the repository properly for a variety of teams. I don't have any visibility into the underlying problems.

herebebeasties commented Sep 4, 2019

Not if it's not in the right place to get fixed, obviously. It's not uncommon for "master" tickets (which this one seems to have become) to be held open while the thing they depend on is fixed, especially if this is the public-facing view of it all.

It's a great shame that there's no external visibility (or internal, you say) into something that breaks a ton of stuff across the globe whenever it happens, probably costing (tens of?) thousands of man-hours to people using the Microsoft stack each time, especially when it's recurring and clearly not fixed yet. Can't you escalate this with the right team or something?

dagood (Member) commented Sep 4, 2019

> Can't you escalate this with the right team or something?

@leecow has plans for this; I'll let him decide the best course of action for this (or a new) issue.

leecow (Member) commented Sep 4, 2019

We do escalate when issues like this are encountered, and I am planning a meeting with them to cover this and other areas of concern with respect to SLA and validation.

fearphage (Author) commented:

@leecow Any updates?

leecow (Member) commented Sep 12, 2019

Meeting set for next Thurs.
