Health check of an instance #18922

MorrisJobke · 2015-09-09T09:14:37Z

@butonic and I have thought about this many times: it would be nice to have a health check for an instance. This could check for stuff like:

weird entries in the DB
duplicate entries
zombie entries - with no reference to others
not connected subtrees in the filecache
duplicate share entries - see "remove duplicate shares before mounting them" (#18835) for code to detect this
...

This is maybe something for a cleanup step or just for info to start investigations.
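As an illustration of the "duplicate share entries" check from the list above, here is a minimal sketch in Python against an in-memory SQLite database. The `oc_share` table is reduced to a few assumed columns (`share_type`, `share_with`, `uid_owner`, `file_source`); the real schema has more, and which columns define a "duplicate" is an assumption of this sketch, not something taken from #18835.

```python
import sqlite3

# Assumed definition of a duplicate: same share type, recipient, owner and file.
DUPLICATE_SHARES_SQL = """
SELECT share_type, share_with, uid_owner, file_source, COUNT(*) AS copies
FROM oc_share
GROUP BY share_type, share_with, uid_owner, file_source
HAVING COUNT(*) > 1
"""


def find_duplicate_shares(conn):
    """Return one row per group of shares that exists more than once."""
    return conn.execute(DUPLICATE_SHARES_SQL).fetchall()


def demo_duplicate_shares():
    """Build a tiny, hypothetical share table and run the check on it."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE oc_share (id INTEGER PRIMARY KEY, share_type INTEGER, "
        "share_with TEXT, uid_owner TEXT, file_source INTEGER)"
    )
    conn.executemany(
        "INSERT INTO oc_share (share_type, share_with, uid_owner, file_source) "
        "VALUES (?, ?, ?, ?)",
        [
            (0, "bob", "alice", 10),
            (0, "bob", "alice", 10),    # same share stored twice
            (0, "carol", "alice", 10),  # different recipient, not a duplicate
        ],
    )
    return find_duplicate_shares(conn)
```

The query only reports the duplicates; it deliberately does not delete anything, in line with the "just for info to start investigations" use case.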

I started this ticket to collect possible candidates of weird entries and to be able to check other existing instances for the same symptoms.

cc @karlitschek @PVince81 @nickvergessen @schiesbn @icewind1991 @rullzer @Xenopathic Opinions on this?

oparoz · 2015-09-09T10:52:32Z

wrong media types
broken thumbnails
files which think they're folders

schiessle · 2015-09-09T10:59:46Z

Great idea. Some more ideas:

encrypted files where "encrypted" is set to '0' at the file cache
unencrypted files where "encrypted" is set to '1' at the file cache

butonic · 2015-09-09T11:09:23Z

Some of the checks do take a while. For the implementation I would recommend a section in the admin settings that has buttons to trigger individual checks as well as a check all button.
It should be possible to get the SQL that is executed to check for problems, as well as any SQL that has been generated to clean up inconsistencies, before executing it. That won't be possible for all checks, but anyway.

Also checks for:

files for not existing storages
share entries for not existing files
home storages starting with local::
incorrectly formatted etags (should not start or end with double quotes)
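A rough sketch of how such checks could be kept as plain SQL, so that the statement can be shown before it is run, follows. It uses Python with an in-memory SQLite database. The tables are a simplified, assumed version of `oc_storages` and `oc_filecache`, and the check names are made up for illustration. The `local::` check flags every storage id with that prefix; telling home storages apart from other local storages would need extra logic that is not attempted here.

```python
import sqlite3

# Each check is a named SQL statement, so it can be displayed before execution.
# Table and column names are a simplified, assumed schema.
CHECKS = {
    "files_without_storage": """
        SELECT fc.fileid FROM oc_filecache fc
        LEFT JOIN oc_storages s ON s.numeric_id = fc.storage
        WHERE s.numeric_id IS NULL
        ORDER BY fc.fileid
    """,
    "storages_with_local_prefix": """
        SELECT numeric_id FROM oc_storages
        WHERE id LIKE 'local::%'
        ORDER BY numeric_id
    """,
    "etags_with_quotes": """
        SELECT fileid FROM oc_filecache
        WHERE etag LIKE '"%' OR etag LIKE '%"'
        ORDER BY fileid
    """,
}


def run_check(conn, name):
    """Run a single named check and return the offending ids."""
    return [row[0] for row in conn.execute(CHECKS[name])]


def run_all_checks(conn):
    """Equivalent of a 'check all' button: run every registered check."""
    return {name: run_check(conn, name) for name in CHECKS}


def demo_checks():
    """Create hypothetical sample data containing one problem of each kind."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE oc_storages (numeric_id INTEGER PRIMARY KEY, id TEXT);
        CREATE TABLE oc_filecache (
            fileid INTEGER PRIMARY KEY, storage INTEGER, path TEXT, etag TEXT);
        INSERT INTO oc_storages VALUES (1, 'home::alice'), (2, 'local::/data/bob/');
        INSERT INTO oc_filecache VALUES
            (1, 1, 'files/a.txt', 'abc123'),
            (2, 1, 'files/b.txt', '"abc124"'),
            (3, 9, 'files/c.txt', 'abc125');
    """)
    return run_all_checks(conn)
```

Printing `CHECKS[name]` gives the exact statement a check would execute, which covers the "get the SQL before executing it" part for read-only checks.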

karlitschek · 2015-09-09T12:02:37Z

Very good idea. Should this be part of the repair script? I guess some of the issues are repairable and others not?

MorrisJobke · 2015-09-09T13:51:12Z

> Very good idea. Should this be part of the repair script? I guess some of the issues are repairable and others not?

This is more for non-repairable stuff. If it is possible to repair something, we can do that instead of showing it, but this should primarily help to detect problems that have not yet occurred but could be bad.

karlitschek · 2015-09-09T13:57:55Z

True, but maybe it should be added to the same occ command. A user doesn't know if something can be repaired or not.

tflidd · 2015-09-09T14:35:58Z

remove old tables (i.e. from old apps that are not used any more)

MorrisJobke · 2015-09-09T15:31:48Z

> remove old tables (i.e. from old apps that are not used any more)

There is already a repair step for this. Do you have any specific tables in mind? Then we can add them.

nickvergessen · 2015-09-10T06:49:34Z

@butonic

> share entries for not existing files

This is done from 8.1 or 8.2 onwards with a cron job.

rullzer · 2015-09-11T08:36:35Z

Yes, this would be great. But! We must be careful here. Automatic repairs need to be very, very well analyzed before they are done! And in some cases it might be best just to advise people to post an issue here if the step reports that something is broken.

I would suggest making sure this can only be run from the CLI. Timeouts are dangerous when handling potentially complex or long-running tasks.

butonic · 2015-09-11T10:49:09Z

Agreed. Let us start with a health check first. Keep repair steps in occ.

MorrisJobke · 2015-09-11T12:22:07Z

detecting filecache entries that have a parent in a different storage -> we have seen this, but can't find the reason
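One possible way to detect this is a self-join on the file cache, sketched here in Python against an in-memory SQLite database. The `oc_filecache` table is reduced to the assumed columns `fileid`, `storage`, `parent` and `path`; the sample rows are invented.

```python
import sqlite3

# Join every entry to its parent row and keep the pairs whose storages differ.
CROSS_STORAGE_PARENT_SQL = """
SELECT child.fileid, child.storage, parent.storage
FROM oc_filecache AS child
JOIN oc_filecache AS parent ON parent.fileid = child.parent
WHERE child.storage <> parent.storage
ORDER BY child.fileid
"""


def find_cross_storage_parents(conn):
    """Return (fileid, own storage, parent's storage) for inconsistent entries."""
    return conn.execute(CROSS_STORAGE_PARENT_SQL).fetchall()


def demo_cross_storage_parents():
    """Hypothetical data: entry 3 lives in storage 2 but its parent is in storage 1."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE oc_filecache (
            fileid INTEGER PRIMARY KEY, storage INTEGER, parent INTEGER, path TEXT);
        INSERT INTO oc_filecache VALUES
            (1, 1, -1, ''),
            (2, 1, 1, 'files'),
            (3, 2, 2, 'files/x.txt');
    """)
    return find_cross_storage_parents(conn)
```

This only reports the symptom; it says nothing about the cause, which is the open question here.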

PVince81 · 2015-09-18T09:24:31Z

detect whether all indices from db_structure.xml exist. I have the feeling that in some situations people somehow managed to skip index creation, see "upgrade 8.1.1-3 -> 8.1.3-13.1 'CREATE UNIQUE INDEX' failed" #19142 (comment)
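A minimal sketch of such an index check, assuming the list of expected index names has already been extracted from db_structure.xml (the parsing is left out). It is written in Python for SQLite, whose catalog table is `sqlite_master`; other databases would need their own catalog query. The index names in the demo are illustrative only.

```python
import sqlite3


def missing_indexes(conn, expected):
    """Return the expected index names that do not exist in the database."""
    rows = conn.execute("SELECT name FROM sqlite_master WHERE type = 'index'")
    existing = {row[0] for row in rows}
    return sorted(set(expected) - existing)


def demo_missing_indexes():
    """Create a table with one of two expected indexes and report the gap."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE oc_filecache (
            fileid INTEGER PRIMARY KEY, storage INTEGER, path_hash TEXT);
        CREATE INDEX fs_storage ON oc_filecache (storage);
    """)
    # Hypothetical expected names; a real check would read them from db_structure.xml.
    return missing_indexes(conn, ["fs_storage", "fs_storage_path_hash"])
```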

oparoz · 2015-09-18T09:31:23Z

Find a way to detect if oc_file_locks is full of rubbish by comparing the list of cached files to the actual list of files

nickvergessen · 2015-09-18T09:57:34Z

@oparoz that is more of a repair step? 😉

oparoz · 2015-09-18T10:00:23Z

@nickvergessen It's both. First you need to know that something is completely wrong.

As a user, you get all the weird messages about files being locked, so you suspect something is wrong
You notify the admin
As an admin, you test the health of your system
You see that something is wrong
You run the repair step

No?

nickvergessen · 2015-09-18T11:05:47Z

Well, repair steps don't need "first you need to know", you just run them and they fix it.
The health check is for cases where we can't fix stuff automatically.

MorrisJobke · 2015-09-24T14:01:10Z

LDAP entries that were created before a change was made in the LDAP settings (e.g. username attribute, homefolder naming rule, ...)

PVince81 · 2015-10-29T07:02:12Z

child share entries without parent (repair step raised here: Repair step: delete shares where the parent doesn't exist any more #20130)

MorrisJobke · 2015-10-29T07:10:28Z

> child share entries without parent (repair step raised here: #20130)

Nice one :)

MorrisJobke · 2015-11-26T01:10:28Z

unique share tokens: "Sharing tokens should be unique" #20741

MorrisJobke · 2016-01-08T09:42:00Z

duplicate tags per user: "Duplicated entries in the oc_vcategory table cause favourite tags to not be permanently deleted" #20952

nickvergessen · 2016-01-08T09:47:42Z

> duplicate tags per user #20952

That should not be a health check, it needs a fix in the db layer and a pre-update script to fix it.

MorrisJobke · 2016-01-08T09:49:03Z

> That should not be a health check, it needs a fix in the db layer and a pre-update script to fix it.

Correct - but we should also find out the reason. We can drop it from here if it is not needed anymore - just collecting stuff.

cdamken · 2016-02-19T19:23:39Z

@bboule This is related to https://github.com/owncloud/enterprise/issues/832#issuecomment-143771260 Shouldn't it have the same milestone?

nickvergessen · 2016-02-22T07:55:02Z

@cdamken this is an overview ticket with things that could be implemented in little steps...

bboule · 2016-02-22T14:45:54Z

I think we need to loop @MTRichards into this as well as it seems product related

MorrisJobke · 2016-02-22T15:12:04Z

I have some alpha-grade code: https://github.com/owncloud/serverhealth. It needs some love, but it already has first tests.

PVince81 · 2016-03-04T15:16:37Z

repair filecache parents + path_hash if inconsistent: "Repair filecache parents + path_hash" #22866, "add a repair script for path_hash mismatching md5(path)" #10705
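The detection half of the path_hash case can be sketched as follows, in Python with an in-memory SQLite database. It assumes, as the title of #10705 suggests, that `path_hash` should equal the hex MD5 of `path`; the reduced `oc_filecache` table and the sample rows are hypothetical.

```python
import hashlib
import sqlite3


def find_path_hash_mismatches(conn):
    """Return fileids whose stored path_hash differs from md5(path)."""
    mismatches = []
    query = "SELECT fileid, path, path_hash FROM oc_filecache ORDER BY fileid"
    for fileid, path, path_hash in conn.execute(query):
        expected = hashlib.md5(path.encode("utf-8")).hexdigest()
        if path_hash != expected:
            mismatches.append(fileid)
    return mismatches


def demo_path_hash_mismatches():
    """Entry 2 carries the hash of another path and should be reported."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE oc_filecache "
        "(fileid INTEGER PRIMARY KEY, path TEXT, path_hash TEXT)"
    )
    good_hash = hashlib.md5(b"files/a.txt").hexdigest()
    conn.executemany(
        "INSERT INTO oc_filecache VALUES (?, ?, ?)",
        [(1, "files/a.txt", good_hash), (2, "files/b.txt", good_hash)],
    )
    return find_path_hash_mismatches(conn)
```

The hash is recomputed in Python rather than in SQL because SQLite has no built-in MD5 function; on a database that has one, the comparison could probably be done in a single query.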

PVince81 · 2016-10-10T14:39:28Z

detect unmigrated legacy storages, in case the warnings were ignored...

PVince81 · 2017-01-27T17:25:02Z

@pmaier1

PVince81 · 2017-06-28T19:05:13Z

repair parent-child relationships (non-matching path or storage): #28253

PVince81 · 2017-07-03T09:44:51Z

repair mime type of non-folders that do have children

@jvillafanez FYI ^

PVince81 · 2017-07-03T11:06:31Z

delete stray file cache entries that have no matching files and are not accessible through any parents. Normally entries accessible through parents are already cleared by occ files:scan --all, but inaccessible ones aren't and need a different approach.
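The "not accessible through any parents" part can be sketched as a reachability walk over `(fileid, parent)` pairs, shown here in plain Python. It assumes root entries are marked with parent `-1`, and it only covers reachability; checking whether a matching file exists on the storage is not attempted.

```python
from collections import defaultdict, deque


def find_unreachable_entries(entries, root_parent=-1):
    """Return sorted fileids whose parent chain never reaches a root entry.

    entries: iterable of (fileid, parent) pairs.
    root_parent: assumed marker value for root entries.
    """
    entries = list(entries)
    children = defaultdict(list)
    for fileid, parent in entries:
        children[parent].append(fileid)

    # Breadth-first walk from the roots; everything visited is reachable.
    reachable = set()
    queue = deque(children[root_parent])
    while queue:
        fileid = queue.popleft()
        if fileid in reachable:
            continue
        reachable.add(fileid)
        queue.extend(children[fileid])

    return sorted(fileid for fileid, _ in entries if fileid not in reachable)


def demo_unreachable_entries():
    """Entries 4 and 5 hang below a parent (99) that does not exist."""
    return find_unreachable_entries(
        [(1, -1), (2, 1), (3, 2), (4, 99), (5, 4)]
    )
```

Entries that only point at each other in a cycle are reported as well, since no root leads to them.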

ownclouders · 2018-01-13T01:10:50Z

Hey, this issue has been closed because the label status/STALE is set and there were no updates for 7 days. Feel free to reopen this issue if you deem it appropriate.

(This is an automated comment from GitMate.io.)

PVince81 · 2018-01-15T09:41:39Z

deal with orphaned storages (see Maintenance tool to deal with orphaned storages #23364)

ownclouders · 2018-02-23T01:14:31Z

Hey, this issue has been closed because the label status/STALE is set and there were no updates for 7 days. Feel free to reopen this issue if you deem it appropriate.

(This is an automated comment from GitMate.io.)

MorrisJobke added the discussion label Sep 9, 2015

MorrisJobke self-assigned this Oct 20, 2015

nickvergessen added the overview label Feb 22, 2016

PVince81 mentioned this issue Mar 4, 2016

Repair filecache parents + path_hash #22866

Closed

PVince81 unassigned MorrisJobke Jan 27, 2017

PVince81 added this to the backlog milestone Jan 27, 2017

PVince81 mentioned this issue Jul 7, 2017

Add repair step to repair mismatch filecache paths #28253

Merged

ownclouders added the status/STALE label Jan 5, 2018

ownclouders closed this as completed Jan 13, 2018

ownclouders removed the status/STALE label Jan 13, 2018

PVince81 reopened this Jan 15, 2018

PVince81 mentioned this issue Jan 15, 2018

Maintenance tool to deal with orphaned storages #23364

Closed

ownclouders added the status/STALE label Feb 15, 2018

ownclouders closed this as completed Feb 23, 2018

ownclouders removed the status/STALE label Feb 23, 2018

PVince81 added the enhancement label Feb 23, 2018

PVince81 reopened this Feb 23, 2018

ownclouders added the status/STALE label Mar 26, 2018

AlexAndBear closed this as completed Sep 21, 2021
