Separating conserve library and utility functionality #173

WolverinDEV · 2022-08-03T18:24:43Z

Aim

The aim is to separate the library from the actual conserve command line utility.
Currently, progress output is largely produced inside the conserve library itself, which prevents any use other than the conserve command line utility. The goal is to provide a simple yet powerful developer interface to hook into and monitor the state of conserve operations.

Because the initial aim was to provide a better way of logging, this PR achieves two things:

Providing a more powerful developer interface (the main goal, described above)
Implementing a logging system for the conserve utility (the terminal CLI)

Attention:
This PR submits API breaking changes!

Providing a more powerful developer interface

Tapping into library functions and observing the progress of the current function is achieved through so-called Monitors.
These monitors can be passed as the last function argument. Depending on the function, the monitor receives general progress notifications or other kinds of events.
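
A minimal sketch of what such a monitor could look like, assuming a trait whose callbacks default to no-ops. The names `BackupMonitor`, `ByteCounter`, and `backup` are hypothetical and are not taken from the PR's code; `NullMonitor` borrows its name from a later commit title only.

```rust
use std::cell::Cell;

/// Hypothetical monitor trait: every callback has a no-op default,
/// so implementors only override the notifications they care about.
pub trait BackupMonitor {
    fn file_started(&self, _path: &str) {}
    fn file_finished(&self, _path: &str, _bytes: u64) {}
}

/// Monitor that ignores all notifications.
pub struct NullMonitor;
impl BackupMonitor for NullMonitor {}

/// Monitor that only tracks how many bytes were reported as stored.
#[derive(Default)]
pub struct ByteCounter {
    bytes: Cell<u64>,
}

impl ByteCounter {
    pub fn total(&self) -> u64 {
        self.bytes.get()
    }
}

impl BackupMonitor for ByteCounter {
    fn file_finished(&self, _path: &str, bytes: u64) {
        self.bytes.set(self.bytes.get() + bytes);
    }
}

/// Stand-in for a library function; the monitor is the last argument.
/// Returns the number of files processed.
pub fn backup(files: &[(&str, u64)], monitor: &dyn BackupMonitor) -> usize {
    for (path, size) in files {
        monitor.file_started(path);
        // ... the real work of storing the file would happen here ...
        monitor.file_finished(path, *size);
    }
    files.len()
}
```

A caller that does not care about progress would pass `&NullMonitor`, while a terminal UI would pass its own implementation.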

Implementing a logging system for the conserve utility

The logging system is based on the tracing crate and includes support for a log file (fixing #104).
The actual output should not differ from the previous conserve utility output.


WolverinDEV · 2022-08-04T07:24:39Z

@sourcefrog any thoughts on this one?
It turned out that logging is a more complex thing to do than expected.

sourcefrog · 2022-08-04T11:50:57Z

Hi, yeah, I had not realized you were going to start right away. I can give you some guidance on the weekend.

WolverinDEV · 2022-08-04T15:31:52Z

Ahh yeah all right.
I got some spare time yesterday so I thought I would give it a shot.

After some fiddling I came up with the concept of extracting the whole logging thing from the library and moving it into the binary, i.e. into the responsibility of the library users. This prevents conflicts with any other logging systems used alongside other libraries.

My approach would be passing some kind of statistic callbacks for progress logging.
May I push some first drafts this evening (UTC+2).

sourcefrog

Without going line by line here are a few thoughts:

Please don't delete -D, I use it.
This would be easier to review as a few PRs, but I understand how you might need to write the whole thing first to understand the code or what you want to change. e.g. introducing better structured error codes could be its own PR.
Converting messages to info! is probably OK in general, but as I mentioned on the bug I'm not sure it's a good idea for e.g. file listings that might be consumed by other programs, or show_versions. Or for example json will not be useful if it's mixed with log prefixes.
The src/bin entry point should remain small and should configure things in the library. I don't have a goal to migrate lots of code into the binary, rather the reverse. But, making the library configurable by different binaries that e.g. don't want terminal output, would be good.

.gitignore

WolverinDEV · 2022-08-04T19:46:41Z

This would be easier to review as a few PRs, but I understand how you might need to write the whole thing first to understand the code or what you want to change. e.g. introducing better structured error codes could be its own PR.

Yes, perspective changed a bit.
Initially I thought piping output through a logging system with several targets (terminal and file in particular) would do the job. But this seemed a bit inflexible, especially if you want to use conserve as a library and not as a standalone tool.

I'm not sure it's a good idea for e.g. file listings that might be consumed by other programs, or show_versions

I skipped answering that point since I agree on that one as well; this is still one of my concerns.
I'm playing with some ideas for that:

General log only into a file (which kinda defeats the whole purpose of this)
Add a flag to omit all message prefixes (I'm not really a fan of that)
Using a different output stream for the log and process output (e.g. log into stderr and output into stdout)
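
To make the last option concrete, here is a rough sketch that assumes the library routine simply receives two writers, one for the actual output and one for log lines. The function name `list_files` and the `INFO` prefix are made up for illustration and do not come from conserve.

```rust
use std::io::{self, Write};

/// Hypothetical listing routine: entries go to `out`, diagnostics go to `log`,
/// so a consumer reading `out` never sees log prefixes.
pub fn list_files<O: Write, L: Write>(
    names: &[&str],
    out: &mut O,
    log: &mut L,
) -> io::Result<()> {
    writeln!(log, "INFO listing {} entries", names.len())?;
    for name in names {
        writeln!(out, "{name}")?;
    }
    writeln!(log, "INFO listing finished")?;
    Ok(())
}
```

The binary could then call it with `io::stdout()` as `out` and `io::stderr()` as `log`, while a test could pass two `Vec<u8>` buffers and inspect them separately.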

Right now I'm not happy with any of these solutions.
As the PR title mentions, it's WIP and seems to bring up more issues than initially thought.

The src/bin entry point should remain small

Okay, I'll keep that in mind.

Edit:
Basically this PR includes multiple steps:

Split-up library usage, process logging, and output logging
Determine feasible output formats for the end user
Implement proper logging targets

sourcefrog · 2022-08-05T19:26:30Z

I'm happy to talk about it more when you want! I would suggest perhaps starting with options to write trace to a log file that is not stdout, without removing -D. But it can of course go towards whatever you're energized by.


WolverinDEV · 2022-08-05T23:00:52Z

I've thought about the system recently and I think I came up with a pretty neat one.
Currently I've mainly focused on the backup function (my test dummy :P). All other functions will be refactored to follow that pattern. I've also resolved the issue with logging to stdout and using a progress bar at the same time (I'm not that big a fan of the current solution, but there might be some clean-ups).

Using ProgressMonitors like I've shown in backup would be my way to go.
This easily allowed me to factor out the UI-related code (ref here). There is still a lot of work needed, but the basic concept is pretty much done. Lots of polishing and cleanup is needed, though!

PS: Don't worry, you'll get your -D switch back.


sourcefrog · 2022-08-14T15:14:11Z

Thinking about this a bit more overnight: perhaps the monitor interface should look more like something that just accepts Event and Problem enums.

Then, the implementation used in testing can simply record the events/problems into a vec to inspect later.

A different implementation can report significant events to log events. Another monitor (or perhaps the same one with a flag set) can update nutmeg models.

pub enum Event {
    StartListBlocks,
    ListBlocks { block_count: usize },
    FinishListBlocks { block_count: usize },
    StartValidateBand { band_id: BandId },
}
pub enum Problem {
    ...
}
trait Monitor {
    fn report_event(&self, event: &Event);
}

This would still have, to some extent, parallel state machines in the application code and the monitor. But, the states are not really that complex: validate just proceeds through a few phases. So that would be fine.
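
As a sketch of the testing idea above, a monitor that records events into a vec could look roughly like this. `RecordingMonitor`, `list_blocks`, and the `BandId` stand-in are hypothetical; only `Event` and `Monitor` follow the shapes proposed in this comment. Because `report_event` takes `&self`, the recorder needs interior mutability, here a `Mutex`.

```rust
use std::sync::Mutex;

// Hypothetical stand-in for conserve's real BandId type.
#[derive(Debug, Clone, PartialEq)]
pub struct BandId(pub u32);

#[derive(Debug, Clone, PartialEq)]
pub enum Event {
    StartListBlocks,
    ListBlocks { block_count: usize },
    FinishListBlocks { block_count: usize },
    StartValidateBand { band_id: BandId },
}

pub trait Monitor {
    fn report_event(&self, event: &Event);
}

/// Test monitor that stores every event for later inspection.
#[derive(Default)]
pub struct RecordingMonitor {
    events: Mutex<Vec<Event>>,
}

impl RecordingMonitor {
    /// Returns a copy of all events seen so far, in order.
    pub fn events(&self) -> Vec<Event> {
        self.events.lock().unwrap().clone()
    }
}

impl Monitor for RecordingMonitor {
    fn report_event(&self, event: &Event) {
        self.events.lock().unwrap().push(event.clone());
    }
}

/// Hypothetical library function that reports its phases to the monitor.
pub fn list_blocks(block_names: &[&str], monitor: &dyn Monitor) -> usize {
    monitor.report_event(&Event::StartListBlocks);
    let mut block_count = 0;
    for _name in block_names {
        block_count += 1;
        monitor.report_event(&Event::ListBlocks { block_count });
    }
    monitor.report_event(&Event::FinishListBlocks { block_count });
    block_count
}
```

A test can then call `list_blocks` with a `RecordingMonitor` and assert on the exact sequence returned by `events()`.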

WolverinDEV · 2022-08-15T11:51:08Z

perhaps the monitor interface should look more like something that just accepts Event and Problem enums.

I had the same thought at the beginning, but didn't go for it.
The reason why I didn't is that I actually tried it (first draft I had in mind):

When using an enum to propagate all status events we could just pass a callback function that takes one argument (the progress enum). I've tried that pattern and imo the DX wasn't really good. It just felt off, made the code harder to understand and didn't look appealing to me. Maybe try it yourself and see how it goes, it might just be me :)
Side note: a problem is just another event, which intuitively means you would rather have a ProblemEvent than a separate callback for it.

In terms of testing, you're right.
Saving events in a Vec<_> would be much easier.

WolverinDEV · 2022-08-15T12:01:32Z

The more I think about it, the less satisfied I am with the number of callback methods currently supplied.
I'll think and experiment a bit more as well.

WolverinDEV · 2022-08-24T13:55:02Z

Hey,
I've been pretty busy with my last exam for this semester but I had a couple of thoughts about the enum thing.

I intuitively used separate functions since I mainly use that kind of pattern when the state of a struct needs to be observed.
An example would be a server with rooms where each room has a room monitor which gets called on user join, leave, etc...
For such a concept, having extra methods works well because the callback code will most likely be a bit larger.

Adapting this approach for events, especially if they happen in a predefined order, works but might not be the best fit.
Instead, propagating the progress should just be seen as one "event" containing the current state.
These progress updates should not be used for error reporting.

Therefore I'll adjust the monitor style accordingly.
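
One possible reading of this, sketched under assumptions: progress becomes a single value describing the current state, while errors travel through the return value. The names `RestoreProgress`, `RestoreError`, and `restore` are hypothetical, and a closure is used here only for brevity; the PR itself uses monitors.

```rust
/// Hypothetical snapshot of where a restore currently stands.
#[derive(Debug, Clone, Copy, PartialEq)]
pub enum RestoreProgress {
    Starting { total_files: usize },
    Copying { files_done: usize, total_files: usize },
    Finished,
}

/// Hypothetical error type, kept separate from progress updates.
#[derive(Debug, PartialEq)]
pub struct RestoreError(pub String);

/// Stand-in for a library function that reports one kind of progress event.
pub fn restore(
    files: &[&str],
    mut on_progress: impl FnMut(RestoreProgress),
) -> Result<usize, RestoreError> {
    let total_files = files.len();
    on_progress(RestoreProgress::Starting { total_files });
    for (index, name) in files.iter().enumerate() {
        if name.is_empty() {
            // Problems are reported through the return value, not the callback.
            return Err(RestoreError("empty file name".to_string()));
        }
        on_progress(RestoreProgress::Copying {
            files_done: index + 1,
            total_files,
        });
    }
    on_progress(RestoreProgress::Finished);
    Ok(total_files)
}
```

A progress bar only needs to render the latest `RestoreProgress` value, so the observer no longer has to mirror the sequence of calls made by the library.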

WolverinDEV · 2022-09-05T10:36:02Z

@sourcefrog any notes on the changes made?

sourcefrog · 2022-09-06T16:21:36Z

Hi, I appreciate all the work you put into it.

At the moment this gets only a small amount of time from me because it's a spare time project and I'm trying to finish cargo-mutants on my weekends.

I'm not sure about merging it just as is. It is very big. It does at least indicate some interesting directions. I think the proliferation of monitor methods, the handling of bulk output, and the particular way that trace interacts with the nutmeg view are not quite what I'm comfortable with at the moment.

sourcefrog · 2023-01-03T18:37:52Z

Hi, so I finally came back to this over the break to try to work out how to merge or otherwise resolve it. I think overall this is a really good step. The monitor seems fine as it is.

The main thing I'd want to change is to have a concept of output to stdout that is not a log message: listings whether json or text of files, versions, etc. I'll try to build on this to put that back in...

WolverinDEV · 2023-01-04T23:19:09Z

The main thing I'd want to change is to have a concept of output to stdout that is not a log message

If I'm not mistaken you're referring to "show_index_json" within the "show" command?
I agree that piping the json output through the logger is quite a cruel thing to do,
but I haven't really had any better idea. Maybe use some kind of logging abstraction for the conserve binary?

PS: I've updated the PR's description and title to a more (imo) accurate one, as it has changed quite a bit over time.


sourcefrog · 2023-01-05T05:47:49Z

What might work for this kind of case:

A library API like "write a list of versions as json to this Write."
The CLI can call that, asking it to just write to stdout.

Both of these could then be extended:

There are a few cases of "produce a list of things, render them as text or json, or maybe verbose text": versions, file listings. (Maybe just two for now?) Possibly those should change to iterators of objects that impl Serialize and Display.
Maybe the CLI code that coordinates logging and so on should have a function that when called turns off trace to stdout and progress, to make sure that nothing interferes with this bulk output.
Perhaps really trace and progress should go to stderr rather than stdout, so that you could potentially redirect output but still see error messages. (This may be annoying if it breaks a lot of existing tests making assertions about stdout.)
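
A dependency-free sketch of the "write a list of versions to this Write" idea, with the same objects rendered either as text or as JSON. `VersionInfo`, its fields, and both function names are assumptions for illustration. The JSON is hand-rolled only to keep the sketch self-contained; real code would more likely derive `serde::Serialize`, as suggested above.

```rust
use std::fmt;
use std::io::{self, Write};

/// Hypothetical description of one stored version.
pub struct VersionInfo {
    pub band_id: String,
    pub complete: bool,
}

impl fmt::Display for VersionInfo {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        let state = if self.complete { "complete" } else { "incomplete" };
        write!(f, "{} {}", self.band_id, state)
    }
}

impl VersionInfo {
    /// Assumes `band_id` contains no characters that need JSON escaping.
    fn to_json(&self) -> String {
        format!(r#"{{"band_id":"{}","complete":{}}}"#, self.band_id, self.complete)
    }
}

/// Writes one version per line as plain text.
pub fn write_versions_text<W: Write>(versions: &[VersionInfo], out: &mut W) -> io::Result<()> {
    for v in versions {
        writeln!(out, "{v}")?;
    }
    Ok(())
}

/// Writes all versions as a single JSON array followed by a newline.
pub fn write_versions_json<W: Write>(versions: &[VersionInfo], out: &mut W) -> io::Result<()> {
    let items: Vec<String> = versions.iter().map(VersionInfo::to_json).collect();
    writeln!(out, "[{}]", items.join(","))
}
```

The CLI would pass `std::io::stdout()`, while tests or other binaries could pass a buffer or a file, and nothing in the library would need to know about logging or progress bars.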

sourcefrog · 2023-01-08T05:02:48Z

It occurs to me that it might be possible to separate out just the addition of the monitor concept from everything else. That would, at least, get the PR size down a bit, be a step forward itself, and also probably be possible to land without regressing any behavior by introducing log prefixes on bulk output and without needing to update so many tests...

WolverinDEV · 2023-01-08T13:14:17Z

Well, along with the monitor come the logging/UI changes. A core part of the new monitor system is to separate out the whole display part. Therefore we could not just leave the old UI stuff within the conserve library. A possible solution though would be to replace the logging done via tracing within the binary with plain println!. I suggest keeping tracing as the logging backend for the library. We could also just change the tracing subscriber and print the events as standard messages (without the fancy colouring).

sourcefrog · 2023-01-08T20:41:50Z

08d26ce has a work-in-progress experiment with using the monitor pattern only for validate, including collecting a structured list of problems, which will make testing validation/corruption better.

A possible solution though would be to replace the logging done via tracing within the binary with plain println!. I suggest keeping tracing as the logging backend for the library. We could also just change the tracing subscriber and print the events as standard messages (without the fancy colouring).

I'm not sure I'm following here. I agree with using tracing for reporting errors, warnings, debug info, status messages and so on. I just think that file listings, version listings, and so on are a different kind of thing and probably should not go through tracing.

sourcefrog · 2023-02-12T03:20:57Z

I decided to instead merge #197 which builds on these ideas. Thanks for the thoughts and provocation!

WolverinDEV added 2 commits August 3, 2022 20:20

Initial draft of using tracing as logging library

afddcb1

Merge remote-tracking branch 'original/main'

f4032c1

WolverinDEV changed the title ~~[WIP] Using tracing for logging and implement file logging~~ [WIP] Separating library functions from log/display utilities Aug 3, 2022

WolverinDEV added 3 commits August 3, 2022 20:59

Moved display functions in show.rs to the binary and removed all outputs to stdout

fd49388

Using log levels instead of a debug flag

deb9d3a

Using own exit code enum for exit codes

2ef2b4b

sourcefrog mentioned this pull request Aug 4, 2022

Better log/terminal abstractions #104

Open

sourcefrog reviewed Aug 4, 2022


WolverinDEV added 3 commits August 5, 2022 22:19

Removed personal test folder from .gitignore

2392651

Using a monitor to monitor the backup progress

aba2603

Adding an option to pipe logging output through nutmeg when showing a process bar

1b9f1b0

WolverinDEV marked this pull request as draft August 6, 2022 09:38

WolverinDEV changed the title ~~[WIP] Separating library functions from log/display utilities~~ Separating library functions from log/display utilities Aug 6, 2022

WolverinDEV and others added 10 commits August 6, 2022 16:07

Improved monitor handling and nutmeg logging

6e13c6a

Removing terminal output from validate functions and using a monitor instead

fae7f35

Fixed missing set_symlink_file_times import for unix

42fe67b

Adding a file output writer to tracing subscribers allowing file logging

5cc17a7

Fixed a build error preventing the tests from building

b12b6fe

Applying new monitor pattern for diretory size listing

dfbaf66

Moved monitor definitions into a seperate file

c932a20

Using a monitor for validating the top level archive as well

7c5f2fc

Using a monitor for deletions

7b3091c

Using a monitor for restore. This removes all nutmeg modals within the library.

1e25ddc

WolverinDEV mentioned this pull request Aug 15, 2022

Don't pipe JSON outputs trough tracing #183

Open

Renaming DefaultMonitor to NullMonitor and using a single const instance

55bf218

WolverinDEV mentioned this pull request Aug 15, 2022

Write tests for monitors #184

Open

WolverinDEV added 3 commits August 15, 2022 13:35

Implementing the simple review points

e65b8d1

Merge remote-tracking branch 'original/main'

3101f94

Using From instead of Into

8208152

Seperating nutmeg monitors into a seperate file

7859ce2

Simplifying process updates via monitors

8c8c244

WolverinDEV changed the title ~~Separating library functions from log/display utilities~~ Separating conserve library and utility functionality Jan 4, 2023

WolverinDEV mentioned this pull request Jan 4, 2023

Restructuring cargo project using workspaces #195

Open

WolverinDEV added 6 commits January 5, 2023 01:05

Merge remote-tracking branch 'original/main'

a023436

Attention: Disabled test 'long_listing_old_archive' as it will currently not compiling as it is

Fixing unix builds

df90928

Fixed unix tests

6f4097a

Using raw output for unix_permissions.rx test

5131148

Merge branch 'main' of https://github.com/WolverinDEV/conserve

e7cc4f0

Removing duplicate new line for restore file list

426b4e3

sourcefrog closed this Feb 12, 2023
