Client/training UI by subdavis · Pull Request #487 · Kitware/dive

subdavis · 2020-12-10T14:58:53Z

Work in progress

fixes #467
fixes #518

subdavis · 2021-01-04T19:30:39Z

For web, this is done and appears to work.

For desktop, the UI is done, and all that remains is to implement the apispec training function. Until then, the router tabs are disabled.

Rebased after the major schema change from #516

subdavis · 2021-01-04T19:57:20Z

FWIW, I'm starting to have doubts that a specific "Training" page is a good idea.

I wonder if a general-purpose "CRUD" type data table with checkboxes and then a toolbar with tabs for different actions would be better.

The toolbar area would almost be like microsoft office. Tabs like "trainining", "pipelines" and "manage data". Basically it would be just like the current web data tab, with without the folders/heirarchy and moving all the stuff in the right-side of the browser toolbar into a tabbed toolbar.

The staging area might still be useful for all purposes. Its' just a more robust way of checking checkboxes when you have multiple pages of results: you can always see the things you've checked.

Training

You could name a new pipe, select training config, and start a train

Pipelines

Choose a pipe, run it

Management

Import/export, delete data, manage permissions, other stuff.

@BryonLewis ptal, let me know what you think.

I believe I'd prefer to merge this first then circle back, but if you really dislike the training UI, I'm willing to scrap it.

BryonLewis · 2021-01-04T21:09:34Z

my main worry with this current one is that it may eventually be impossible to get to the dataset you specifically want without the ability to hierarchy down. Using the findWithPermissions Access.Read on the server could give you a couple hundred datasets that are sorted by name (when really you only want to use some specific public ones from a specific user, or even just you're own).
I.E

Some of these with the same name are from different users I have on my system for testing using the same source media with different annotations. I can't tell whose version of that data it is.

I do like the idea of having a shared with me/not shared with me toggle to simply, but I kind of feel that a flat list without hierarchy is going to be harder to navigate and discern what data you want to use for training. Like you may want to combine your own data with a specific dataset in a collection on the server like the HABCAM stuff. This could be mitigated but would require implementation of powerful filtering/searching tools.

I would see if we could get an indication of how often a user might train on their own data, plus a collection or source dataset either on the server or from another user. I know we added the ability to train on multiple folders, I just don't fully know if one of the use cases is to accentuate your own data with larger ground truth like the JRS or HABCAM data.

Additionally to this I think we need to clean up the pipelines/trained pipeline menus as well because they can have the potential again to be an unwieldy list of text that doesn't provide enough information. This is all UI stuff, but trained pipelines could indicate the owner and what they were trained on (we already have all this data). Making it easier to share and prevent issues with naming conflicts.

subdavis · 2021-01-04T21:33:11Z

Yeah, I really had blinders on for this one. All the points you make are valid, particularly about how "you could solve this problem, but only by implementing complicated search/filtering" which is effectively what the hierarchy is.

How to proceed?

What would you recommend? I like the staging area. I guess we could change the "available" area to be a regular data browser without checkboxes and a + button instead. (Checkboxes disappear between directories, it's a whole thing).

Also, for training on desktop, there is no browser (and none of these problems apply) so I guess we just have to keep this page desktop-only.

BryonLewis · 2021-01-04T22:57:30Z

I think the solution works well for the desktop in which datasets are limited and aren't typically shared. (Might want to implement/add the new created or other information to the view)

I like the idea of a staging area for the web as well. Other features and ease of use are mostly determined by how often it will be used in specific ways.

With the web version I would want some questions answered first:
How often would they be training on datasets that aren't theirs?
If they do this frequently how often would it be other users vs how often would it be on the server collections?
Would it help to include a recents/quick selection on the web version for selecting datasets that have been used or trained on before? (localStorage should be sufficient for this)
Retrain button? (allows you to take pipeline and retrain it taking updated data and outputting the same name?
Should you be able to add items to a training group through multiple windows? (sometimes it's easier to have multiple tabs open for navigation)

I think your + button or something similar would be the first step. With it processing when the user click's + and giving feedback if they are allowed to add the item to the staging area. I don't have a good idea on what exactly this looks like yet (in terms of the staging area and the capabilities of the staging area in regards to pipelines, training, export, delete). I need a little bit of time to create another list of possible annoying things that could occur using this mode. Like I'm not sure this should replace what is there for a single folder and operating on one or two items in a folder. I need to do a bit more thinking.

BryonLewis · 2021-01-05T16:50:18Z

How often would they be training on datasets that aren't theirs?
If they do this frequently how often would it be other users vs how often would it be on the server collections?
Would it help to include a recents/quick selection on the web version for selecting datasets that have been used or trained on before? (localStorage should be sufficient for this)
Retrain button? (allows you to take pipeline and retrain it taking updated data and outputting the same name?
Should you be able to add items to a training group through multiple windows? (sometimes it's easier to have multiple tabs open for navigation)

Through discussion with Matt it looks like access to other folders isn't a frequent thing so it can be left using the basic file browser with additional buttons.
Only other thing is eventually an advanced way to set up training and resume training on specific models to do longer runs, but this is much further off. I think just swapping your flat file on desktop for the file browser with the extra button for the web version.

subdavis · 2021-01-05T18:47:29Z

Had another discussion.

Going to remove the MultiDatasetTraining tab from web for now while we think about the best way to unify (or separate) the Data tab's data browser with the UI for running training. To have a data browser in multiple places could be confusing, and is likely not a permanent solution.

Bryon convinced me that #252 should be closed because the browser is valuable, at least as long as Girder 3 is our platform.

Instead, this PR is going to focus on finishing training implementation on desktop. This will become a desktop-only PR and will not change web.

subdavis · 2021-01-13T17:52:11Z

Blocked on availability of new VIAME version with manifest training support.

Ready for review of code quality, design, and testing on pipeline execution.

BryonLewis · 2021-01-14T00:19:48Z

After a quick look at this I may want to base some of the ffmpeg/ffprobe stuff on this and the concept of the viame.ts in the native folder. Mostly because I kind of did a similar structure with the calling of different sources and settings depending on what ffmpeg is found for linux. (VIAME with libx264, local with libx264, fallback to Viame nv_enc). A sort of initial first time test on import to figure out what ffmpeg to run, how to run it and where to find it, which then will be used on all subsequent runs.

subdavis · 2021-01-14T01:50:02Z

Only if you think you should. I refactored because the overlap was like 90%.

If there's significant difference between linux and windows, just make them separate functions. We can always refactor later. I just had a quick look at your changes and I have no problem with a little bit of duplication.

BryonLewis

Couple of very minor things. Obviously I can't test desktop training yet but will once it's available. Basic testing seems that with some small fixes (on windows) everything else works base on my tests.

Test Procedure:

Linux Desktop: loading [image list & video] -> saving/reloading -> running pipelines [image list & video] -> looking at training general tab -> training behavior when it can't find kwiver
Windows Desktop: loading [image list & video] -> saving/reloading -> running pipelines [image list & video] -> looking at training general tab -> training behavior when it can't find kwiver
Web Testing: loading [image list & video] -> saving/reloading -> running pipelines [image list & video] -> running training [image list & video] -> using training output on another dataset

BryonLewis

Just adding some other minor things for windows before I forget.

BryonLewis

Windows version with a modified VIAME Install works. I didn't go through the entire training process each time, but I did have it complete at least once.

I think the final part (displaying trained pipelines for usage) could be added to another PR.

subdavis force-pushed the master branch from bef8d73 to c9c142e Compare December 10, 2020 19:57

subdavis mentioned this pull request Dec 21, 2020

Multiple dataset training Web Support #503

Merged

subdavis force-pushed the client/training-ui branch 2 times, most recently from eb0b160 to e3bf1ce Compare January 4, 2021 19:19

subdavis marked this pull request as ready for review January 4, 2021 21:09

subdavis requested review from BryonLewis and jjnesbitt January 4, 2021 21:09

subdavis mentioned this pull request Jan 7, 2021

Desktop CSV Serializer #522

Merged

subdavis force-pushed the client/training-ui branch 3 times, most recently from 3e012ef to edcef31 Compare January 12, 2021 02:42

subdavis mentioned this pull request Jan 12, 2021

Rename VIAME-Web to DIVE #502

Closed

Desktop Training

a1c52a6

subdavis force-pushed the client/training-ui branch from c16bee8 to a1c52a6 Compare January 13, 2021 17:39

Resolve merge inaccuracy

680c88f

BryonLewis requested changes Jan 14, 2021

View reviewed changes

subdavis added 2 commits January 15, 2021 15:28

training changes

7014028

QUOTE ALL THE FILEPATHS

b674766

BryonLewis requested changes Jan 17, 2021

View reviewed changes

Comment thread client/platform/desktop/backend/native/windows.ts Outdated

Comment thread client/platform/desktop/frontend/components/MultiTrainingMenu.vue

Windows changes

915849e

Merge branch 'master' into client/training-ui

afef5c3

BryonLewis approved these changes Jan 18, 2021

View reviewed changes

subdavis merged commit 775bbb3 into master Jan 18, 2021

subdavis deleted the client/training-ui branch January 18, 2021 17:27

subdavis mentioned this pull request Jan 20, 2021

Custom branding #532

Merged

f4str mentioned this pull request Jun 7, 2021

Drop "file browser" interface #252

Closed

Conversation

subdavis commented Dec 10, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

subdavis commented Jan 4, 2021

Uh oh!

subdavis commented Jan 4, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Training

Pipelines

Management

Uh oh!

BryonLewis commented Jan 4, 2021

Uh oh!

subdavis commented Jan 4, 2021

How to proceed?

Uh oh!

BryonLewis commented Jan 4, 2021

Uh oh!

BryonLewis commented Jan 5, 2021

Uh oh!

subdavis commented Jan 5, 2021

Uh oh!

subdavis commented Jan 13, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

BryonLewis commented Jan 14, 2021

Uh oh!

subdavis commented Jan 14, 2021

Uh oh!

BryonLewis left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

BryonLewis left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

BryonLewis left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

subdavis commented Dec 10, 2020 •

edited

Loading

subdavis commented Jan 4, 2021 •

edited

Loading

subdavis commented Jan 13, 2021 •

edited

Loading