Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow archiving histories and workflows #1734

Open
hexylena opened this issue Feb 15, 2016 · 18 comments
Open

Allow archiving histories and workflows #1734

hexylena opened this issue Feb 15, 2016 · 18 comments

Comments

@hexylena
Copy link
Member

hexylena commented Feb 15, 2016

My users report having "too many histories" to deal with effectively. I feel their pain, and there are histories that I'm not going to reference very often but would like to retain forever.

I'd like to propose an "archive" for histories, it's another state that histories can transition into.

Properties:

  • no modifications allowed (unless unarchived) (this need not be done for the initial implementation)
  • not displayed by default (in the context of the "old" history menu where you have to click the advanced -> deleted to see deleted histories)

Update (2016-05-16): I've found the same issue with workflows. There are workflows which I should retain for historical reasons, but will never use again (or are broken, but we want to record exactly what was run)

@blankenberg
Copy link
Member

To add to this, I'd actually find just a 'locked' attribute to be great to have separately from archived.

This is particularly important for e.g. published histories (especially ones referenced in papers) -- it is a sad day when you accidentally add some cruft to that perfectly created history that you want to share (instead of the copy you made).

@blankenberg
Copy link
Member

Instead of just archived, could we create "Groups" for histories, where "Archived" could be a particular group? That way we can organize things even a bit better.

@hexylena
Copy link
Member Author

@blankenberg sorry, missed your replies. A locked attribute would be amazing, I definitely agree w/r/t published histories.

Groups is an interesting thought. What sort of other groups would you envision having?

@martenson martenson mentioned this issue Mar 15, 2016
@hexylena hexylena changed the title Allow archiving histories Allow archiving histories and workflows May 16, 2016
@hexylena
Copy link
Member Author

Update: added workflows to this list. Same problem. I want them to be locked down and moved to a different area, out of my sight.

@lparsons
Copy link
Contributor

This sounds like a great idea. I fully support it. I'd like to add another suggestion: the ability to delete underlying datasets in a history (to conserve space), but save the tool. Pretty much the same as a deleted dataset, but without deleting the item in the history. It would be great for users to be able to archive (online and offline) certain outputs (counts, fold changes, plots, etc.) but not some of the redundant or big data (raw fastq stored elsewhere, bam files, etc.). Of course, preserving the provenance of those final outputs would be needed.

Not sure if I made this totally clear, so please let me know if further clarification would be helpful or if there is a better place to record this.

@dannon dannon added this to Histories in The Roadmap Apr 20, 2017
@hexylena
Copy link
Member Author

related: #3269

@bimbam23
Copy link

bimbam23 commented Jul 5, 2019

Ideas on how to archive data which is old, unused, ...

  • add location where to archive data (maybe in galaxy.yml?)
  • find out how old datasets are or if they are still used?
  • define live time of data when uploading
  • change color (gray) if dataset is archived
  • make request of archived data if need

@hexylena
Copy link
Member Author

hexylena commented Aug 4, 2020

I think I heard some discussion somewhere that an "archive histories" function could also enable moving them to colder, tape storage (for admins that configure it) which would be super fantastic.

@nomadscientist
Copy link
Contributor

This would be brilliant! I have major issues with accidentally modifying histories I previously shared as examples, I'm a big fan of this. Plus presumably this could make them citable as well. i.e. 'made this tool, history is here for its use, cite me for being awesome' or some such.

@bernt-matthias
Copy link
Contributor

Like the idea:

moving them to colder, tape storage

Wanted something like this for a long time.

@bimbam23
Copy link

It's alive again ;-)

@bgruening
Copy link
Member

This is the current history table.

id | create_time | update_time | user_id | name | hid_counter | deleted | purged | genome_build | importable | slug | published | importing

We could add a new column locked and include a toggle in the UI to freeze a history. Locked histories are not mutable. The complicated part is a little bit where do we need to add the checkpoints if history is locked? We probably also want to permit changing datatypes in this history, e.g. metadata.

@blankenberg would tags instead of groups work? We have the tagging infrastructure in place. We could think about a tag-centric UI where you see all histories with the same tag in a tree-view?

Could the archived or please-archive feature be a tag with special meaning?

@dannon dannon added this to the 22.09 milestone Jul 21, 2022
@hexylena
Copy link
Member Author

hexylena commented Oct 5, 2022

@foellmelanie mentioned this during her talk, @Slugger70 DM'd me about it as well his users want this. Especially in the context of RO-Crate maybe we can finally do something about this.

Archive the history, and move it to another system with storage (e.g. zenodo)

@Slugger70
Copy link
Contributor

Or other cheap storage like an object store (swift, s3 etc.)

@hexylena
Copy link
Member Author

hexylena commented Oct 5, 2022

Yes! Cheap object store, exactly. Could be tape. Could be anything, just a method (especially using the File Sources) letting users archive something and have a URL they can point to that sufficiently describes it, while not taking up space.

@dannon
Copy link
Member

dannon commented Oct 5, 2022

Pinging @davelopez who presented some ideas for this at a recent meeting; we're hoping to address this soon.

@davelopez
Copy link
Contributor

Yes, will open a planning issue soon and link it here 👍

@davelopez davelopez mentioned this issue Oct 10, 2022
9 tasks
@martenson martenson added this to 23.1 To do in Backend Working Group Oct 28, 2022
@dannon dannon removed this from the 23.0 milestone Feb 9, 2023
@dannon dannon added this to the 23.1 milestone Feb 9, 2023
@davelopez davelopez modified the milestones: 23.1, 23.2 Jul 21, 2023
@davelopez
Copy link
Contributor

The part for histories is already done in #16003

I'll keep this open for the potential Workflow part but I'll move the milestone.

@jdavcs jdavcs modified the milestones: 23.2, 24.0 Jan 10, 2024
@jdavcs jdavcs removed this from the 24.0 milestone Mar 1, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
The Roadmap
Histories
Development

No branches or pull requests