-
Notifications
You must be signed in to change notification settings - Fork 89
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create GO-CAM v1.0 landing page for end users #558
Comments
@cmungall I'm not sure why we'd want it out of the main pipeline, unless we need cycles faster that a day or so. As it stands, mostly of the things needed should be or will be part of the pipeline (expect, of cource, TBD formats, etc.). |
As we consider how best to provide end users access, I'd like to suggest thinking about a portion of the Web app driven directly from SPARQL query. This would allow for a flexible UI that really can take advantage of the network structure being assembled both within and across linked go-cams. I think this pattern will likely be of use in Noctua itself as well. This concept is coming from thinking through how to provide curators access to the knowledge in the Reactome-generated GO-CAMs. In looking at them in Noctua-dev and discussing how curators would want to use them with @vanaukenk , it is apparent that something else that is more flexible and more user-request driven is needed. These models are large and interconnected. The large models don't work well in the editor and its hard to know where exactly one should stop and another one should begin (which makes a lot of sense from the biology standpoint..). As an example, @thomaspd was looking at the mapping of reactome for Wnt signaling - specifically 'TCF-dependent signaling in response to Wnt' go-cam, reactome. He immediately wanted to see the subpathways involved (e.g. WNT mediated activation of DVL in the same graphical context but they are divided up right now into independent, unlinked models, making navigation via Noctua difficult. Given access to the graph of all models, the UI could grow out from the point of interest as requested by the user. This could be captured graphically in a Noctua-like network view or via other more tabular paradigms. The key is that the user sees as much as they want to see, of what and from where at any given time and no arbitrary lines are drawn around subsections of the knowledge base. |
If that's ok, I will start working on a mockup of the landing page, just to be clear on what we want (resp. don't want) and how to display it. Proposed Plan:
PS1: @cmungall URL: I would choose /cam since www.geneontology.org is already stating the "go". Also, I would add to the to-do list a "Statistics" section (how many models, what species, how many "annoton" / model, how many relation used for each type, etc) PS2: @goodb your note on a more interactive / connected graph is quite true and is linked to this issue: #519 . I have worked with D3 in the past so I could come up with some more interactive graph with collapsible nodes (D3 example)... But depending of other projects, It would probably not be for the 1.0 release |
@lpalbou The future framework for geneontology/noctua#399 is about reusable components within the world of Noctua, AmiGO, and other places where we want to have separation and components written once and reused. This should definitely not preclude the use of other frameworks other places for unique or one-off page/apps, like possibly a new landing page. |
I agree with @goodb that it would be quite cool for a user to be able to build up a sort of "knowledge graph" that looks across multiple models. One thing to consider is how to represent links across class instances that live in different models. Probably fine at the general knowledge graph level, but as you traverse some class from one model to another, the context may be quite different and so the relationships wouldn't have the same meaning as within an OWL ontology. |
@lpalbou two thoughts here. Unless you are very good/fast at web dev is might be faster / more productive to first build a mockup for the design in a tool like https://balsamiq.com versus making something 'real'. Really depends on you and are your skill set, but for me, that has worked very well in the past. Regarding graph view implementations, I'd definitely urge a look at higher level libs e.g. cytoscape.js before making the call to go directly to d3. There is a lot there to take advantage of that you might end up having to re-write in d3. It does have its own problems, but something to look at that we've built for the translator project that does the graph explorer thing with a combo of cytoscape.js on the front end and neo4j on the back is up at https://tkbio.ncats.io/ (its going very very slow today.. but still can work if you are patient). There is code there that might be useful. @balhoff regarding explorer (maybe but not necessarily graph..) views, its not just the ability to see connections between different models, its control over the that parts of models that you see at one point in time. Loading up more than 7 things at once into a view is very rarely a good idea and happens almost immediately when people get their hands on graph layout tools ;).. For complex things like this interactive, user-centered expansion/contraction is very important to produce one way or another. |
Sorry I should have scoped this ticket a bit better. Let's take ideas for v1.1 (post-May) over to #561 and keep this on what we're doing for May |
@goodb thanks for the link, I did not know balsamiq, but coding a mockup with angular will not take me long (1-3 hours top). Most of the time is really more on what to show and how, and can we retrieve the data or not. |
Here is a first graphical mockup: GO-CAM mockup |
Nice!
|
Thanks! This looks great. My main suggestion at this point is to make the download section/links really prominent, maybe by moving toward the top. Also, it would be nice to have a thumbnail of a GO-CAM image in the top box. For the statistics, we should also track the number of triples in the repository. |
I like it !!!
Thanks ! Pascale |
Note from hack day:
|
Idea. Show running list of most recently created/modified models, perhaps showing what references were used in the CAM. Users like to read 'the news' and it makes the site feel alive. (like or perhaps even driven by a twitter update feed widget) |
Thanks for the comments and useful suggestions. I will integrate them in the next mockup, as well as a few others from this morning hack day meeting. Good news also: when we have reached a sufficiently stable version, Julie M. and Suzanna L. have agreed to show the site to a few biologists that are new to these concepts in order to gather their feedbacks. |
OK, sorry for the lack of update, but here is the New Mockup Version Notes:
Additional Notes:
|
@lpalbou looking really good. Some thoughts:
|
I think there are multiple approaches to this depending on the use case. We can provide a pre-generated combined OWL file with an import module merged in, that would allow for cross-model exploration. For people who want to look at individual models in Protege, we can provide a docker container that makes a module and writes a catalog file. But really Protege needs better dynamic module creation abilities. |
@goodb Thanks for the comments ! A few answers/comments:
|
@lpalbou Go ahead and like "that view". I will update it a little bit so that it is a little less dumpster fiery and pull in the pathwayview. |
@lpalbou To clarify for future historians: I am referring to the tomodachi URL. That is a temporary "dev" URL until AmiGO production is transferred to Berkeley, at which point we would change it. |
That is correct--this is an issue with the loader: owlcollab/owltools#212 that @yy20716 is currently looking at, mentioned yesterday. |
Let's come up with another name than "that view". We can subdivide graphical views into 3 categories based on their content
This is not exhaustive as novel combinations are possible. We can also have a separate orthogonal partition based on jsplumb vs cytoscapejs etc; whether nesting is employed etc. Different views serve different purposes (although I'm not sure there is a use case for an occurrent-centric view, I would move to deprecate this). Entity-centric views are very familiar and will give people the warm fuzzies, as they will think of every other interaction display they have ever seen. This view will obviously scale better to multiple models or large models. But these are problematic if the goal is to demonstrate clearly what a GO-CAM is, because it excludes the GO terms. We run the risk of confusing people and having them think we are building Yet Another interaction database. For pedagogic purposes I would argue for having some form of complete view as primary while we are still in the process of rolling out the concept to the community, when showing an individual model. Of course, we will soon want to show combined models in which case some form of reduction at the zoomed out level will be required. We used to have a complete view rendered using graphviz in amigo, I rather liked that, not sure what happened to that. |
I think that as a first priority we need a stable URL for each model that resolves to a page with multiple ways to view that model including one that is the editor (including access control of course). Perhaps before (or while) digging into building the various views and arguing over which one shows up as the default it would be good to get the URL patterns in place and stable so that links could start being dependable. Then we can iterate indefinitely on the specific views shown - including linked-data views 'shown' to RDF-aware software and content shown to search engines.
I actually found this the most helpful view for understanding the go-cam model style. It helped break me free from the inclination to see entity-centric interaction maps. In separate discussions with Huaiyu he pointed me to http://sbgn.github.io/sbgn/ and suggested that their Activity Flow pattern was the best match to the go-cam modeling paradigm. And activity flow is pretty much what I see in "that view". So anyway I wouldn't drop it from the list of possible views just yet. |
Hi, this is the most recent update of the GO-CAM site: Notes
WIP
Additional NoteThere will also be an "expert page" for each biological process. The idea is to give access to the list of experts (both curators and authors from articles) working on each specific biological process. |
@lpalbou looks great! I would replace "downloaded (as ttl)" with "downloaded (as RDF Turtle)". |
This looks nice. I am going to modify the Dlk1 example. I notice that it does not include the asserted regulatory processes. |
What is included in the 20 MB |
@lpalbou it looks good. Probably a good time to circle back with @jmcmurry about getting it in front of some real users. Would be helpful for those tests to write down the specific use cases the design is intended to support to direct and quantify the user evaluation. (e.g. bio-user: 'find a model with a particular gene in it', info-user:'download sif file', etc.) |
@lpalbou Yes, this looks really nice. I have a few questions about the NEDD4 example. http://noctua.berkeleybop.org/editor/graph/gomodel:5323da1800000002 For example, the relationship between NEDD4 ligase activity and the ubiquitin-dependent protein catabolic process is different and the example uses the has_substrate relation rather than has_input. What is the source of the example model? @thomaspd - should I update the NEDD4 example in Noctua and add appropriate evidence? |
Thanks, updating the example would be very helpful! The source is https://www.ncbi.nlm.nih.gov/pubmed/17996703 The has_substrate relation is, I think, not the official name, but it will be more meaningful to users. |
Thanks @thomaspd - I will align the landing page and graph editor NEDD4 models and let you know if I have any questions. Wrt the 'has substrate' relation, I agree that it is more meaningful to users, but note that it is neither in RO nor gorel as a name or id, respectively. In gorel, 'has substrate' is a narrow synonym of 'has direct input', and in RO there is a comment under 'has direct input' that the term may be obsoleted and replaced with 'has substrate'. http://www.ontobee.org/ontology/RO?iri=http://purl.obolibrary.org/obo/RO_0002400 I raise the issue because I have gotten feedback from curators that it is difficult to know which relations are the right ones to use and so I think we need to be consistent with what is used in Noctua vs what is displayed vs what is in the GPAD output, etc. What is more meaningful to users is probably more meaningful to curators, too! |
DavidOS and I worked on this proposal for has-substrate |
Thanks for the feedbacks 👍 Some answers
@ukemi let me know when the model is updated, and I'll change the example on the main page @goodb yes, but before showing to real users, I would prefer to correct a few things first (e.g. missing BPs, MFs, update the REST endpoint, create the SPARQL example page, update some documentations). Let's give us one week to discuss and implement the changes we want before. @vanaukenk @thomaspd thanks for the feedbacks on NEDD4. Indeed has_substrate is more meaningful to users but has_input is the true relation and it's already difficult for curators to select the right relations. Let me know which you prefer or what modification you would like. GO-CAMs CarouselIt would be great if the community could select 2-3 additional GO-CAMs or create new schemas / animations to:
Usage ScenariosI know some of you have been working on scenario of usages, could you share the link(s) with your findings here, so I can highlight these usages on the main page ? |
The model is updated. |
I've updated and fully referenced the NEDD4 model: I made some slight changes to the model based on the data in the paper, but overall I think it still reflects the biology as intended. I also added supporting evidence statements to each edge for reference. Paul, we should go over the model together to make sure you're okay with the final version. Thx. |
Hi, a few updates:
Also, since we are able to display 6 models in the main page carousel, I thought it may be interesting to start with one model from each MOD, what do you think ? If you agree, perhaps each MOD could select a specific model to be presented on this carousel ? Thanks |
Live on geneontology.org/gocam [Edit] Known Current Bugs
Furthermore, some queries can further be optimized, in particular the GroupMeta SPARQL Query. I also have to update the swagger documentation of the current REST API |
New version online, including some design updates to highlight the browsing capabilities and make the site more interactive. Technical Update
The site is also now fully https (distributed through CloudFront with SSL certificate) Note: pages are served using the following REST API https://api.geneontology.cloud (e.g. https://api.geneontology.cloud/models) |
what is the plan for getting this in front of users? |
We're currently trying to work out things like harmonization and deployment, started here: |
Well, it's still unclear to me what you would want for harmonization (we have no particular reason to rewrite the gocam site in vue2; we can harmonize the styling but with what ? do we want amigo styling everywhere ? gocam styling everywhere ? or something else to use also on the future go site ?) and deployment (the site is hosted on S3 + Cloudfront with SSL certificate from geneontology.cloud while waiting for a full transition to geneontology.org/cam or /gocam and the data is fetch directly from the production rdf endpoint, so any time there is a new GO release, the site is automatically updated). Not to say there is nothing to improve: for instance the documentation on the wiki is still a problem, I could fetch the data from BioLink once my API is transferred and deployed there, we could create additional views, as suggested by Pascale, more biologically oriented, to present users with large and common families of processes, as another way to explore the current gocams, etc. But the gocam site is up and running since the NYC meeting and got a few improvements since (multiple different ways to search, also showing PMIDs, MF, CC, GPs, etc). |
By “get in front of users” I believe she was referring to user testing, not deployment or implementation details. Julie, please correct me if I’m wrong.
… On Aug 9, 2018, at 6:11 AM, lpalbou ***@***.***> wrote:
Well, it's still unclear to me what you would want for harmonization (we have no particular reason to rewrite the gocam site in vue2; we can harmonize the styling but with what ? do we want amigo styling everywhere ? gocam styling everywhere ? or something else to use also on the future go site ?) and deployment (the site is hosted on S3 + Cloudfront with SSL certificate from geneontology.cloud while waiting for a full transition to geneontology.org/cam or /gocam and the data is fetch directly from the production rdf endpoint, so any time there is a new GO release, the site is automatically updated).
Not to say there is nothing to improve: for instance the documentation on the wiki is still a problem, I could fetch the data from BioLink once my API is transferred and deployed there, we could create additional views, as suggested by Pascale, more biologically oriented, to present users with large and common families of processes, as another way to explore the current gocams, etc. But the gocam site is up and running since the NYC meeting and got a few improvements since (multiple different ways to search, also showing PMIDs, MF, CC, GPs, etc).
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub, or mute the thread.
|
@goodb yes, sorry, I was more answering Seth than Julie. The site is here and works. I have gathered feedbacks and improvement requests from curators, but it's indeed not sufficient and does not cover all usages. Do you think you could help, Julie, in gathering some additional user feedbacks about what they like / don't like / what they would want or expect ? I do think we may need the additional page suggested by Pascale, to browse models based on common families of processes (e.g. transcription, etc). |
Aged out |
We need a landing page for GO-CAM, that is designed for users (not curators) to access and download the production GO-CAM models. This would be distinct from the curator landing page at noctua.geneontology.org. This would be on the main GO website (drupal).
v1
by May meeting
page/cam
orpage/go-cam/
or/page/GO-CAM/
v1.1
Dependencies
@kltm should we couple to main pipeline or have this be its own stream? We have a separate ticket for the jobs required:
#169
Additional notes
The text was updated successfully, but these errors were encountered: