Skip to content

Test repo for SLiMScape: a Cytoscape plugin for discovery and analysis of Short Linear Motifs

License

Notifications You must be signed in to change notification settings

passeriforma/SLiMscape

Repository files navigation

SLiMScape

SLiMScape, a Cytoscape plugin, is a platform for performing short linear motif analyses of protein interaction networks by integrating motif discovery and search tools in a network visualization environment. This aims to aid in the discovery of novel short linear motifs, as well as visualisation of the distribution of known motifs.

This readme gives an overview of the key SLiMScape features. More information can be found at the SLiMScape GitHub wiki and the SLiMSuite blog.

Installation

To install, simply navigate to the Cytoscape App Store and click the install button. This will add the app to your local Cytoscape version.

Input

The minimum input for SLiMScape is an interaction network which contains a Uniprot attribute for each node OR a run ID from the UNSW servers. At the moment, the name of the node needs to be its Uniprot ID or Accession Number for the server to know what to do with it.

An example input with Uniprot IDs is:

name
P54253
O35147
Q61337
P04049
Q99683
Q9EPK5

An example input with a run ID is:

Run ID:15020400003 (SLiMProb run)

SLiMProb will also need query motif input in the form of regular expressions (see FAQs for details).

Once a search has been completed, the nodes that contain SLiMs will change from the default blue circles to red diamonds. The rest of the nodes will remain as they were.

If you want to pre generate a job on the Slimsuite servers, you can do that from here. If you do this, you can input the generated run ID to Cytoscape, and a graph will be generated for you! It's also handy for larger jobs that may take a while to finish.

Results

A run will produce both graph changes, and a results panel.

In the graph, nodes possessing SLiMs will become red diamonds; if a node has more than one SLiM it will be a daker shade of red. Note that there is not currently a layout set automatically, and the user will have to go to the layouts tab at the top of the page, and select one from there in order to see all of the nodes.

Results for SLiMProb are displayed in two panels.

Main Results Panel contains:

  • Motif: The name of the motif.
  • Pattern: The motif definition.
  • IC: Information content.
  • N_Occ: Number of specified SLiM occurrences.
  • N_Seq: Number of nodes the SLiM occurred in.
  • N_Upc: Number of nodes in the unrelated protein cluster (UPC).
  • E_Upc: Enrichment score for the unrelated protein cluster.
  • p_Upc: Significance value of the unrelated protein cluster.
  • pUnd_Upc: Probability of seeing the observed number of occurrences or less of the UPC.

OCC Panel contains:

  • Motif: The motif found in the sequence.
  • Seq: The name of the sequence.
  • Start_Pos: The start position of the sequence.
  • End_Pos: The end position of the sequence.

Results for SLiMFinder and QSLiMFinder are displayed identically, also in two panels:

Main Results Panel contains:

  • Rank: Rank of the motifs found, from most to least common.
  • Sig: Motif significance value.
  • Pattern: Regex of the motif found.
  • IC: Information content of the motif.
  • OCC: Enrichment score for the occurence(s).
  • Support: Number of sequences the motif occurs in (no distinction made between motifs with multiple occurences).
  • UP: Unrelated proteins the motif occurs in.z

OCC Panel contains:

  • Pattern: Regex of the motif found.
  • Sig: Motif significance value.
  • Seq: The name of the sequence.
  • Start_Pos: The start position of the sequence.

Further results can be obtained on the internet, at the URL: http://rest.slimsuite.unsw.edu.au/retrieve&jobid={jobid produced when the run is performed}. Alternatively, there is a button at the bottom of the results panel which will open the above link in your browser.

SLiMProb

SLiMProb searches the protein sequences of the selected nodes for occurrences of a specified regular expressions; useful for locating new instances of a motif found using SLiMFinder (e.g. R[SFYW].S.P). Conventions as used in the program can be found here.

SLiMProb has one input; a SLiM to be searched for in the nodes selected on the graph. To run SLiMProb you must provide a regular expressions in the motifs box and select one or more nodes. Alternatively, if you know the run ID of a previous search, you can instead input that to the Run ID textbox.

If the motif is found, the colour and shape of the target node will change.

SLiMProb Options

Masking

  • Disorder Masking is used to mask residues which have an IUPred disorder score of less than 0.3
  • Conservation Masking is used to mask residues which have a relative local conservation score of less than X.

Custom Parameters

  • Custom parameters can be used to add other command line arguments which can be found here.

##SLiMFinder SLiMFinder aims to discover new motifs in the selected protein interaction network by searching for statistically overrepresented sequences within a set of proteins.

It has no mandatory inputs other than a selection of graph nodes or a run ID. If a motif is found in a node, the node will change colour and shape. The specific motif is presented in an output table, along with the Uniprot ID and other data.

SLiMFinder Options

Masking

  • Disorder Masking is used to mask residues which have an IUPred disorder score of less than 0.3
  • Conservation Masking is used to mask residues which have a relative local conservation score of less than X.
  • Feature Masking is used to mask residues which occur in features such as domains or transmembrane regions.

SLiMBuild

  • Maximum SLiM length lets the user set the maximum length of SLiMs to return.
  • Maximum consecutive wildcards allows the user to alter the number of wildcards to return.
  • Ambiguous motifs either allows or disallows the returning of ambiguous motifs.

Miscellaneous Options

  • Custom parameters is used to add other command line arguments which can be found here.

QSLiMFinder

QSLiMFinder is a modification of SLiMFinder, aimed at looking for SLiMs shared by a query sequence and one or more additional sequences. SLiMs are drawn from the "Query" sequence, then searched for in the other nodes in the set.

QSLiMFinder requires a "Query" input; the Uniprot ID or Accession Number for the node you'd like to get your SLiMs from. This node does not have to be present in the graph. If a motif is found in a selected node, the node will change colour and shape. The specific motif is presented in an output table, along with the Uniprot ID and other data.

QSLiMFinder Options

Masking

  • Disorder Masking is used to mask residues which have an IUPred disorder score of less than 0.3
  • Conservation Masking is used to mask residues which have a relative local conservation score of less than X.
  • Feature Masking is used to mask residues which occur in features such as domains or transmembrane regions.

SLiMBuild

  • Maximum SLiM length lets the user set the maximum length of SLiMs to return.
  • Maximum consecutive wildcards allows the user to alter the number of wildcards to return.
  • Ambiguous motifs either allows or disallows the returning of ambiguous motifs.

Miscellaneous Options

  • Custom parameters is used to add other command line arguments which can be found here.

About

Test repo for SLiMScape: a Cytoscape plugin for discovery and analysis of Short Linear Motifs

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages