We present an R Shiny application that allows for users to merge multiple datasets of expression and meta data, as well as the ability to perform a brief check of the quality between the datasets.
This application can be downloaded and ran locally or through out public mergeQC page below.
http://shawlab-moffitt.shinyapps.io/mergeqc
| shiny_1.8.1.1 | shinycssloaders_1.0.0 | shinyjqui_0.4.1 | svglite_2.1.3 |
| ggplot2_3.5.1 | dplyr_1.1.4 | reshape2_1.4.4 | plotly_4.10.4 |
| RecordLinkage_0.4-12.4 | DT_0.33 | data.table_1.15.4 |
After the required pacakges are installed and the app.R file is in a local repository or your preference, you can run the app using the command runApp("[path to app.R file]")
- Example data is available to download through the GitHub repo or through a link on the app interface.
- Tab-delimited data table
- First column consists of feature names
- Following columns are of numeric expression values, with the column names referring to the sample name
- Tab-delimited data table
- The first column should consist of sample names that are found in the expression data column names
- The following columns can be additional information on these samples
Use CTRL + left click to select the proper files for each input section
- Expression matrices are merged based on similar features between input data. Users can view and download both an inner merged matrix or full merged matrix. The inner merged matrix will only contain features that were similar throughout all of the matrices input. The full merged matrix will contain all genes from the input matrices, and if there is a gene not found in one matrix, that value will be NA.
- The meta data will be merged based on similar sample names found in the first column and similar column names. If similarities are not found the data will be added as additional rows and NA values in columns that do not match.
- At the top of the data preview page a summary of the merged data will be displayed.
- Users can view the density of the average expression over all genes in each data set, as well as overlap datasets by selecting multiple rows.
- This can help users check their data and see if any transformations need to be done to ensure it is comparable to the other datasets.
- Users can select datasets on the sidebar panel and the datasets selected will be log2 transformed and the data will be updated on all tabs.
- Users can select two datasets to compare expression data through an interactive scatter plot.
- Upon hover users can see the gene and expression value from each dataset.
- The red points have a higher expression in the y-axis dataset, and the bluse have a higher expression int eh x-axis dataset.
- User can also sear for and highlight specifc genes on the sidebar.
