# Example Manuscript

An outline of what to include in a [Nature Protocols](https://www.nature.com/nprot/for-authors/preparing-your-submission) paper.  
The `Keywords` section at the end of this notebook shows which sections will take text from the website/notebooks. 

## Title

Nature Protocols Recommendations: 30 words max

## Authors


Hao Sun, Francis Grenn, developers and leaders on [website table](https://cumc.github.io/xqtl-pipeline/README.html#our-team)

## Abstract


Nature Protocols Recommendations: 250 words max

## Introduction


Nature Protocols Recommendations: could use suggested subheadings (below)

### Development of the protocol


Nature Protocols Recommendations: include references to our own peer-reviewed primary research publications

### Applications of the method


Nature Protocols Recommendations: discuss diversity of the applications of the method

### Comparison with other methods


Nature Protocols Recommendations: Reference alternative methods that are commonly used to achieve results similar to those of the protocol. Discuss advantages and disadvantages of our protocol compared to alternatives.

### Experimental Design


Nature Protocols Recommendations: Information on design of experiments that would allow readers to adapt the protocol to their own experiments. Also discuss controls necessary for the protocol. 

Description of tools and parts in the `Procedure` section below.  
Include a paragraph describing each tool/part, extracted from the `Description` part of the module page/notebook for the website.

#### Molecular Phenotype Quantification (Step 1)

paragraph(s) extracted from `Description` section of RNA-seq Expression module website/notebook page  
paragraph(s) extracted from `Description` section of scRNA-seq Expression Calling module website/notebook page  
paragraph(s) extracted from `Description` section of Alternative Polyadenylation module website/notebook page  
paragraph(s) extracted from `Description` section of Methylation module website/notebook page  
paragraph(s) extracted from `Description` section of Alternative Splicing from RNA-seq Data module website/notebook page  

#### Data Pre-Processing (Step 2)

paragraph(s) extracted from `Description` section of Genotype Data Preprocessing module website/notebook page  
paragraph(s) extracted from `Description` section of Phenotype Data Preprocessing module website/notebook page  
paragraph(s) extracted from `Description` section of Covariate Data Preprocessing module website/notebook page  

#### QTL Association Analysis (Step 3)


paragraph(s) extracted from `Description` section of cisQTL Analysis Workflows module website/notebook page  
paragraph(s) extracted from `Description` section of transQTL Analysis Workflows module website/notebook page  

#### Integrative Analysis (Step 4)

### Expertise needed to implement the protocol


Nature Protocols Recommendations: Note if usage of protocol requires a specific facility or specific training.

### Limitations


Nature Protocols Recommendations: Discuss situations where protocol is unreliable 

## Materials

### Software

### Hardware

## Procedure


Nature Protocols Recommendations: 
* Numbered list of direct experimental instructions. 
* Use active, not passive, voice
* Include subheadings to separate stages
* Include a TIMING callout with each subheading and state how long the section will take to complete
* Include PAUSE POINT flag after steps where you can resume the rest of the procedure later (I think that is all steps in our case, so no need)
* Include CRITICAL STEP flag with explanation for steps that need to be performed in a precise manner to maximize likelihood of success 
* Include TROUBLESHOOTING flags after steps where problems may be encountered. Include details in the Troubleshooting main section below.
* Use letters (A, B, C, ...) to identify different options
* Use Roman numerals (i, ii, iii, ...) to break down steps


### 1. Molecular Phenotype Quantification
Any one of the steps below must be completed

#### A) RNA-seq Expression

extract `Minimal Working Example Steps` text from the different module webpages/notebooks under the RNA-seq Expression miniprotocol. If no module pages, then just use the steps outlined on the miniprotocol.

#### B) scRNA-seq Expression Calling

#### C) Alternative Polyadenylation

#### D) Methylation

#### E) Alternative Splicing from RNA-seq Data

### 2. Data Pre-Processing
Each step below must be completed

#### i) Genotype Data Preprocessing

#### ii) Phenotype Data Preprocessing

#### iii) Covariate Data Preprocessing

### 3. QTL Association Analysis
One of the steps below must be completed

#### A) cisQTL Analysis Workflows

#### B) transQTL Analysis Workflows

### 4. Integrative Analysis

## Timing


Nature Protocols Recommendations: include timeline indicating time each step will take.

For us:  
Extract the `Miniprotocol Timing` text from the miniprotocol notebooks. The smaller module notebooks may have timing information as well, but that belongs in the `Procedure` section above, which is taken from the module notebooks. The table below therefore lists only the timing from each miniprotocol notebook, that is, the overall time needed to run all steps in that miniprotocol and its modules. 

For example:  
Running all steps for RNA-seq Expression takes \~4 hours. List the "\~4 hours" in the table here.  
Each module notebook under that (RNA_calling.ipynb, bulk_expression_QC.ipynb, bulk_expression_normalization.ipynb) would have its own timing data, but that would be listed in the `Procedure` section above under each step.

| Step | Substep | Time |
|------|---------|------|
|Molecular Phenotype Quantification |RNA-seq Expression|  extract `Miniprotocol Timing` text from miniprotocol notebook |
| |scRNA-seq Expression Calling|  extract `Miniprotocol Timing` text from miniprotocol notebook |
| |Alternative Polyadenylation|  extract `Miniprotocol Timing` text from miniprotocol notebook |
| |Methylation|  extract `Miniprotocol Timing` text from miniprotocol notebook |
| |Alternative Splicing from RNA-seq Data|  extract `Miniprotocol Timing` text from miniprotocol notebook |
|Data Pre-Processing |Genotype Data Preprocessing|  extract `Miniprotocol Timing` text from miniprotocol notebook |
| |Phenotype Data Preprocessing|  extract `Miniprotocol Timing` text from miniprotocol notebook |
| |Covariate Data Preprocessing|  extract `Miniprotocol Timing` text from miniprotocol notebook |
|QTL Association Analysis |cisQTL Analysis Workflows|  extract `Miniprotocol Timing` text from miniprotocol notebook |
| |transQTL Analysis Workflows|  extract `Miniprotocol Timing` text from miniprotocol notebook |
|Integrative Analysis | ... |  extract `Miniprotocol Timing` text from miniprotocol notebook |
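
Once the `Miniprotocol Timing` text has been collected, the table above could be generated rather than edited by hand. The following is a minimal sketch under stated assumptions: `timing_table` is a hypothetical helper, the rows are supplied as `(step, substep, time)` tuples, and the only real value used is the "\~4 hours" example given above; `TBD` is a placeholder for timings not yet extracted.

```python
def timing_table(rows):
    """Render (step, substep, time) tuples as a markdown table.

    The step name is printed only on the first row of each step,
    matching the layout of the hand-written table in this outline.
    """
    lines = ["| Step | Substep | Time |", "|------|---------|------|"]
    previous_step = None
    for step, substep, time in rows:
        # Leave the Step cell blank when it repeats the previous row.
        shown = step if step != previous_step else ""
        lines.append(f"| {shown} | {substep} | {time} |")
        previous_step = step
    return "\n".join(lines)


# Hypothetical input: only the RNA-seq value comes from the example above.
example_rows = [
    ("Molecular Phenotype Quantification", "RNA-seq Expression", "~4 hours"),
    ("Molecular Phenotype Quantification", "Methylation", "TBD"),
    ("Data Pre-Processing", "Genotype Data Preprocessing", "TBD"),
]

if __name__ == "__main__":
    print(timing_table(example_rows))
```

Keeping the timings as plain tuples means the same data can feed both this table and any TIMING callouts in the `Procedure` section.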

## Troubleshooting


Nature Protocols Recommendations: 
* Information on how to troubleshoot the most likely problems mentioned in the `Procedure` section
* Provide this information in a table

For us:  
This would be extracted from the `Troubleshooting` table of the module notebooks

| Step | Substep | Problem | Possible Reason | Solution |
|------|---------|---------|------------------|---------|
| Molecular Phenotype Quantification | RNA-seq Expression iii) | STAR and Picard run separately | STAR_align,picard_qc,strand_detect were run separately | run these together by running STAR_output |
| Molecular Phenotype Quantification | RNA-seq Expression v) | Error when running RSEM | STAR alignment not run with gtf file generated before collapsing to gene | use the gtf file used to generate the RSEM index |
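
Merging the per-module `Troubleshooting` tables into the combined table above could look roughly like the sketch below. It is an illustration only: `parse_markdown_table` and `tag_rows` are hypothetical helpers, and the sketch assumes each module notebook stores its table as a pipe-delimited markdown table with `Problem`, `Possible Reason`, and `Solution` columns and no escaped pipes inside cells.

```python
def split_row(line):
    """Split one pipe-delimited table line into stripped cell strings."""
    # removeprefix/removesuffix need Python 3.9 or newer.
    inner = line.strip().removeprefix("|").removesuffix("|")
    return [cell.strip() for cell in inner.split("|")]


def parse_markdown_table(text):
    """Parse a markdown table into a list of dicts keyed by header name."""
    rows = [line for line in text.splitlines() if line.strip().startswith("|")]
    if len(rows) < 2:
        return []
    header = split_row(rows[0])
    # rows[1] is the |---|---| separator line, so data starts at rows[2].
    return [dict(zip(header, split_row(line))) for line in rows[2:]]


def tag_rows(rows, step, substep):
    """Prefix each parsed row with the Step and Substep it belongs to."""
    return [{"Step": step, "Substep": substep, **row} for row in rows]


# Example text reuses a row from the table above; the module-level
# layout (three columns) is an assumption.
module_table = """
| Problem | Possible Reason | Solution |
|---------|-----------------|----------|
| Error when running RSEM | STAR alignment not run with gtf file generated before collapsing to gene | use the gtf file used to generate the RSEM index |
"""

if __name__ == "__main__":
    parsed = parse_markdown_table(module_table)
    for row in tag_rows(parsed, "Molecular Phenotype Quantification", "RNA-seq Expression v)"):
        print(row)
```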



## Anticipated Results


Nature Protocols Recommendations: 
* Information about likely outcome of protocol
* If possible, include one set of data from an experiment that worked well, and a second for an experiment that required troubleshooting to get meaningful results.
* Include advice on how to interpret and analyze raw data, including equations where needed.

For us:  
Anticipated results from steps in the `Procedure` section above.  
Include a paragraph for each step, extracted from the `Anticipated Results` part of the miniprotocol website/notebooks. This would not include exact details of the results of each individual notebook run in each module, just the output of the final one in the miniprotocol. 

#### Molecular Phenotype Quantification (Step 1)

paragraph(s) extracted from `Anticipated Results` section of RNA-seq Expression miniprotocol website/notebook page  
paragraph(s) extracted from `Anticipated Results` section of scRNA-seq Expression Calling miniprotocol website/notebook page  
paragraph(s) extracted from `Anticipated Results` section of Alternative Polyadenylation miniprotocol website/notebook page  
paragraph(s) extracted from `Anticipated Results` section of Methylation miniprotocol website/notebook page  
paragraph(s) extracted from `Anticipated Results` section of Alternative Splicing from RNA-seq Data miniprotocol website/notebook page  

#### Data Pre-Processing (Step 2)

paragraph(s) extracted from `Anticipated Results` section of Genotype Data Preprocessing miniprotocol website/notebook page  
paragraph(s) extracted from `Anticipated Results` section of Phenotype Data Preprocessing miniprotocol website/notebook page  
paragraph(s) extracted from `Anticipated Results` section of Covariate Data Preprocessing miniprotocol website/notebook page  

#### QTL Association Analysis (Step 3)

paragraph(s) extracted from `Anticipated Results` section of cisQTL Analysis Workflows miniprotocol website/notebook page  
paragraph(s) extracted from `Anticipated Results` section of transQTL Analysis Workflows miniprotocol website/notebook page  

#### Integrative Analysis (Step 4)

## Figures


Nature Protocols Recommendations: 
* To be uploaded separately when submitting
* Title and legend for each figure
* Sans-serif typeface for text on figures and font no smaller than 7 point
* For multipart figures use lower case bold letters
* 300 dpi
* For numbers, separate thousands by commas (1,000)

## Tables


Nature Protocols Recommendations: 
* Submit in Word format
* Title for each table

## Supplementary Information


Nature Protocols Recommendations: One of three categories
* EXTENDED DATA: integral part of the paper that includes data that directly contributes to the main message. Includes up to 10 figures.
* SUPPLEMENTARY INFORMATION: material essential to the background of the study, but not practical to include in the PDF version of the paper. Tables in Excel format. Figures can also be included. 
* SOURCE DATA: Source data for figures. Can include Excel file tables. 

## Author Contributions Statements

## Acknowledgements

## Competing Interests

## References

## Keywords


keywords in the manuscript that identify parts to be read from the website for use in the paper
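
One way to pull a keyword-named section out of a notebook is sketched below. This is a minimal illustration, not the project's actual tooling: `load_notebook` and `extract_section` are hypothetical helper names, and the sketch assumes that each keyword (for example `Description` or `Anticipated Results`) appears as a markdown heading in the notebook and that the `.ipynb` file follows the nbformat 4 JSON layout with a top-level `cells` list.

```python
import json
import re

# Matches markdown ATX headings such as "## Description".
HEADING_RE = re.compile(r"^(#{1,6})\s+(.*?)\s*$")


def load_notebook(path):
    """Read an .ipynb file (plain JSON) into a dict."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)


def markdown_lines(notebook):
    """Yield every line of every markdown cell, in notebook order."""
    for cell in notebook.get("cells", []):
        if cell.get("cell_type") != "markdown":
            continue
        source = cell.get("source", "")
        # nbformat allows source to be a string or a list of strings.
        if isinstance(source, list):
            source = "".join(source)
        yield from source.splitlines()


def extract_section(notebook, keyword):
    """Return the text under the heading named `keyword`.

    Collection stops at the next heading of the same or a higher level;
    deeper sub-headings are kept as part of the section. Returns an
    empty string if the keyword heading is not found.
    """
    collected = []
    level = None  # heading depth of the matched keyword, once found
    for line in markdown_lines(notebook):
        match = HEADING_RE.match(line)
        if match:
            depth, title = len(match.group(1)), match.group(2)
            if level is None:
                if title.lower() == keyword.lower():
                    level = depth
                continue
            if depth <= level:
                break
        if level is not None:
            collected.append(line)
    return "\n".join(collected).strip()
```

A call such as `extract_section(load_notebook("RNA_calling.ipynb"), "Description")` would then return the text for that module, assuming the notebook has a `Description` heading. One known limitation of this sketch is that lines starting with `#` inside fenced code blocks in markdown cells would be misread as headings.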
