Skip to content

Commit b37ff56

Browse files
authored
Update filtering_vcf.md
1 parent abd06ac commit b37ff56

File tree

1 file changed

+16
-16
lines changed

1 file changed

+16
-16
lines changed

docs/src/filtering_vcf.md

Lines changed: 16 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -1,14 +1,14 @@
1-
#Variant Record and Sample Selection
1+
# Filtering your VCF file: Variant Record and Sample Selection
22

3-
##General Note:s: Extracting and Reshaping VCF Data
3+
## General Notes: Extracting and Reshaping VCF Data
44

55
VIVA supports flexible filters for selecting variant records for visualization.
66

77
Additionally, the tool supports selecting and grouping samples by common traits for visualization.
88

99
Grouping samples is particularly useful for exploring phenotypic and genotypic associations, displaying differential distribution of variants between groups of samples, and identifying batch effect on coverage between groups of samples in variant analysis experiments.
1010

11-
##Choose a VCF file to Visualize *REQUIRED*
11+
## Choose a VCF file to Visualize *REQUIRED*
1212

1313
Specify filename of VCF file.
1414

@@ -19,16 +19,16 @@ Specify filename of VCF file.
1919
*Note*: This is the *only required argument* for VIVA. If you run with none of the other options, default options will be used. These default options are described in detail below.
2020

2121
```
22-
julia viva -f example.vcf [OPTIONS]
22+
viva -f example.vcf [OPTIONS]
2323
```
2424

25-
##Selecting Variant Records
25+
## Selecting Variant Records
2626

2727
VIVA offers three filters for selecting variant records to visualize from VCF files.
2828

2929
It is recommended to use one or a combination of these filters to reduce the number of variant records extracted from the VCF for plotting. This is recommended for reasons related to technical limitations and practical visual interpretation. The number of variant records able to be plotted is limited by both the user's available computing resources as well as the number of pixels in their display for displaying data points. While it is possible to visualize many thousands of variant records at one time with VIVA, **we recommend visualizing fewer than 2000 variants** so that all data points can be displayed that your computing resources are not overburdened. However, VIVA is capable of extracting and plotting hundreds of thousands of data points from VCF files.
3030

31-
###Genomic range
31+
### Genomic range
3232

3333
Select rows within a given genomic range.
3434

@@ -39,10 +39,10 @@ Select rows within a given genomic range.
3939
*Note*: To visualize genomic ranges within multiple chromosomes, you may create a batch script to run VIVA multiple times using different genomic ranges.
4040

4141
```
42-
julia viva -f example.vcf -r chr1:20000-30000000
42+
viva -f example.vcf -r chr1:20000-30000000
4343
```
4444

45-
###Variant list
45+
### Variant list
4646

4747
Select variants matching list of chromosomal positions.
4848

@@ -51,10 +51,10 @@ Select variants matching list of chromosomal positions.
5151
*arguments*: Provide filename of text file formatted with two columns in .csv format as an argument. There should be a header row with "chr" and "start" in row 1 of column 1 and 2 respectively. Column 1 should contain chromosome number in the format "chr1" or "1" and should match the syntax of the VCF file (that is, if the VCF file lists chromosome numbers in the form "chrX", use "chrX" in your positions list, not "X") You can find an example of this file [here]("[here]("https://github.com/compbiocore/VariantVisualization.jl/tree/master/test/test_files/positions_list.csv")")
5252

5353
```
54-
julia viva -f example.vcf -l "example_positions_list.txt"
54+
viva -f example.vcf -l "example_positions_list.txt"
5555
```
5656

57-
###Pass filter
57+
### Pass filter
5858

5959
Select rows that passed filters originally set during variant calling and VCF file generation. Selects records with "PASS" in the FILTER column of the VCF file. This filter alone is often not stringent enough to reduce the number of variants for plotting and visual interpretation. For analyzing large VCF files with many "passed" filter records, use genomic range,
6060

@@ -63,12 +63,12 @@ Select rows that passed filters originally set during variant calling and VCF fi
6363
*arguments*: This flag is a positional argument and does not take options.
6464

6565
```
66-
julia viva -f example.vcf -p
66+
viva -f example.vcf -p
6767
```
6868

69-
##Selecting and Grouping Samples
69+
## Selecting and Grouping Samples
7070

71-
###Group samples by sample metadata traits
71+
### Group samples by sample metadata traits
7272

7373
Group sample columns using your sample metadata and visualize metadata attributes in a colorbar above heatmap visualizations.
7474

@@ -94,10 +94,10 @@ Metadata traits are stored as rownames in the first column of the table and shou
9494
This matrix should be saved as a comma delimited .csv file. Microsoft Excel is commonly used for this purpose, but sometimes creates extra delimiter characters in the output file that produce an error in VIVA. You can check to make sure the .csv file was saved properly by opening the file with a text editor such as BBEdit to inspect for and delete empty values or extra delimiter characters at the end of each row.
9595

9696
```
97-
julia viva -f example.vcf -g sample_metadata_matrix.csv case,control
97+
viva -f example.vcf -g sample_metadata_matrix.csv case,control
9898
```
9999

100-
###Select samples to include in visualization
100+
### Select samples to include in visualization
101101

102102
Select specific samples to be extracted from the VCF for visualization.
103103

@@ -108,5 +108,5 @@ Select specific samples to be extracted from the VCF for visualization.
108108
*Note*: To use the sample selection feature in combination with the sample grouping feature, the sample metadata matrix must only contain the sample ids to be selected.
109109

110110
```
111-
julia viva -f example.vcf --select_samples select_samples_list.txt
111+
viva -f example.vcf --select_samples select_samples_list.txt
112112
```

0 commit comments

Comments
 (0)