Main areas. Processing
Pages 113
- Home
- Affymetrix
- affymetrix_expression_normalization_with_apt
- Agilent
- Association Analysis
- Association Analysis doc
- Babelomics version
- Babelomics web structure
- Burden test
- Cancer
- CDF
- Changes in this version
- Class comparison. Worked examples and exercises
- Class prediction
- Class prediction. Worked examples and exercises
- Clustering
- Clustering. Worked examples and exercises
- Cross hybridization
- data matrix expression
- Data types
- Define your comparison
- Detailed example of analysis of expression data in Babelomics: from raw data to expression differential and functional profiling
- Differential Expression for arrays
- Differential Expression for RNA Seq
- Dye bias
- Edit
- Edit your data
- example data
- Expression
- Expression array pipeline
- FAQ
- Functional
- Functional Gene Set Network Enrichment
- Functional GO Enrichment
- GAL
- Gene Set Enrichment
- Gene Set Network Enrichment (Network Miner)
- Gene vs annotation
- Genepix
- Genomics
- Genomics doc
- How to cite babelomics
- Id
- Logging in
- Main areas. Cancer
- Main areas. Expression
- Main areas. Functional
- Main areas. Genomics
- Main areas. Processing
- Main areas: Cancer
- Main areas: Expression
- Main areas: Functional
- Main areas: Genomics
- Main areas: Processing
- Network Enrichment (SNOW)
- Other biological data
- Overview and pipelines
- p values adjusted for multiple testing
- PED
- PED_MAP zipped
- Pipelines
- plink.assoc
- plink.assoc.linear
- plink.assoc.logistic
- plink.fisher
- plink.hh
- plink.log
- plink.tdt
- Preprocessing for data matrix
- Preprocessing for microarrays
- Preprocessing for RNA Seq
- Processing
- Ranked
- Requirements
- RNA Seq Normalization
- RNA Seq pipeline
- SDK (Software Development Kit)
- Single Enrichment
- Single Enrichment. Options
- SNPs array pipeline
- Software and databases used
- Technical Info
- The Babelomics Team
- tut_SNP_association
- Tutorial
- Tutorial Affymetrix Expression Microarray Normalization
- Tutorial Agilent One Color Microarray Normalization
- Tutorial Agilent Two Colors Microarray Normalization
- Tutorial Burden test
- Tutorial Class prediction
- Tutorial Clustering
- Tutorial Data matrix preprocessing
- Tutorial Differential Expression for arrays
- Tutorial Differential Expression for RNA Seq
- Tutorial Expression
- Tutorial Expression. Class comparison
- Tutorial Expression. Correlation
- Tutorial Expression. Survival
- Tutorial Functional
- Tutorial Genepix One Color Microarray Normalization
- Tutorial Genepix Two Colors Microarray Normalization
- Tutorial Genomics
- Tutorial OncodriveClust
- Tutorial OncodriveFM
- Tutorial Processing
- Tutorial SNP Association Analysis
- Tutorial SNP stratification
- Upload your data
- VCF 4.0
- VCF file pipeline
- Visualization tools
- Worked examples
- Workflow
- Show 98 more pages…
General
Tutorial
Analysis tools
Worked examples
-
Expression
-
Functional
Clone this wiki locally
##Edit
In order to include further information related to the experimental design of our data set we have to edit our data. This is a necessary step before addressing some of the Babelomics tools such as differential expression or class prediction, where the presence of at least two clearly differentiated classes is mandatory. For specific information about how to edit your data, please visit Edit.
##Microarray Normalization DNA microarray technologies allow for the simultaneous measurement of thousands of genomic features, such as gene expression, copy number variation or SNP variant. The accuracy and reproducibility of microarray measurements has been extensively validated in the past years. Despite that, in the measurements of any microarray set, there are always technological artifacts that may hide the true biological signal. Such possible distortion of the microarray measurements may produce signal effects within each arrays but also across different arrays in the set.
Causes of non biological variation in microarray measurements include, of course, differences in the sample preparation and the hybridization process, but also, [dye bias](Dye bias), [cross-hybridization](Cross hybridization) and scanner differences.
The goal of normalization is to adjust for the effects that are due to variations in the technology rather than the biology.
- [More information.](Preprocessing for microarrays)
RNA-Seq Normalization
RNA sequencing, or RNA-Seq, is becoming more and more popular over the last few years as genome-wide transcriptome profiling method because of its power to detect non-pre-established transcripts and its high reproducibility. However, this kind of data is also sensible to different biases which make that once the counts data matrix has been created, and before addressing any further analysis, a normalization process may be necessary. Babelomics allows us to correct different kinds of biases which could be present in our data, and also suggests the best normalization method for our particular data set.
For normalization of RNA-Seq data destined to differential expression, please see the section Differential Expression for RNA-Seq.
- [More information.](Preprocessing for RNA-Seq)
##Data Matrix Before any meaningful analysis can be done using your genomic data, they will need to go under a thorough cleaning process. Data normalization is the paradigm of such data reshaping processes, but not the only one. Mathematical transformations of your data like taking logarithms or missing data imputation are among them. In general the purpose of this step is to reshape your data into a distribution which will be suitable in further steps of the analysis. Babelomics preprocessing tools allow you doing such data transformations.
- [More information.](Preprocessing for data matrix)
Find the Babelomics suite at http://babelomics.org