Tutorial Agilent Two Colors Microarray Normalization
Pages 113
- Home
- Affymetrix
- affymetrix_expression_normalization_with_apt
- Agilent
- Association Analysis
- Association Analysis doc
- Babelomics version
- Babelomics web structure
- Burden test
- Cancer
- CDF
- Changes in this version
- Class comparison. Worked examples and exercises
- Class prediction
- Class prediction. Worked examples and exercises
- Clustering
- Clustering. Worked examples and exercises
- Cross hybridization
- data matrix expression
- Data types
- Define your comparison
- Detailed example of analysis of expression data in Babelomics: from raw data to expression differential and functional profiling
- Differential Expression for arrays
- Differential Expression for RNA Seq
- Dye bias
- Edit
- Edit your data
- example data
- Expression
- Expression array pipeline
- FAQ
- Functional
- Functional Gene Set Network Enrichment
- Functional GO Enrichment
- GAL
- Gene Set Enrichment
- Gene Set Network Enrichment (Network Miner)
- Gene vs annotation
- Genepix
- Genomics
- Genomics doc
- How to cite babelomics
- Id
- Logging in
- Main areas. Cancer
- Main areas. Expression
- Main areas. Functional
- Main areas. Genomics
- Main areas. Processing
- Main areas: Cancer
- Main areas: Expression
- Main areas: Functional
- Main areas: Genomics
- Main areas: Processing
- Network Enrichment (SNOW)
- Other biological data
- Overview and pipelines
- p values adjusted for multiple testing
- PED
- PED_MAP zipped
- Pipelines
- plink.assoc
- plink.assoc.linear
- plink.assoc.logistic
- plink.fisher
- plink.hh
- plink.log
- plink.tdt
- Preprocessing for data matrix
- Preprocessing for microarrays
- Preprocessing for RNA Seq
- Processing
- Ranked
- Requirements
- RNA Seq Normalization
- RNA Seq pipeline
- SDK (Software Development Kit)
- Single Enrichment
- Single Enrichment. Options
- SNPs array pipeline
- Software and databases used
- Technical Info
- The Babelomics Team
- tut_SNP_association
- Tutorial
- Tutorial Affymetrix Expression Microarray Normalization
- Tutorial Agilent One Color Microarray Normalization
- Tutorial Agilent Two Colors Microarray Normalization
- Tutorial Burden test
- Tutorial Class prediction
- Tutorial Clustering
- Tutorial Data matrix preprocessing
- Tutorial Differential Expression for arrays
- Tutorial Differential Expression for RNA Seq
- Tutorial Expression
- Tutorial Expression. Class comparison
- Tutorial Expression. Correlation
- Tutorial Expression. Survival
- Tutorial Functional
- Tutorial Genepix One Color Microarray Normalization
- Tutorial Genepix Two Colors Microarray Normalization
- Tutorial Genomics
- Tutorial OncodriveClust
- Tutorial OncodriveFM
- Tutorial Processing
- Tutorial SNP Association Analysis
- Tutorial SNP stratification
- Upload your data
- VCF 4.0
- VCF file pipeline
- Visualization tools
- Worked examples
- Workflow
- Show 98 more pages…
General
Tutorial
Analysis tools
Worked examples
-
Expression
-
Functional
Clone this wiki locally
INPUT
#### STEPS [1. Select your data](tutorial-agilent-two-colors-microarray-normalization#select-your-data)
[2. Select analysis](tutorial-agilent-two-colors-microarray-normalization#select-analysis)
[3. Fill information job](tutorial-agilent-two-colors-microarray-normalization#fill-information-job)
[4. Press *Launch job* button](tutorial-agilent-two-colors-microarray-normalization#press-launch-job-button)
#### OUTPUT - [Input parameters](tutorial-agilent-two-colors-microarray-normalization#input-parameters) - [Output results](tutorial-agilent-two-colors-microarray-normalization#output-results)
INPUT
#####Input data Input data should be a matrix upload as the data type Microarray Two channels Agilent data type. See data types [here](Data Types).
Using Babelomics you can process single channel Agilent arrays.
In this section of the form you can select the dataset you want to normalize. There are two options to upload your data:
- from this form: Select your data / File browser / Upload
- previously using the Upload Menu in Babelomics and tagged it with the Microarray Two channels Agilent data type. Datasets that are not tagged as Microarray Two channels Agilent data type cannot be normalized using the Agilent Normalization Tool.
#####Online example Here you can load a dataset from our server. You can use them to run this example and see how the tool works. Click on the links to load the data: agilent two-channels normalization example.
### STEPS #####Select your data First step is to select your data to analyze.
#####Select analysis
Normalization of Agilent two color microarrays is a tree stage process:
- Background correction: for each channel or color measured in the array, this step corrects background effects so all features are measured over the same starting intensity in that channel scale.
- Within array normalization: the aim of this step is to correct for artifacts produced by differences in the measurements of the two signal channels. Also in this step the two signals are summarized into a unique one by a log ratio transformation.
- Between array scaling: in this step measurements from all microarray are rescaled into a unique final distribution so that comparisons can be done between them.
Flagged Features
Agilent Feature Extraction Software has some options for flagging abnormal features. If you have created this quality flags when scanning your arrays, you may choose to exclude them when fitting the normalization parameters. Doing so, this bad features will not influence your data. The first check box in the Analysis section lets you set this option.
Even if you did not use flagged features to fit the normalization parameters, you may still want to get a normalized value for them. In the second check box in the Analysis section, Babelomics lets you choose if you want to get normalized values for the flagged features or you rather setting them as missing values.
See [microarray normalization methods](Main areas. Processing) section for details on the algorithms.
#####Fill information job
- Select the output folder
- Choose a job name
- Specify a description for the job if desired.
#####Press Launch job button
Press launch button and wait until the results is finished. A normal job may last approximately few minutes but the time may vary depending on the size of data. See the state of your job by clicking the jobs button in the top right at the panel menu. A box will appear at the right of the web browser with all your jobs. When the analysis is finished, you will see the label "Ready". Then, click on it and you will be redirected to the results page.

### OUTPUT #### Input parameters In this section you will find a reminder of the parameters or settings you have used to run the analysis.
Output results
In this section of the results page you will find normalized data files.
After Agilent two colors normalization procedures three results files are created. These files are text files tab delimited.
Normalized Data File: the M-values file
Contains the processed intensity measurements for all arrays in the dataset and for all biological features in the arrays. In the two color arrays, with some additional transformation, this is the log ratio of the two channel intensityes: the M-vaues.
Babelomics tries to identify non biological features and to exclude them from all steps of the analysis. If we succeeded in finding out the non biological features of your arrays they will not be represented in the normalized data. If we cannot distinguish non biological spots, all the features will be used in the normalization and reported in the normalized data file.
In this file:
- Arrays or samples are arranged in columns.
- Genes, spots or features are set in rows.
- Some header lines may be included at the beginning of the file. They will all start by #.
- One of those header lines, starting by #NAMES will contain the names of the arrays in your dataset.
- The first column contains feature identifiers. For Agilent one color arrays, Babelomics tries to figure out which is the best feature id among those provided within the raw data files. Some other feature ids will we reported in the Feature Data File.
Feature Data File:
The rows in this file match the rows in the normalized data file. The columns, contain some additional array design information about the features in the data you have normalized.
Generally reported feature characteristics are:
- Any feature ID we find in the raw data files.
- Any gene or transcript ID we find in the raw data files.
- Row and Column position of the features within the array layout.
- Chromosomal position of the target genes or transcripts of each feature in the array (when this information is available within the raw data)
A-values File
For each spot in the two channel arrays, its A-value is defined as the mean of the two color intensities in the log2 scale.
This values are reported by Babelomics in a file which lay out and organization are the same as the M-values ones.
Plots
In this section you will find some plots representing the normalized data. You can use them to asses how good the normalization process performed in your dataset. Generally you may compare different normalization options between them using this plots. Also you may want to compare the plots of the normalized data to those of the raw data and check if noise levels are lowered down after normalization.
- Box-Plots representing the normalized intensity distribution for each of the samples (arrays). In general you want overall distributions across arrays to be very similar after normalization. And as normalized measurements are scaled log ratios you will expect your data to be centered around zero.
- MA-Plots representing the normalized intensity distribution of each sample against a consensus mean sample. A "LOESS":http://en.wikipedia.org/wiki/LOESS line fitting the trend between M and A values is drawn in red. After normalization you expect no trend in the LOESS line, that is, you expect it to be as close as possible tho the horizontal 0 axis.
- Pseudo Image Plots: represent the normalized intensity of each spot within the array coordinates, creating a pseudo photo of the normalized array. High intensities are represented in red colors, low intensities are represented in blue colors. Ideally you will see an evenly colored image, meaning that, after normalization, there is no spatial effect in the array measurements. M-values ie. log ratios of the two channel intensities are represented in this plots.
| Go back to the Processing page |
|---|
| Go back to the Home page |
|---|
Find the Babelomics suite at http://babelomics.org