Skip to content

Commit

Permalink
Update 09-normalization.Rmd
Browse files Browse the repository at this point in the history
  • Loading branch information
yufree authored Sep 5, 2023
1 parent 6993970 commit 4f037ec
Showing 1 changed file with 12 additions and 0 deletions.
12 changes: 12 additions & 0 deletions 09-normalization.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,16 @@ bibliography: references.bib

# Peaks normalization

## Batch effects

Batch effects are the variances caused by factor other than the experimental design. We could simply make a linear model for the intensity of one peak:

$$Intensity = Average + Condition + Batch + Error$$

Research is focused on condition contribution part and overall average or random error could be estimated. However, we know little about the batch contribution. Sometimes we could use known variables such as injection order or operators as the batch part. However, in most cases we such variable is unknown. Almost all the batch correction methods are trying to use some estimations to balance or remove the batch effect.

For analytical chemistry, internal standards or pool quality control samples are actually standing for the batch contribution part in the model. However, it's impractical to get all the internal standards when the data is collected untargeted. For methods using internal standards or pool quality control samples, the variations among those samples are usually removed as median, quantile, mean or the ratios. Other ways like quantile regression, centering and scaling based on distribution within samples could be treated as using the stable distribution of peaks intensity to remove batch effects.

## Batch effects classification

Variances among the samples across all the extracted peaks might be affected by factors other than the experiment design. There are three types of those batch effects: Monotone, Block and Mixed.
Expand Down Expand Up @@ -342,6 +352,8 @@ It is a two-step approach via combining the best-performance internal standard c

## Method to validate the normalization

Various methods have been used for batch correction and evaluation. Simulation will ensure groud turth. Difference analysis would be a common method for evaluation. Then we could check whether this peak is true positive or false positive by settings of the simulation. Other methods need statistics or lots of standards to describ the performance of batch correction or normalization results.

## Software

- [BatchCorrMetabolomics](https://github.com/rwehrens/BatchCorrMetabolomics) is for improved batch correction in untargeted MS-based metabolomics
Expand Down

0 comments on commit 4f037ec

Please sign in to comment.