http://www.graphpad.com/quickcalcs/Grubbs1.cfm

Outliers make statistical analyses difficult. This calculator performs Grubbs' test, also called the ESD method (extreme studentized deviate), to determine whether one of the values in the list you enter is a signficant outlier from the rest. Learn more about the principles of outlier detection and exactly how this test works.

**Detecting Outliers**

Outliers make statistical analyses difficult.

When analyzing data, you'll sometimes find that one value is far from the others. Such a value is called an "outlier", a term that is usually not defined rigorously. When you encounter an outlier, you may be tempted to delete it from the analyses. First, ask yourself these questions:

* Was the value entered into the computer correctly? If there was an error in data entry, fix it.

* Were there any experimental problems with that value? For example, if you noted that one tube looked funny, you have justification to exclude the value resulting from that tube without needing to perform any calculations.

* Is the outlier caused by biological diversity? If each value comes from a different person or animal, the outlier may be a correct value. It is an outlier not because of an experimental mistake, but rather because that individual may be different from the others. This may be the most exciting finding in your data!

After answering no to those three questions, you have to decide what to do with the outlier. There are two possibilities.

* One possibility is that the outlier was due to chance. In this case, you should keep the value in your analyses. The value came from the same population as the other values, so should be included.

* The other possibility is that the outlier was due to a mistake - bad pipetting, voltage spike, holes in filters, etc. Since including an erroneous value in your analyses will give invalid results, you should remove it. In other words, the value comes from a different population than the other and is misleading.

The problem, of course, is that you can never be sure which of these possibilities is correct.

Clearly, no mathematical calculation will tell you for sure whether the outlier came from the same or different population than the others. But statistical calculations can answer this question: If the values really were all sampled from a Gaussian distribution, what is the chance that you'd find one value as far from the others as you observed? If this probability is small, then you will conclude that the outlier is likely to be an erroneous value, and you have justification to exclude it from your analyses.

Statisticians have devised several methods for detecting outliers. All the methods first quantify how far the outlier is from the other values. This can be the difference between the outlier and the mean of all points, the difference between the outlier and the mean of the remaining values, or the difference between the outlier and the next closest value. Next, standardize this value by dividing by some measure of scatter, such as the SD of all values, the SD of the remaining values, or the range of the data. Finally, compute a P value answering this question: If all the values were really sampled from a Gaussian population, what is the chance of randomly obtaining an outlier so far from the other values? If the P value is small, you conclude that the deviation of the outlier from the other values is statistically significant.

The most that Grubbs' test (or any outlier test) can do is tell you that a value is unlikely to have come from the same Gaussian population as the other values in the group. You then need to decide what to do with that value. I would recommend removing significant outliers from your calculations in situations where experimental mistakes are common, so long as biological variability is not a possibility and you document your decision. Others feel that you should never remove an outlier unless you noticed an experimental problem.

- 作者：cail
- 版权声明：署名-非商业性使用-禁止演绎 CC BY-NC-ND 3.0
- 原文网址：http://blog.cail.cn/?p=389

**Previous:**
Computer Retrieval of Information on Scientific Projects
**Next:**
Happy Chinese New Year

### blog by cail

- » the Paper Link - my latest Creation for PubMed users
- » How to use ImageJ to analyze images?
- » 2shRNA - design oligos for RNAi
- » Play background music
- » about this blog
- » about me

### New in 'Theory'

- A polarized TIRFM by using a specific dye DiI
- A clustering-outgrowth model for filopodia formation, based on study using Xenopus egg extracts and liposome
- Xenogeneic organ complementation is achievable! What's next? Human-pig chimera?
- Septins, planar cell polarity, cilia, convergent extension ... Bardet-Biedl and Meckel-Gruber syndromes
- Early-passage iPSCs retain cell-of-origin epigenetic memory, which needs continuous passaging to get into Limbo
- Right place and right time: small molecules reprogram cell fate of worm germ cells
- Caspase cleavage-mediated substrate change: switching function of Dicer between RNAi pathway and apoptosis pathway

### Hot in 'Theory'

- bacteria dividing without a wall - no need for FtsZ mediated constriction - 57,979 views
- Cdc42 targets aPKC to adherens junctions, where aPKC limits Rho activity to limit actomyosin contractility - 32,586 views
- Peter Jackson talks about BBSome, Rab8, ciliary membrane biogenesis and proteomics analysis on ciliopathies - 27,205 views
- Cells memorize their tissue origin during axolotl limb regeneration - the blastema is a heterogeneous collection of restricted progenitor cells - 20,080 views
- Intersection from prediction programs is demonstrated to be an important regulator of cancer metastasis - 16,039 views

## Leave a Reply

You must be logged in to post a comment.