T
thunk
In statistics, an outlier[1] is an observation that is numerically
distant from the rest of the data. Grubbs[2] defined an outlier as:
An outlying observation, or outlier, is one that appears to
deviate markedly from other members of the sample in which it occurs.
Outliers can occur by chance in any distribution, but they are often
indicative either of **MEASUREMENT ERROR** or that the population has
a heavy-tailed distribution. In the former case one wishes to discard
them or use statistics that are robust to outliers, while in the
latter case they indicate that the distribution has high kurtosis and
that one should be very cautious in using tool or intuitions that
assume a normal distribution. A frequent cause of outliers is a
mixture of two distributions, which may be two distinct sub-
populations, or may indicate 'correct trial' versus 'measurement
error'; this is modeled by a mixture mod
http://en.wikipedia.org/wiki/Outlier
distant from the rest of the data. Grubbs[2] defined an outlier as:
An outlying observation, or outlier, is one that appears to
deviate markedly from other members of the sample in which it occurs.
Outliers can occur by chance in any distribution, but they are often
indicative either of **MEASUREMENT ERROR** or that the population has
a heavy-tailed distribution. In the former case one wishes to discard
them or use statistics that are robust to outliers, while in the
latter case they indicate that the distribution has high kurtosis and
that one should be very cautious in using tool or intuitions that
assume a normal distribution. A frequent cause of outliers is a
mixture of two distributions, which may be two distinct sub-
populations, or may indicate 'correct trial' versus 'measurement
error'; this is modeled by a mixture mod
http://en.wikipedia.org/wiki/Outlier