answersLogoWhite

0

The mean is one of those statistics that is more sensitive to outliers, and hence to mistakes in data, than it is to uncontaminated data.

Here's a pseudorandom sample of normally distributed values with mean 10 and variance 1 in sorted order:

8.3, 8.3, 9.0, 9.1, 9.2, 9.3, 9.5, 9.5, 9.7, 9.8, 9.9, 9.9, 10.2, 10.3, 10.4, 10.7, 10.9, 11.1, 11.3, 11.8

(Actually I've rounded them to the nearest 10th for easier reading.)

Their mean is 9.91.

Now let me add one outlier to the sample, say 4.2. Now the mean is 9.64. The original mean was only 0.09 away from the true mean, this one is four times as far away.

Of course it must also be said that one must be extremely careful about discarding data. Sometimes what appears to be an outlier has the most interesting information.

User Avatar

Wiki User

11y ago

What else can I help you with?

Related Questions

Between Mean Median and Mode which are affected most by an outlier?

The mean is affected the most by an outlier.


How are the mean and standard deviation affected by an outlier?

The mean is "pushed" in the direction of the outlier. The standard deviation increases.


Which measure is most affected by an outlier?

mean


Is the mean affected by a low outlier?

Yes.Yes.Yes.Yes.


Which of the following is least affected if an extreme high outlier is added to your data mean median or standard deviation or ALL?

The median is least affected by an extreme outlier. Mean and standard deviation ARE affected by extreme outliers.


How does the outlier affect the mean of the data?

An outlier does affect the mean of the data. How it's affected depends on how many data points there are, how far from the data the outlier is, whether it is greater than the mean (increases mean) or less than the mean (decreases the mean).


How would the outlier affect the mean and median of the data?

An outlier will pull the mean and median towards itself. The extent to which the mean is affected will depend on the number of observations as well as the magnitude of the outlier. The median will change by a half-step.


What would happen if a outlier was removed from the mean?

The answer depends on the nature of the outlier. Removing a very small outlier will increase the mean while removing a large outlier will reduce the mean.


Would a mean be smaller or larger if you leave out an outlier?

Depends on whether the outlier was too small or too large. If the outlier was too small, the mean without the outlier would be larger. Conversely, if the outlier was too large, the mean without the outlier would be smaller.


How do you determine how the outlier affects the mean median mode and range?

Calculate the mean, median, and range with the outlier, and then again without the outlier. Then find the difference. Mode will be unaffected by an outlier.


How does an outlier in a group of numbers affect the mean of the numbers?

The outlier skews the mean towards it.


If an outlier is included with a sample of 50 values all of which are the same what is the effect of the outlier on the mean?

By definition, an outlier will not have the same value as other data points in the dataset. So, the correct question is "What is the effect of an outlier on a dataset's mean." The answer is that the outlier moves the mean away from the value of the other 49 identical values. If the outlier is the "high tail" the mean is moved to a higher value. If the outlier is a "low tail" the mean is moved to a lower value.

Trending Questions
If 1.5 percent of the bolts made by a automotive factory what is the probability in a shipment of 200 that there are 6 defective bolts? In a school, 22% of the students are sixth graders. What is the probability that a randomly chosen student will not be a sixth grader? What fund has a higher one-year performance than a five-year performance? What can you say about the correlation coefficient and the correlation description when the points lie exactly on vertical or horizontal line? What is a 25 digit number called? What is the formula for finding the surface area of a cylinder? What does use any estimation strategy to calculate mean? Does a parameter describe a population or a sample? If a normal distribution has a mean of 44 and a standard deviation of 8 what is the z-score for a value of 50? What does thbt mean? What is on the left side of a column chart? Do pie charts operate on more than one data series at a time? How can you find a least square trend in a equation? Suppose that rather than being just a bar graph the display you see above is a relative frequency bar graph The vertical axis of the graph will be marked off in percents from 0 percent up to 30 pe? Why is it important to establish a cause and effect between the selected known variable and unknown variables? How many times did GTE stock split? Probability word meaning 100 percent likely? Two thirds plus two thirds plus tow thirds is how many cups? What is linearity error? How many ways can a 3-person subcommittee be selected from a committee of 7 people?