When there aren't extreme values (outliers)
High sensetivity to outliers (i.e. Extreme observation)
They are called extreme values or outliers.
the interquartile range is not sensitive to outliers.
Variance is a measure of "relative to the mean, how far away does the other data fall" - it is a measure of dispersion. A high variance would indicate that your data is very much spread out over a large area (random), whereas a low variance would indicate that all your data is very similar.Standard deviation (the square root of the variance) is a measure of "on average, how far away does the data fall from the mean". It can be interpreted in a similar way to the variance, but since it is square rooted, it is less susceptible to outliers.
Extreme outliers can greatly distort statistical measures such as the mean and standard deviation, making them less representative of the data. They can also impact the accuracy of predictive models by leading to overfitting. In some cases, outliers may signal data quality issues or the presence of unexpected patterns in the data that warrant further investigation.
They would both increase.
When there aren't extreme values (outliers)
Outliers pull the mean in the direction of the outlier.
Outliers are observations that are unusually large or unusually small. There is no universally agreed definition but values smaller than Q1 - 1.5*IQR or larger than Q3 + 1.5IQR are normally considered outliers. Q1 and Q3 are the lower and upper quartiles and Q3-Q1 is the inter quartile range, IQR. Outliers distort the mean but cannot affect the median. If it distorts the median, then most of the data are rubbish and the data set should be examined thoroughly. Outliers will distort measures of dispersion, and higher moments, such as the variance, standard deviation, skewness, kurtosis etc but again, will not affect the IQR except in very extreme conditions.
Extreme numbers in the data as compared the the rest of the data are called OUTLIERS.
High sensetivity to outliers (i.e. Extreme observation)
They are called extreme values or outliers.
the interquartile range is not sensitive to outliers.
the variance
THe maximum observed (excluding any outliers).
Variance is a measure of "relative to the mean, how far away does the other data fall" - it is a measure of dispersion. A high variance would indicate that your data is very much spread out over a large area (random), whereas a low variance would indicate that all your data is very similar.Standard deviation (the square root of the variance) is a measure of "on average, how far away does the data fall from the mean". It can be interpreted in a similar way to the variance, but since it is square rooted, it is less susceptible to outliers.