Extreme data refers to data points that fall significantly outside the range of other observations in a dataset. These outliers can skew statistical analyses and distort the interpretation of results. Extreme data can be caused by measurement errors, natural variability, or rare events, and it is important to identify and properly handle these outliers in order to ensure the accuracy and reliability of data analysis.
Chat with our AI personalities
There are three dimensions to Extreme Data - volume, velocity, and variety, - and management of these dimensions is critical when it comes to winning the battle for control of enterprise data amid the conflicting requirements of business intelligence and reporting vs. data mining and statistical analysis. Management of the data is often part of an Information Lifecycle Management strategy (ILM) or Nearline approach.
Extreme numbers in the data as compared the the rest of the data are called OUTLIERS.
when there are extreme values in the data
Such a data point is called an outlier.
It is a value which appears not to fit in with the other data elements.
They are a simple measure of the spread of the data, which is not affected by extreme values.