Best Answer

Examine your data to determine which values are outliers. If they are significant and random (not an apparent group), eliminate them; they will then no longer appear in your boxplot.
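As a rough sketch of this answer, the snippet below flags values with the common 1.5 x IQR rule and drops them. The sample data and the name `drop_outliers` are made up for illustration, the quartiles come from Python's `statistics.quantiles` with its default method, and deciding whether the flagged values really are random (and not a group) is still up to you.

```python
from statistics import quantiles

def drop_outliers(values, k=1.5):
    """Return the values lying inside the fences Q1 - k*IQR and Q3 + k*IQR."""
    q1, _, q3 = quantiles(values, n=4)  # quartiles, default 'exclusive' method
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if low <= v <= high]

data = [12, 14, 14, 15, 16, 17, 18, 19, 20, 95]  # hypothetical sample
cleaned = drop_outliers(data)
print(cleaned)  # the value 95 is no longer present
```

A box plot drawn from `cleaned` recomputes its quartiles, so it is worth checking whether any new points get flagged after the removal.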

Wiki User

∙ 2012-06-22 03:14:41
Q: How do you create a box plot of data that does not have any outliers?
Continue Learning about Math & Arithmetic

What is similar and different between a line plot and a frequency table?

A line plot shows data on a number line, with dots or x's marking the frequency of each value. A frequency table is made by arranging the collected data values in ascending order of magnitude with their corresponding frequencies. Both show the absolute frequency of any given value, and both give you an idea of the shape of the distribution and some intuition about outliers. You can count the dots or x's on a line plot to create a frequency table; the difference is that the table already has the counts done for you. For small data sets, either one will do the same job. But imagine you have 10,000,000 points: you really don't want to count them on a line plot, whereas a frequency table tells you directly how often each value occurs. However, if the points can take on many distinct values, the frequency table will have too many rows to be of much use. A line plot can still give a good visual for a large data set when the range of values is narrow, say 1,000,000 temperature measurements restricted to integer values between 90 and 100.
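To make the comparison concrete, here is a small sketch that builds both views from the same list. The temperatures are invented for illustration, and the text-based "line plot" is only a stand-in for a drawn one.

```python
from collections import Counter

temps = [91, 93, 93, 94, 94, 94, 96, 98, 98]  # hypothetical integer temperatures

# Frequency table: each distinct value with its count, in ascending order.
freq = Counter(temps)
table = sorted(freq.items())

# Line plot: one "x" per observation beside each value on the number line.
# Values with no observations are still listed, which makes gaps visible.
line_plot = [f"{v:>3} | {'x' * freq[v]}" for v in range(min(temps), max(temps) + 1)]

for value, count in table:
    print(value, count)
print("\n".join(line_plot))
```

The table stays five rows long however many measurements there are, while the line plot grows one mark per observation.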

What is a true statement concerning outliers for a data set summarized by a box and whisker plot?

When John Tukey invented the boxplot, he suggested (somewhat arbitrarily) that any data point more than 1.5 times the length of the box (i.e., the distance between the upper and lower quartiles) from the nearest end of the box should be regarded as an outlier. For example, suppose the box length were 2, the lower quartile were 5, and the smallest data point were 1. Then 1.5 * 2 = 3, 5 - 3 = 2, and 1 < 2; in other words, this data point is too far away from the box. Hence, the smallest data point is an outlier.
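The arithmetic in this example can be checked with a few lines of Python. The function name `tukey_fences` is just an illustrative choice; the upper quartile of 7 follows from the lower quartile of 5 and the box length of 2.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return the (lower, upper) cut-offs k box lengths beyond each end of the box."""
    box_length = q3 - q1
    return q1 - k * box_length, q3 + k * box_length

lower, upper = tukey_fences(q1=5, q3=7)  # box length 7 - 5 = 2
smallest = 1

print(lower, upper)      # cut-offs below and above the box
print(smallest < lower)  # True means the smallest point is an outlier
```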

What are the 3 things you can do with coordinates?

Plot straight-line or curved graphs on the Cartesian plane. Plot a line of 'best fit' for any correlation in given data. Solve simultaneous equations by finding the coordinates where their graphs intersect. Perform transformations.

What kind of graph would you use for numerical data?

You could use almost any kind of graph if you label it, but I would stay away from pie graphs. I would use a box and whisker plot.

What do researchers mean by secondary data?

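Data that were originally collected by someone else, or for a different purpose, and are then reused by the researcher, as opposed to primary data gathered first-hand for the study.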

Related questions

What does the whisker in a box-and-whisker plot represent?

The whiskers mark the ends of the range of figures - they are the furthest outliers. * * * * * No. Outliers are not part of the whiskers in a box and whisker plot. The whiskers extend to the minimum and maximum observations EXCLUDING outliers. Outliers, if any, are marked individually, for example with an X.

What is the upper extreme to a box and whisker plot?

The maximum observed value (excluding any outliers).

What is the primary disadvantage of using the range to compare the variability of data sets?

The range is very sensitive to outliers. Indeed, if there are outliers, the range is determined by them alone and says nothing about the other elements of the sample.
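A quick numerical sketch shows the effect. The sample values are invented, and the interquartile range (computed here with `statistics.quantiles` and its default method) is included only as a contrast.

```python
from statistics import quantiles

def value_range(values):
    """Range: largest value minus smallest value."""
    return max(values) - min(values)

def iqr(values):
    """Interquartile range: upper quartile minus lower quartile."""
    q1, _, q3 = quantiles(values, n=4)
    return q3 - q1

sample = [10, 11, 12, 13, 14, 15, 16]  # hypothetical data
with_outlier = sample + [100]          # same data plus one extreme value

print(value_range(sample), value_range(with_outlier))
print(iqr(sample), iqr(with_outlier))
```

One extra value moves the range from 6 to 90, while the interquartile range barely changes.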

How do you work out outliers on a box plot?

Let n be the number of values in your data, sorted in ascending order. The lower quartile is the value at position (n+1)/4, and the upper quartile is the value at position 3(n+1)/4. The interquartile range is IQR = upper quartile - lower quartile. Let O = 1.5 x IQR. Any value lower than (lower quartile - O) or higher than (upper quartile + O) is an outlier, and these are the values to mark as outliers on your box plot.
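Here is a minimal sketch of that procedure, using the (n+1)/4 position rule for the quartiles and the usual cut-offs of 1.5 x IQR below the lower quartile and above the upper quartile. The function names and the scores are hypothetical, and positions that fall between two values are handled by linear interpolation, which is one common convention among several.

```python
def value_at_position(sorted_values, pos):
    """Value at a 1-based position, interpolating between neighbouring values."""
    index = int(pos)                      # whole part of the position
    fraction = pos - index                # how far towards the next value
    low = sorted_values[index - 1]
    high = sorted_values[min(index, len(sorted_values) - 1)]
    return low + fraction * (high - low)

def find_outliers(values):
    """Return the values beyond 1.5 * IQR from the quartiles."""
    data = sorted(values)
    n = len(data)
    if n < 3:
        raise ValueError("need at least 3 values for this position rule")
    q1 = value_at_position(data, (n + 1) / 4)
    q3 = value_at_position(data, 3 * (n + 1) / 4)
    step = 1.5 * (q3 - q1)
    return [v for v in data if v < q1 - step or v > q3 + step]

scores = [2, 21, 23, 24, 25, 26, 27, 29, 30, 31, 60]  # hypothetical data
print(find_outliers(scores))
```

With these 11 scores the quartiles sit at positions 3 and 9 (values 23 and 30), so the cut-offs are 12.5 and 40.5.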

How do you spot an anomalous result in science?

The easiest way is to plot the values on a number line, then look at any outliers and consider whether they may be anomalies.

What are the three ways to describe data in a scatter plot?

You can describe any obvious correlation (such as a positive or negative correlation), any apparent outliers, and the correlation coefficient, which is the "r" on your calculator when you do a regression model. The closer "r" is to either -1 or 1, the stronger the correlation is.
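For readers without a calculator at hand, the "r" value can be computed directly. This is a sketch of the standard Pearson correlation coefficient; the data and the name `pearson_r` are made up for illustration.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

hours = [1, 2, 3, 4, 5]        # hypothetical x values
marks = [52, 55, 61, 64, 70]   # hypothetical y values
r = pearson_r(hours, marks)
print(round(r, 3))  # close to 1, so a strong positive correlation
```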

Which is most resistant measures of central tendency?

The midhinge. This is because it ignores the largest 25 percent and the smallest 25 percent of the data values, which means any outliers present in the data set will be unable to distort it.
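As an illustration, the midhinge is the average of the lower and upper quartiles. The sketch below compares it with the mean on invented data; the quartiles come from `statistics.quantiles` with its default method, so other quartile conventions may give slightly different numbers.

```python
from statistics import mean, quantiles

def midhinge(values):
    """Average of the lower and upper quartiles."""
    q1, _, q3 = quantiles(values, n=4)
    return (q1 + q3) / 2

sample = [10, 11, 12, 13, 14, 15, 16]  # hypothetical data
with_outlier = sample + [100]          # same data plus one extreme value

print(midhinge(sample), midhinge(with_outlier))
print(mean(sample), mean(with_outlier))
```

The extreme value shifts the midhinge only from 13 to 13.5, while the mean jumps from 13 to 23.875.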

What is an outlier in a line-plot?

A value that lies far away from the other values in the data.

What set of data is best represented by a stem and leaf plot?

Ages of people are sorted meaningfully by a stem and leaf plot. More generally, it suits any data set whose values can be split into a leading part (the stem) and a final digit (the leaf) and then sorted. With this type of plot, one can readily see the size of each group of data.
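A small sketch of how such a plot can be built, assuming non-negative whole numbers with the tens as the stem and the units as the leaf. The ages and the name `stem_and_leaf` are made up for illustration.

```python
from collections import defaultdict

def stem_and_leaf(values):
    """Group values by tens digit (stem), keeping units digits as sorted leaves."""
    groups = defaultdict(list)
    for v in sorted(values):
        groups[v // 10].append(v % 10)
    return dict(groups)

ages = [23, 25, 31, 34, 34, 38, 42, 47, 51]  # hypothetical ages
plot = stem_and_leaf(ages)

for stem, leaves in plot.items():
    print(f"{stem} | {' '.join(str(leaf) for leaf in leaves)}")
```

The length of each printed row shows at a glance how many people fall in each decade.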

Does the box plot include any of the actual data values?

At least 2 and up to 5.

How do you draw conclusions from data?

Mostly through statistics, or summaries of the data set (depending on the type of data). There are many different statistical methods used to analyze the many different types of data that come from research studies or experiments. However, if you just want a relatively quick and simple overview of a set of data, then you should follow SOCS: Shape (the shape of the graphed data points), Outliers (any data points that fall outside the realm of "normal"), Center (where the data points are mostly centered), and Spread (the range of the data points). This should give you some immediate conclusions from your data.
