Correlation of data means that two different variables are linked in some way. This might be positive correlation, which means one goes up as the other goes up (for instance, people who are heavier tend to be taller) or negative correlation, which means one goes up as the other goes down (for instance, people who are older tend to play Video Games less often). Correlation just means a link. It means that knowing one variable (a person is really tall) is enough to make a guess at the other one (that person is probably also pretty heavy).
Note that there is a very common mistake people make about correlation, and this needs to be addressed. In short, the mistake is "correlation implies causation". It doesn't. If I have data which shows people who volunteer more often tend to be happier, I cannot then say "volunteer. It makes you happy!" because correlation doesn't imply causation - it might be that if you're happy you're more likely to volunteer, and the causation is the other way around. Or it might be that if you're rich, you're both more likely to be happy, and more likely to volunteer, so the data is affected by a different variable entirely.
Chat with our AI personalities
correlation is used when there is metric data and chi square is used when there is categorized data. sayan chakrabortty
The closer the correlation is to 1 or -1, the more linear the data is
No, it is not resistant to changes in data.
a correlation statement is a sentence that says whether the points on a scatterplot have a positive, negative or no correlation.ex. This graph shows a negative correlation, as the number of cows increases (x axis data) the profitability decreases (y axis data).
Correlation cannot accurately describe any type of curve. The correlation of a curve would be a linear approximation rather than an accurate description of the data. Giving a function would more accurately describe data that lies on a curve.