i2tutorials

Understanding correlation in statistics

In this tutorial, we are going to discuss about correlation.

What is Correlation?

Correlation is a statistical technique that is used to understand the relationship between two variables.

 Variables can be a quantitative type or a qualitative type.

  1. Quantitative represents numerical data
  2. Qualitative represents categorical data

 We can evaluate the correlation between the two variables by using two different methods.

 1. Graphical method

 2. Mathematical method

 let us try to understand both of these two methods

  1. Graphical method

 we will use the scatter plot which is a graphical method to understand the relationship between two variables by visualizing the data patterns.

 There are three different types of relationships

  1. positive relationship
  2. Negative relationship
  3. No relationship

Positive relationship

When the data points which are plotted on scatter plot are inclined towards the positive coordinates, it can be considered as Positive Relation.

If you can take a look at the below scatter plot holds the data X, Y. 

 The scatter plot clearly shows it is inclined towards the positive direction that a positive relationship.

Negative relationship

When the data points which are plotted on scatter plot are inclined towards the negative coordinates, it can be considered as Negative Relation.

If you can take a look at the below scatterplot which holds the data x, y.

The scatterplot clearly shows it is in coined towards the negative direction which represents as the age of the car increases the Reliability of the car decreases.

From which we can clearly understand the relationship between these two variables is a negative relationship.

No relationship

When the data points which are plotted on scatter plot are completely scattered across the coordinates, it can be considered as No Relation.

 If you can take a look at the below scatter plot that holds the data x, y.

The scatterplot clearly shows it is completely scattered which represents there is no relationship between the two variables.


2. Mathematical method

There are many methods in a mathematical way but you are let me explain to you two important methods which are mentioned below

  1.  Karl Pearson’s correlation coefficient
  2.  Spearman rank correlation coefficient

In mathematical methods we will use a variable that is represented as “r” to understand the degree of variability.

This is generally called as Pearson’s correlation coefficient for the product-moment correlation coefficient.

Here R represents the nature and strength of association.

Characteristics of r :

  1. Karl Pearson’s Correlation coefficient

When the given variables are Quantitative in nature, then we would prefer working with Karl Pearson’s Correlation.

2. Spearman Rank Correlation Coefficient

When the given variables are both of Qualitative Ordered in nature, Otherwise one Quantitative Variable & one Qualitative Ordered Variable in nature, then we would prefer working with Spearman Rank Correlation.

It is a non-parametric measure of correlation.

This procedure makes use of the two sets of ranks that may be assigned to the sample values of X and Y.

Procedure:

Exit mobile version