Transcript

NARRATOR
Hello. In this video, we will discuss what we mean by bivariate data. Well, just by the word bivariate data, we know that we are dealing with some kind of data. But what kind of data is the question. To better understand the word bivariate, we need to break it down in two. The prefix bi means two or twice, and variant means a quantity that can take the value of any member of a particular set, a variable.
So bivariate data are data that have two variables with different values. With bivariate data, we are comparing two different sets of variables. The one variable is the dependent variable, and the other variable is the independent variable. Let's look at an example.
As you can see, this table has three columns. The columns are labeled as days, temperature, and the number of Pepsi cans sold. So this table is showing the data relating to the number of Pepsi cans sold in the different days of a week. The two sets of variant data we have here are the temperature, and the number of Pepsi cans sold.
The temperature on this table is the independent variable, and the number of Pepsi cans is the dependent variable, because the number of Pepsi cans sold depends on the increase or decrease in temperature. As you can see in this table, on the day the temperature was 12 degrees celcius, the number of Pepsi cans sold was 12. When the temperature increased from 12 to 18 degrees celcius, the sale of Pepsi cans also increased.
Now, let's have a look at how these two different sets of data can be compared. We can plot this data into a graph, and the scatter plot is one of the best graphs for representing bivariate data. On this scatter plot here, we have the x-axis, which is the horizontal axis, representing the temperature, which is our independent variable. And on the y-axis, which is the vertical axis, we have the dependent variable, which is the number of Pepsi cans sold.
So from the graph, you can see that with the increase in temperature, we also have an increase in the number of Pepsi cans sold. In other words, this graph shows a positive correlation between the two variables because with the increase in temperature, the sale of Pepsi cans has also increased.
There can be cases where there is no positive correlation, as with the example here, where the increase in temperature has not seen an increase in the number of packets of biscuits sold. This is what we call a negative correlation.
In this video, you learned what bivariate data are, and that they are best graphed in a scatter plot to show either a positive or negative correlation.