Here is a bivariate data set.

\begin{tabular}{|r|c|}
\hline
[tex]$x$[/tex] & [tex]$y$[/tex] \\
\hline
23.6 & 7.2 \\
\hline
20.5 & 28.4 \\
\hline
19.8 & 43.5 \\
\hline
28.9 & 61.2 \\
\hline
-1.4 & 29.4 \\
\hline
37.9 & 54.1 \\
\hline
5.2 & -4.6 \\
\hline
21.1 & 59.2 \\
\hline
27.8 & 42.7 \\
\hline
23.8 & 51.9 \\
\hline
34.2 & 89.3 \\
\hline
6.7 & 30.2 \\
\hline
0.1 & 37.7 \\
\hline
2.9 & 50 \\
\hline
\end{tabular}

This data can be downloaded as a *.csv file with this link: Download CSV

Find the correlation coefficient and report it accurate to three decimal places.

[tex]$r=$[/tex] [tex]$\square$[/tex]

What proportion of the variation in [tex]$y$[/tex] can be explained by the variation in the values of [tex]$x$[/tex]? Report answer as a percentage accurate to one decimal place.

[tex]$R^2=$[/tex] [tex]$\square$[/tex] \%



Answer :

To address this problem, we will determine two key statistical measures: the correlation coefficient [tex]\( r \)[/tex] and the coefficient of determination [tex]\( R^2 \)[/tex].

1. Correlation Coefficient [tex]\( r \)[/tex]:
The correlation coefficient [tex]\( r \)[/tex] measures the strength and direction of the linear relationship between two variables [tex]\( x \)[/tex] and [tex]\( y \)[/tex]. It ranges from -1 to 1, where:
- [tex]\( r = 1 \)[/tex] indicates a perfect positive linear relationship,
- [tex]\( r = -1 \)[/tex] indicates a perfect negative linear relationship,
- [tex]\( r = 0 \)[/tex] indicates no linear relationship.

Based on our calculations, the correlation coefficient [tex]\( r \)[/tex] for the given data set is:
[tex]\[ r = 0.519 \][/tex]

2. Coefficient of Determination [tex]\( R^2 \)[/tex]:
The coefficient of determination [tex]\( R^2 \)[/tex] indicates the proportion of the variance in the dependent variable [tex]\( y \)[/tex] that is predictable from the independent variable [tex]\( x \)[/tex]. It is calculated by squaring the correlation coefficient [tex]\( r \)[/tex]:
[tex]\[ R^2 = r^2 \][/tex]

To express [tex]\( R^2 \)[/tex] as a percentage, we multiply by 100:
[tex]\[ R^2 (\%) = r^2 \times 100 \][/tex]

Given [tex]\( r = 0.519 \)[/tex]:
[tex]\[ R^2 \approx (0.519)^2 \times 100 = 0.269 \times 100 = 26.9 \% \][/tex]

In summary, the correlation coefficient [tex]\( r \)[/tex] is [tex]\( 0.519 \)[/tex], and the proportion of the variation in [tex]\( y \)[/tex] that can be explained by the variation in [tex]\( x \)[/tex] is [tex]\( 26.9\% \)[/tex].

To formally answer the questions:

1. The correlation coefficient accurate to three decimal places is:
[tex]\[ r = 0.519 \][/tex]
2. The proportion of the variation in [tex]\( y \)[/tex] explained by the variation in [tex]\( x \)[/tex] accurate to one decimal place is:
[tex]\[ R^2 = 26.9\% \][/tex]