Paper 4, Section I, J

Suppose you have a data frame with variables response, covar1, and covar2. You run the following commands in R.

$\begin{array}{lrrrr}\text{covar2} & 0.3755 & 2.5978 & 0.145 & 0.886\end{array}$

...

(a) Consider the following three scenarios:

(i) All the output you have is the abbreviated output of summary(model) above.

(ii) You have the abbreviated output of summary(model) above together with

Residual standard error: $0.8097$ on 47 degrees of freedom

Multiple R-squared: $0.8126$ , Adjusted R-squared: $0.8046$

F-statistic: $101.9$ on 2 and 47 DF, p-value: $<2.2\mathrm{e}{-}16$

(iii) You have the abbreviated output of summary(model) above together with

Residual standard error: $0.9184$ on 47 degrees of freedom

Multiple R-squared: $0.000712$ , Adjusted R-squared: $-0.04181$

F-statistic: $0.01674$ on 2 and 47 DF, p-value: $0.9834$

What conclusion can you draw about which variables explain the response in each of the three scenarios? Explain.
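
As a consistency check on the quoted summaries (a sketch, not part of the original question), the F-statistic can be recovered from the multiple R-squared. This assumes the standard identity for a linear model with an intercept, here with $p = 2$ covariates and $n - p - 1 = 47$ residual degrees of freedom:

$$
F = \frac{R^2 / p}{(1 - R^2)/(n - p - 1)}, \qquad
F_{\text{(ii)}} = \frac{0.8126 / 2}{0.1874 / 47} \approx 101.9, \qquad
F_{\text{(iii)}} = \frac{0.000712 / 2}{0.999288 / 47} \approx 0.0167 .
$$

Both values agree with the printed F-statistics up to rounding, so in each scenario the F-statistic and the R-squared describe the same overall fit.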

(b) Assume now that you have the abbreviated output of summary(model) above together with

anova(lm(response ~ 1), lm(response ~ covar1), model)

What are the values of the entries with a question mark? [You may express your answers as arithmetic expressions if necessary.]