Today’s data come from Equity in Athletics Data Analysis and includes information about sports expenditures and revenues for colleges and universities in the United States. This data set was featured in a March 2022 Tidy Tuesday.
We will focus on the 2019 (2019 - 2020 season) expenditures on football for institutions in the NCAA - Division 1 FBS (Football Bowl Subdivision). The variables are :
total_exp_m: Total expenditures on football in the 2019 - 2020 academic year (in millions USD)
enrollment_th: Total student enrollment in the 2019 - 2020 academic year (in thousands)
type: institution type (Public or Private)
nonsense: a created variable (see above) which has nothing to do with expenditure
Regression model
exp_fit <-lm(total_exp_m ~ enrollment_th + type + nonsense, data = football)tidy(exp_fit) |>kable(digits =3)
term
estimate
std.error
statistic
p.value
(Intercept)
17.833
3.523
5.061
0.000
enrollment_th
0.796
0.112
7.095
0.000
typePublic
-13.520
3.178
-4.254
0.000
nonsense
0.298
0.371
0.803
0.423
Confidence interval in R
We can compute the confidence intervals in R easily: