Commit 8dd7556

Add changes from final hand edits to hard copy. Also improve preface for bookdown version.
1 parent e6ebd56 commit 8dd7556

35 files changed: +3019 additions, -2791 deletions

01-Introduction.Rmd (1 addition, 1 deletion)

@@ -663,7 +663,7 @@ This application involves both non-normal data (number of stops by ethnic group
 
 1. **Kentucky Derby.** The next set of questions is related to the Kentucky Derby case study from this chapter.
 
-a. Discuss the pros and cons of using side-by-side boxplots vs. stacked histograms to illustrate the relationships between year and track condition in Figure \@ref(fig:bivariate).
+a. Discuss the pros and cons of using side-by-side boxplots vs. stacked histograms to illustrate the relationship between year and track condition in Figure \@ref(fig:bivariate).
 b. Why is a scatterplot more informative than a correlation coefficient to describe the relationship between speed of the winning horse and year in Figure \@ref(fig:bivariate).
 c. How might you incorporate a fourth variable, say number of starters, into Figure \@ref(fig:codeds)?
 d. Explain why $\epsilon_i$ in Equation \@ref(eq:model1) measures the vertical distance from a data point to the regression line.

02-Beyond-Most-Least-Squares.Rmd (7 additions, 7 deletions)

@@ -267,7 +267,7 @@ Lik.f(nBoys = 30, nGirls = 20, nGrid = 50)
 # more precise MLE for p_B based on finer grid (more points)
 Lik.f(nBoys = 30, nGirls = 20, nGrid = 1000)
 
-## Another approach: using R's optimize command
+## Another approach: using R's optimize function
 ## Note that the log-likelihood is optimized here
 oLik.f <- function(pb){
 return(30*log(pb) + 20*log(1-pb))

@@ -421,9 +421,9 @@ Numfams <- c(930,951,582,666,666,530,186,177,173,
 148,151,125,182,159)
 Numchild <- c(930,951,1164,1332,1332,1060,558,531,
 519,444,453,375,546,477)
-Malesfemales <- c("97 boys to 100 girls"," ",
-"104 boys to 100 girls"," "," "," ",
-"104 boys to 100 girls"," "," "," "," "," "," "," ")
+Malesfemales <- c("97:100"," ",
+"104:100"," "," "," ",
+"104:100"," "," "," "," "," "," "," ")
 PB <- c("0.494", " ","0.511"," "," "," ","0.510"," "," ",
 " "," "," "," "," ")
 ```

@@ -708,7 +708,7 @@ We have convincing evidence that the Sex Conditional Model provides a significan
 
 *Note: *You may notice that the LRT is similar in spirit to the extra-sum-of-squares F-test used in linear regression. Recall that the extra-sum-of-squares F-test involves comparing two nested models. When the smaller model is true, the F-ratio follows an F-distribution which on average is 1.0. A large, unusual F-ratio provides evidence that the larger model provides a significant improvement.
 
-*Also note: * It might have been more logical to start by using Likelihood Ratio Test to determine whether the probability of having a boy differs significantly from 0.5. We leave this as an exercise.
+*Also note: * It might have been more logical to start by using a Likelihood Ratio Test to determine whether the probability of having a boy differs significantly from 0.5. We leave this as an exercise.
 
 ## Model 3: Stopping Rule Model (waiting for a boy)
 

@@ -763,8 +763,8 @@ Cont <- c("$\\bstop$",
 table9chp2 <- data.frame(Famcomp2, Numfamis3, Lik, Cont)
 colnames(table9chp2) <- c("Family Composition",
 "Num. families",
-" Likelihood",
-"Contribution")
+"Likelihood Contribution",
+" ")
 kable(table9chp2, booktabs = T, escape = F,
 caption="Likelihood contributions for NLSY families in Model 3: Waiting for a boy.")%>%
 column_spec(1, width = "3cm") %>%
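As an aside, the log-likelihood that `optimize` maximizes in the first hunk above has a closed-form optimum. A minimal, self-contained sketch of that computation (the function body and counts come from the hunk; the search interval is our choice):

```r
# Log-likelihood for P(boy) given 30 boys and 20 girls (constants dropped)
oLik.f <- function(pb){
  return(30*log(pb) + 20*log(1-pb))
}

# Maximize over (0, 1); the closed-form MLE is 30/(30 + 20) = 0.6
mle <- optimize(oLik.f, interval = c(0, 1), maximum = TRUE)
mle$maximum  # approximately 0.6
```

Calculus agrees: setting the derivative 30/p - 20/(1-p) to zero yields p = 0.6.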

03-Distribution-Theory.Rmd (2 additions, 2 deletions)

@@ -351,7 +351,7 @@ Suppose we have a Poisson process with rate $\lambda$, and we wish to model the
 f(y) = \lambda e^{-\lambda y} \quad \textrm{for} \quad y > 0,
 (\#eq:expRV)
 \end{equation}
-where $\E(Y) = 1/\lambda$, $\SD(Y) = 1/\lambda$. Figure \@ref(fig:multExp) displays three exponential distributions with different $\lambda$ values. As $\lambda$ increases, $\E(Y)$ tends towards 0, and distributions "die off" quicker.
+where $\E(Y) = 1/\lambda$ and $\SD(Y) = 1/\lambda$. Figure \@ref(fig:multExp) displays three exponential distributions with different $\lambda$ values. As $\lambda$ increases, $\E(Y)$ tends towards 0, and distributions "die off" quicker.
 
 (ref:multExp) Exponential distributions with $\lambda = 0.5, 1,$ and $5$.
 

@@ -586,7 +586,7 @@ In this course, we encounter $\chi^2$ distributions \index{chi-square distributi
 
 In general, $\chi^2$ distributions with $k$ degrees of freedom are right skewed with a mean $k$ and standard deviation $\sqrt{2k}$. Figure \@ref(fig:multChisq) displays chi-square distributions with different values of $k$.
 
-The $\chi^2$ distribution is a special case of gamma distributions. Specifically, a $\chi^2$ distribution with $k$ degrees of freedom can be expressed as a gamma distribution with $\lambda = 1/2$ and $r = k/2$.
+The $\chi^2$ distribution is a special case of a gamma distribution. Specifically, a $\chi^2$ distribution with $k$ degrees of freedom can be expressed as a gamma distribution with $\lambda = 1/2$ and $r = k/2$.
 
 (ref:multChisq) $\chi^2$ distributions with 1, 3, and 7 degrees of freedom..
 
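The chi-square/gamma relationship corrected in the second hunk is easy to spot-check numerically. A small sketch, assuming base R's `dchisq` and `dgamma` (where the `rate` argument plays the role of $\lambda$ and `shape` the role of $r$ from the hunk):

```r
# A chi-square density with k degrees of freedom matches a gamma density
# with rate lambda = 1/2 and shape r = k/2, at any y > 0
k <- 7
y <- seq(0.5, 20, by = 0.5)
all.equal(dchisq(y, df = k),
          dgamma(y, shape = k/2, rate = 1/2))  # TRUE
```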

04-Poisson-Regression.Rmd (1 addition, 1 deletion)

@@ -732,7 +732,7 @@ table4ch4 <- c.data %>%
 kable(table4ch4, booktabs=T, caption = 'The mean and variance of the violent crime rate by region and type of institution.')
 ```
 
-```{r, boxtyperegion, fig.align="center",out.width="60%", fig.cap='Boxplot of violent crime rate by region and type of institution.',echo=FALSE, warning=FALSE, message=FALSE}
+```{r, boxtyperegion, fig.align="center",out.width="60%", fig.cap='Boxplot of violent crime rate by region and type of institution (colleges (C) on the left, and universities (U) on the right).',echo=FALSE, warning=FALSE, message=FALSE}
 #Insert boxplot without the outlier and combining S and SE
 ggplot(c.data, aes(x = region, y = nvrate, fill = type)) +
 geom_boxplot() +

06-Logistic-Regression.Rmd (4 additions, 4 deletions)

@@ -194,7 +194,7 @@ p_0=\frac{e^{\beta_0}}{1+e^{\beta_0}}
 
 We use likelihood methods to estimate $\beta_0$ and $\beta_1$. As we had done in Chapter \@ref(ch-beyondmost), we can write the likelihood for this example in the following form:
 
-\[\Lik(p_1, p_0) = {28 \choose 22}p_1^{22}(1-p_1)^{2}
+\[\Lik(p_1, p_0) = {24 \choose 22}p_1^{22}(1-p_1)^{2}
 {180 \choose 141}p_0^{141}(1-p_0)^{39}\]
 
 Our interest centers on estimating $\hat{\beta_0}$ and $\hat{\beta_1}$, not $p_1$ or $p_0$. So we replace $p_1$ in the likelihood with an expression for $p_1$ in terms of $\beta_0$ and $\beta_1$ as in Equation \@ref(eq:pBehindform). Similarly, $p_0$ in Equation \@ref(eq:pNotBehindform) involves only $\beta_0$. After removing constants, the new likelihood looks like:

@@ -536,7 +536,7 @@ A deviance residual can be calculated for each observation using:
 
 When the number of trials is large for all of the observations and the models are appropriate, both sets of residuals should follow a standard normal distribution.
 
-The sum of the individual deviance residuals is referred to as the **deviance** or **residual deviance**. \index{residual deviance} The residual deviance is used to assess the model. As the name suggests, a model with a small deviance is preferred. In the case of binomial regression, when the denominators, $m_i$, are large and a model fits, the residual deviance follows a $\chi^2$ distribution with $n-p$ degrees of freedom (the residual degrees of freedom). Thus for a good fitting model the residual deviance should be approximately equal to its corresponding degrees of freedom. When binomial data meets these conditions, the deviance can be used for a goodness-of-fit test. The p-value for lack-of-fit is the proportion of values from a $\chi_{n-p}^2$ that are greater than the observed residual deviance.
+The sum of the individual deviance residuals is referred to as the **deviance** or **residual deviance**. \index{residual deviance} The residual deviance is used to assess the model. As the name suggests, a model with a small deviance is preferred. In the case of binomial regression, when the denominators, $m_i$, are large and a model fits, the residual deviance follows a $\chi^2$ distribution with $n-p$ degrees of freedom (the residual degrees of freedom). Thus for a good fitting model the residual deviance should be approximately equal to its corresponding degrees of freedom. When binomial data meets these conditions, the deviance can be used for a goodness-of-fit test. The p-value for lack-of-fit is the proportion of values from a $\chi_{n-p}^2$ distribution that are greater than the observed residual deviance.
 
 We begin a residual analysis of our interaction model by plotting the residuals against the fitted values in Figure \@ref(fig:resid). This kind of plot for binomial regression would produce two linear trends with similar negative slopes if there were equal sample sizes $m_i$ for each observation.
 

@@ -652,7 +652,7 @@ We began by fitting a logistic regression model with `distance` alone. Then we a
 
 ## Case Study: Trying to Lose Weight
 
-The final case study uses individual-specific information so that our response, rather than the number of successes out of some number of trials, is simply a binary variable taking on values of 0 or 1 (for failure/success, no/yes, etc.). This type of problem---__binary logistic regression__---is exceedingly common in practice \index{binary logistic regression}. Here we examine characteristics of young people who are trying to lose weight. The prevalence of obesity among U.S. youth suggests that wanting to lose weight is sensible and desirable for some young people such as those with a high body mass index (BMI). On the flip side, there are young people who do not need to lose weight but make ill-advised attempts to do so nonetheless. A multitude of studies on weight loss focus specifically on youth and propose a variety of motivations for the young wanting to lose weight; athletics and the media are two commonly cited sources of motivation for losing weight for young people.
+The final case study uses individual-specific information so that our response, rather than the number of successes out of some number of trials, is simply a binary variable taking on values of 0 or 1 (for failure/success, no/yes, etc.). This type of problem---__binary logistic regression__---is exceedingly common in practice. \index{binary logistic regression} Here we examine characteristics of young people who are trying to lose weight. The prevalence of obesity among U.S. youth suggests that wanting to lose weight is sensible and desirable for some young people such as those with a high body mass index (BMI). On the flip side, there are young people who do not need to lose weight but make ill-advised attempts to do so nonetheless. A multitude of studies on weight loss focus specifically on youth and propose a variety of motivations for the young wanting to lose weight; athletics and the media are two commonly cited sources of motivation for losing weight for young people.
 
 Sports have been implicated as a reason for young people wanting to shed pounds, but not all studies are consistent with this idea. For example, a study by @Martinsen2009 reported that, despite preconceptions to the contrary, there was a higher rate of self-reported eating disorders among controls (non-elite athletes) as opposed to elite athletes. Interestingly, the kind of sport was not found to be a factor, as participants in leanness sports (for example, distance running, swimming, gymnastics, dance, and diving) did not differ in the proportion with eating disorders when compared to those in non-leanness sports. So, in our analysis, we will not make a distinction between different sports.
 

@@ -1303,5 +1303,5 @@ summary(model1a)
 4. __Trashball.__ Great for a rainy day! A fun way to generate overdispersed binomial data. Each student crumbles an 8.5 by 11 inch sheet and tosses it from three prescribed distances ten times each. The response is the number of made baskets out of 10 tosses, keeping track of the distance. Have the class generate and collect potential covariates, and include them in your data set (e.g., years of basketball experience, using a tennis ball instead of a sheet of paper, height). Some sample analysis steps:
 
 a. Create scatterplots of logits vs. continuous predictors (distance, height, shot number, etc.) and boxplots of logit vs. categorical variables (sex, type of ball, etc.). Summarize important trends in one or two sentences.
-b. Create a graph with empirical logits vs. distance plotted separately by sex. What might you conclude from this plot?
+b. Create a graph with empirical logits vs. distance plotted separately by type of ball. What might you conclude from this plot?
 c. Find a binomial model using the variables that you collected. Give a brief discussion on your findings.
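The corrected coefficient ${24 \choose 22}$ in the first hunk matches the 22 successes out of 24 trials in the first likelihood factor. As a quick check that each binomial factor peaks near its sample proportion, here is an illustrative grid search (the counts come from the hunk; the grid itself is ours):

```r
# Maximize each binomial factor of the likelihood over a grid of probabilities
p <- seq(0.01, 0.99, by = 0.01)
p[which.max(dbinom(22, size = 24, prob = p))]    # 0.92, near 22/24
p[which.max(dbinom(141, size = 180, prob = p))]  # 0.78, near 141/180
```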

07-Correlated-Data.Rmd (3 additions, 3 deletions)

@@ -60,9 +60,9 @@ tModelName <- c("fit\\_1a\\_binom", "fit\\_1a\\_quasi", "fit\\_1b\\_binom", "fit
 "Model Name", "fit\\_2a\\_binom","fit\\_2a\\_quasi", "fit\\_2b\\_binom", "fit\\_2b\\_quasi")
 
 tBeta <- c("","","","","",
-"$\\beta_1$", "","","","")
+"$\\hat{\\beta}_1$", "","","","")
 tSEBeta <- c("","","","","",
-"SE $\\beta_1$", "","","","")
+"SE $\\hat{\\beta}_1$", "","","","")
 tTStat <- c("","","","","",
 "$t$", "","","","")
 tPVal <- c("","","","","",

@@ -81,7 +81,7 @@ tGOFP <- c("","X","","X","",
 "GOF p value", "","X","","X")
 
 scenarioSimTab <- tibble(tScenario, tModel, tModelName, tBeta, tSEBeta, tTStat, tPVal, tPhi, tEst, tCI, tMeanCount, tSDCount, tGOFP)
-colnames(scenarioSimTab) <- c("Scenario", "Model", "Model Name", "$\\beta_0$", "SE $\\beta_0$", "$t$", "p value", "$\\phi$", "Est prob", "CI prob", "Mean count", "SD count", "GOF p value")
+colnames(scenarioSimTab) <- c("Scenario", "Model", "Model Name", "$\\hat{\\beta}_0$", "SE $\\hat{\\beta}_0$", "$t$", "p value", "$\\phi$", "Est prob", "CI prob", "Mean count", "SD count", "GOF p value")
 
 kable(scenarioSimTab, booktabs=T, caption="Summary of simulations for Dams and Pups case study.", escape=F) %>%
 kable_styling(latex_options = "scale_down", font_size = 9) %>%
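The hats added to $\beta$ in this table mark estimates, and the table's point is that binomial and quasibinomial fits give identical estimates but different standard errors. A simulated sketch of that relationship (the data, sample sizes, and covariate here are invented for illustration and are not the book's Dams and Pups simulation):

```r
# Overdispersed binomial data: observation-specific success probabilities
set.seed(1)
n <- 10                                # trials per observation
p <- rbeta(100, 2, 2)                  # varying probabilities induce overdispersion
y <- rbinom(100, size = n, prob = p)   # overdispersed counts
x <- rnorm(100)                        # an arbitrary covariate

fit_binom <- glm(cbind(y, n - y) ~ x, family = binomial)
fit_quasi <- glm(cbind(y, n - y) ~ x, family = quasibinomial)

# Same coefficient estimates; quasibinomial SEs are the binomial SEs
# inflated by sqrt(phi-hat), the estimated dispersion
phi <- summary(fit_quasi)$dispersion
summary(fit_binom)$coefficients[, "Std. Error"] * sqrt(phi)
summary(fit_quasi)$coefficients[, "Std. Error"]  # identical to the line above
```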
