Clicker Questions

To accompany Practicing Statistics by Kuiper & Sklar. Math 150 - Methods in Biostatistics.


In terms of the prerequisite for Math 150, Methods in Biostatistics, you should know at least a little bit (hopefully a lotta bit) about the following topics.

  1. Hypothesis test, confidence interval, sample mean, central limit theorem, standard deviation, standard error of a statistic, p-value, t-test, chi-square test.1

    1. Never heard of it
    2. Heard of it, but don’t know anything about it
    3. Know a little about it (or did once)
    4. Know something about it
    5. Confident about it

In terms of the prerequisite for Math 150, Methods in Biostatistics, you do not need to know the following topics.

  1. Interaction, simple linear regression, multiple linear regression, logistic regression, survival analysis, R.2

    1. Never heard of it
    2. Heard of it, but don’t know anything about it
    3. Know a little about it (or did once)
    4. Know something about it
    5. Confident about it

  1. The Central Limit Theorem (CLT) says:3
    1. The sample average (statistic) converges to the true average (parameter)
    2. The sample average (statistic) converges to some point
    3. The distribution of the sample average (statistic) converges to a normal distribution
    4. The distribution of the sample average (statistic) converges to some distribution
    5. I have no idea what the CLT says

  1. The p-value is the probability:4
    1. that the null hypothesis is true given the observed data.
    2. of data as or more extreme than the observed data given that the null hypothesis is true.

  1. Why do we use a t distribution (instead of a z / normal distribution) in the t-test?5
    1. the technical conditions don’t hold
    2. the means are quite variable
    3. we like the letter t
    4. we have two samples
    5. we don’t know the true standard deviation parameter

  1. What happens if a t-test is used but isn’t appropriate (technical conditions don’t hold)?6
    1. the p-value isn’t actually the probability of our data or more extreme if H0 is true.
    2. the software won’t give a p-value as output
    3. the rejection region needs to be calculated in the opposite direction
    4. the world blows up

  1. We use linear regression to run a test of means (\(x_i = 0\) for controls, group 1; \(x_i = 1\) for cases, group 2). What is \(\sum_i x_i\)?7
    1. \(n\)
    2. \(n_1\)
    3. \(n_2\)
    4. \(n_1 \cdot \overline{y}_1\)
    5. \(n_2 \cdot \overline{y}_2\)

  1. We use linear regression to run a test of means (\(x_i = 0\) for controls, group 1; \(x_i = 1\) for cases, group 2). What is \(\sum_i x_iy_i\)?8
    1. \(n\)
    2. \(n_1\)
    3. \(n_2\)
    4. \(n_1 \cdot \overline{y}_1\)
    5. \(n_2 \cdot \overline{y}_2\)
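
A quick R sketch (simulated data, not from the text) of why these sums work out the way they do: with a 0/1 explanatory variable, the least-squares slope is exactly the difference in group means, so the two-sample comparison is a special case of simple linear regression.

```r
set.seed(1)
x <- rep(c(0, 1), times = c(12, 15))       # 12 controls (x = 0), 15 cases (x = 1); made-up sizes
y <- rnorm(27, mean = 10 + 2 * x, sd = 3)  # simulated response

sum(x)                         # n2, the number of cases
sum(x * y)                     # n2 * (mean of y among the cases)
coef(lm(y ~ x))[2]             # slope = mean(y | x = 1) - mean(y | x = 0)
mean(y[x == 1]) - mean(y[x == 0])
```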

  1. The regression technical conditions include:9
    1. The Y variable is normally distributed
    2. The X variable is normally distributed
    3. The residuals are normally distributed
    4. The slope coefficient is normally distributed
    5. The intercept coefficient is normally distributed

  1. We need the technical conditions to hold in order to calculate \(b_0\) and \(b_1.\)10

    1. TRUE
    2. FALSE
    3. It depends

  1. Why do we check technical conditions?11
    1. so that the inference is valid
    2. so that the estimates are valid
    3. so that the p-value is more likely to be small
    4. so that the confidence level is right
    5. for fun

  1. When writing the regression equation, why is there a hat (^) on the response variable?12
    1. because the prediction is an estimate
    2. because the prediction is an average
    3. because the prediction may be due to extrapolation
    4. a & b
    5. all of the above

  1. With a strong correlation and very small p-value, what can we conclude about happiness and life expectancy?13
    1. happiness causes longer lives
    2. longer lives cause happiness
    3. happiness and longer life are correlated
    4. happiness and longer life are perfectly predictive
    5. happiness and longer life are unrelated

  1. If there is no relationship in the population (true correlation = 0), then r = 0.14
    1. TRUE
    2. FALSE

  1. If there is no relationship in the population (true slope \(\beta_1 = 0\)), then \(b_1=0\).15
    1. TRUE
    2. FALSE

  1. Smaller variability around the regression line (\(\sigma\)):16
    1. increases the variability of \(b_1\).
    2. decreases the variability of \(b_1\).
    3. doesn’t necessarily change the variability of \(b_1\).

  1. Smaller variability in the explanatory variable (SD(X) = \(s_X\)):17
    1. increases the variability of \(b_1\).
    2. decreases the variability of \(b_1\).
    3. doesn’t necessarily change the variability of \(b_1\).

  1. A smaller sample size (\(n\)):18
    1. increases the variability of \(b_1\).
    2. decreases the variability of \(b_1\).
    3. doesn’t necessarily change the variability of \(b_1\).
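
The three questions above all trace back to the same standard error formula for the slope; as a reminder (this is the usual simple linear regression result, not a formula stated in the questions):

\[ SE(b_1) = \frac{\sigma}{\sqrt{\sum_i (x_i - \bar{x})^2}} = \frac{\sigma}{s_X \sqrt{n-1}}, \]

so the variability of \(b_1\) grows when \(\sigma\) grows and shrinks when \(s_X\) or \(n\) grows.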

  1. We transform our variables…19
    1. … to find the highest \(r^2\) value.
    2. … when the X variable is not normally distributed.
    3. … to make the model easier to interpret.
    4. … so that the technical conditions are met.

  1. In the Botox and Pain Relief example, the p-value is calculated. What does “probability” refer to?20
    1. random allocation
    2. random sample

p-value = probability of the observed data or more extreme given the null hypothesis is true.


  1. “Observed data or more extreme” is:21
    1. fewer than 9
    2. 9 or fewer
    3. 9 or more
    4. more than 9

  1. What is the mean value of the null sampling distribution for the number of Botox-therapy patients who showed pain reduction?22
    1. 0
    2. 9
    3. 5.3
    4. 11
    5. 15

  1. What conclusion would you draw from the Back Pain and Botox study?23
    1. Not enough evidence to conclude that Botox is more effective than the placebo.
    2. Strong evidence that Botox is equally as effective as the placebo.
    3. Strong evidence that Botox is more effective than the placebo.

  1. If we consider those in the study with back pain to be representative of all people with back pain, what would you conclude about the percentage of people who will have reduced back pain if they use Botox?24
    1. Substantially greater than 50%
    2. Substantially less than 50%
    3. Close to 50%
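
A short R sketch of the null distribution behind the Botox questions above, assuming (consistent with the answer key) 15 Botox patients, 16 placebo patients, and 11 people total who showed pain reduction, 9 of them in the Botox group:

```r
m <- 15; n <- 16; k <- 11        # Botox group, placebo group, total who improved (assumed counts)
k * m / (m + n)                  # mean of the null (hypergeometric) distribution, about 5.3
sum(dhyper(9:11, m, n, k))       # p-value: P(9 or more of the 11 improvers are in the Botox group)
```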

  1. Material check-in
    1. So far, so good
    2. Concepts are good, R is confusing
    3. R is good, concepts are confusing
    4. Everything is confusing

  1. People check-in
    1. So far, so good
    2. I can go to office hours / mentor sessions, but I didn’t happen to go this week.
    3. I can’t make the scheduled office hours / mentor sessions
    4. I’m looking for someone to study with

See Canvas front page for anonymous survey / feedback for the class. Also, if you are looking for people to work with, you could contact me directly (non-anonymously!) so that I can connect you to people.


  1. Sample 1,000,000 people who are over 6’ tall and 1,000,000 people who are under 6’ tall. Record if the person is in the NBA. What is measurable?25
    1. P(NBA if 6’ tall)
    2. P(6’ tall if in the NBA)
    3. both
    4. neither

  1. Sample 100 people who are in the NBA and 100 people who are not in the NBA. Record if the person is over 6’ tall. What is measurable?26
    1. P(NBA if 6’ tall)
    2. P(6’ tall if in the NBA)
    3. both
    4. neither

  1. Sample 10,000,000 people. Record their height and whether or not they are in the NBA. What is measurable?27
    1. P(NBA if 6’ tall)
    2. P(6’ tall if in the NBA)
    3. both
    4. neither

From the NYT, March 21, 2023, https://www.nytimes.com/2023/03/21/sports/basketball/tall-basketball-march-madness.html

The average W.N.B.A. player, at a shade taller than 6 feet, towers over the average American woman (5 feet 3.5 inches). American men who are between 6 feet and 6-2 — significantly taller than the 5-9 average — have about a five in a million chance of making the N.B.A., according to “The Sports Gene,” a 2013 book by David Epstein about the science of athletic performance. But if you hit the genetic lottery and happen to be 7 feet tall, your chances of landing in the N.B.A. are roughly one in six. (There are 38 players on active rosters who are 7 feet or taller, according to N.B.A. Advanced Stats; the average height of an N.B.A. player is 6 feet 6.5 inches.)

https://davidepstein.com/david-epstein-the-sports-gene/
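
A back-of-envelope contrast of the two directions of conditioning, using the numbers quoted above plus an assumed league size of roughly 30 teams × 15 roster spots (the exact roster count is an assumption, not from the article):

```r
1 / 6            # P(in the NBA | 7 feet tall), from Epstein's estimate
38 / (30 * 15)   # P(7 feet or taller | in the NBA), roughly 0.08 under the assumed roster size
# The two probabilities answer different questions, and which one you can estimate
# depends on whether you sampled on height or on NBA membership.
```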


  1. Calcium channel blockers have recently been reported to be associated with increased mortality. Cardiac patients who recently died of their heart disease were compared to control cardiac patients with similar disease who survived. Assume such a study had found that 40% of the recent cardiac deaths were taking calcium channel blockers at the time of death, as compared to 25% of the controls.28
    1. Case-control
    2. Cohort
    3. Cross-classification

  2. It is well known that the use of urinary catheters conveys a substantial risk of urinary tract infection (UTI). A group of physicians believe that, in an intensive care setting, use of one particular type of urinary catheter is more likely to encourage infection than use of other types. They therefore review medical records over a recent period for all uses of urinary catheters in an ICU. They find that 200 new UTIs occurred during 1000 ICU patient-days of catheterization with the suspect type of catheter, as compared to 100 new UTIs during 5000 ICU patient-days of catheterization with all other types. Noting the increased frequency of new UTIs when the suspect catheter type is used, they regard their hypothesis as confirmed. To reduce nosocomial UTIs, they recommend discontinuing use of that type of catheter in the ICU.29
    1. Case-control
    2. Cohort
    3. Cross-classification

  1. When we select individuals based on the explanatory variable, we cannot accurately measure30
    1. the proportion of people in the population in each explanatory category
    2. the proportion of people in the population in each response group
    3. anything about the population
    4. confounding variables

  1. Relative Risk is31
    1. the difference of two proportions
    2. the ratio of two proportions
    3. the log of the ratio of two proportions
    4. the log of the difference of two proportions

  1. The statement that the odds ratio is “invariant to which variable is explanatory and which is response” means:32
    1. we always put the bigger odds in the numerator
    2. we must collect data so that we can estimate the response in the population
    3. which variable is called the explanatory variable changes the value of the OR
    4. which variable is called the explanatory variable does not change the value of the OR

  1. In finding a CI for RR = p1/p2, why is it okay to exponentiate the end points of the interval for ln(p1/p2)?33
    1. Because if ln(p1/p2) is in the original interval, p1/p2 will be in the exponentiated interval.
    2. Because taking the natural log of the RR makes the distribution approximately normal.
    3. Because the natural log compresses values that are bigger than 1 and spreads values that are smaller than 1.
    4. Because we can get exact p-values using Fisher’s Exact Test.

  1. In order to find a CI for the true OR, our steps are: (i) find \(\widehat{\ln(\mbox{OR})}\), (ii) add \(\pm \ z^* \sqrt{\frac{1}{n_1 \hat{p}_1 (1-\hat{p}_1)} + \frac{1}{n_2 \hat{p}_2 (1-\hat{p}_2)}}\), (iii) take exp of the endpoints. These steps work:34
    1. because the sampling distribution of \(\widehat{\mbox{OR}}\) is normal
    2. because OR is typically greater than 1
    3. because the \(\ln\) transformation makes the sampling distribution almost normal
    4. because OR is invariant to the choice of explanatory or response variable
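
A minimal R sketch of those three steps; the group sizes and success counts below are hypothetical:

```r
x1 <- 30; n1 <- 100                     # successes / sample size, group 1 (hypothetical)
x2 <- 18; n2 <- 120                     # successes / sample size, group 2 (hypothetical)
p1 <- x1 / n1; p2 <- x2 / n2
or_hat  <- (p1 / (1 - p1)) / (p2 / (1 - p2))
se_lnor <- sqrt(1 / (n1 * p1 * (1 - p1)) + 1 / (n2 * p2 * (1 - p2)))
ci_ln   <- log(or_hat) + c(-1, 1) * qnorm(0.975) * se_lnor   # interval for ln(OR)
exp(ci_ln)                                                   # exponentiate the endpoints
```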

  1. I know where to find: the solutions to the warm-ups, the clicker questions (with solutions), and the HW solutions35
    1. TRUE
    2. FALSE

  1. At the value \(x = -\beta_0 / \beta_1\), the probability of success is:36
    1. 0
    2. 0.5
    3. 1
    4. depends on \(\beta_0\)
    5. depends on \(\beta_1\)

  1. The logistic model gives probability of failure:37
    1. \(\frac{e^{\beta_0+ \beta_1 x}}{1+ e^{\beta_0+ \beta_1 x}}\)
    2. \(\frac{1}{1+ e^{\beta_0+ \beta_1 x}}\)
    3. \(e^{\beta_0+ \beta_1 x}\)
    4. \(e^{-(\beta_0+ \beta_1 x)}\)
    5. \(\beta_0+ \beta_1 x\)

  1. The logistic model gives odds of success:38
    1. \(\frac{e^{\beta_0+ \beta_1 x}}{1+ e^{\beta_0+ \beta_1 x}}\)
    2. \(\frac{1}{1+ e^{\beta_0+ \beta_1 x}}\)
    3. \(e^{\beta_0+ \beta_1 x}\)
    4. \(e^{-(\beta_0+ \beta_1 x)}\)
    5. \(\beta_0+ \beta_1 x\)

  1. The logistic model gives odds of failure:39
    1. \(\frac{e^{\beta_0+ \beta_1 x}}{1+ e^{\beta_0+ \beta_1 x}}\)
    2. \(\frac{1}{1+ e^{\beta_0+ \beta_1 x}}\)
    3. \(e^{\beta_0+ \beta_1 x}\)
    4. \(e^{-(\beta_0+ \beta_1 x)}\)
    5. \(\beta_0+ \beta_1 x\)
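
To keep the four expressions straight, here is a small R sketch with made-up values (\(\beta_0 = -2\), \(\beta_1 = 0.8\), \(x = 1.5\) are arbitrary):

```r
b0 <- -2; b1 <- 0.8; x <- 1.5
eta <- b0 + b1 * x
exp(eta) / (1 + exp(eta))   # P(success); same as plogis(eta)
1 / (1 + exp(eta))          # P(failure)
exp(eta)                    # odds of success
exp(-eta)                   # odds of failure
```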

  1. With a logistic regression model, the relative risk of success (for a one unit increase in X) is:40
    1. \(- \beta_0/\beta_1\)
    2. \(\beta_0+ \beta_1 x\)
    3. \(e^{\beta_0+ \beta_1 x}\)
    4. a non-linear function of X (which depends on X )

  1. If we want the relative risk of survival (for a one unit increase in X) to be independent of X, we should use which link:41
    1. linear
    2. logistic
    3. complementary log-log
    4. log-linear

  1. You take a sample of size 4 from a binary population and get: FSFF. (failure, success, failure, failure) What is your guess for p = P(success)?42
    1. 0.05
    2. 0.15
    3. 0.25
    4. 0.5
    5. 0.75

  1. In a logistic regression model, the variability is given by43
    1. Normal Y given X
    2. Binomial Y given X
    3. Bernoulli Y given X
    4. Poisson Y given X

  1. When trying to find estimates for \(\beta_0\) and \(\beta_1\), we maximize the likelihood. \[\prod_{i=1}^n \bigg(\frac{e^{\beta_0+ \beta_1 x_i}}{1+ e^{\beta_0+ \beta_1 x_i}}\bigg)^{y_i}\bigg(\frac{1}{1+ e^{\beta_0+ \beta_1 x_i}}\bigg)^{1 - y_i}\] Take the derivative with respect to which variable(s):44
    1. X
    2. Y
    3. \(\beta_0\)
    4. \(\beta_1\)
    5. \(\beta_0\) and \(\beta_1\)
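
A hedged R sketch of what “maximize the likelihood” means in practice: the data (x, y) stay fixed and we search over \((\beta_0, \beta_1)\). The simulated data and the use of `optim()` are illustrative, not the method prescribed in the text.

```r
set.seed(2)
x <- rnorm(100)
y <- rbinom(100, 1, plogis(-0.5 + 1.2 * x))        # simulated 0/1 responses

negloglik <- function(beta) {                      # minus the log of the product above
  eta <- beta[1] + beta[2] * x
  -sum(y * eta - log(1 + exp(eta)))
}
optim(c(0, 0), negloglik)$par                      # numerical MLEs of (beta0, beta1)
coef(glm(y ~ x, family = binomial))                # glm finds (essentially) the same values
```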

  1. Maximum likelihood estimation seeks to:45
    1. Find the data which are most likely under the model.
    2. Find the parameters which are most likely under the model.
    3. Find the parameters which make the data most likely under the model.
    4. Find the data which make the parameters most likely under the model.

  1. We use maximum likelihood estimation because:46
    1. It gives a principled approach for estimating the parameters.
    2. The estimates are asymptotically normally distributed.
    3. The estimates are always easy to compute.
    4. All of the above.
    5. Some of the above.

  1. We know that for a given data set (with MLEs of \(b_0\),\(b_1\)):47
    1. \(L(b_0,b_1)< L(b_0,\beta_1=0)\) always
    2. \(L(b_0,b_1)> L(b_0,\beta_1=0)\) always
    3. \(L(b_0,b_1) \leq L(b_0,\beta_1=0)\) always
    4. \(L(b_0,b_1) \geq L(b_0,\beta_1=0)\) always

  1. How many parameters did we estimate in the HERS warm-up with the additive model?48
    1. 1
    2. 3
    3. 4
    4. 2757
    5. 2761

  1. How many parameters did we estimate in the HERS warm-up with the interaction model?49
    1. 3
    2. 4
    3. 6
    4. 7
    5. 12

  1. What are the df for the LRT addressing whether interaction is needed in the HERS warm-up?50
    1. 2
    2. 3
    3. 2760
    4. 2754
    5. 2757

  1. (Bird nest example) How many parameters do we estimate when considering Length as a categorical variable? (the only variable)51
    1. 0
    2. 1
    3. 2
    4. 33
    5. 34

  1. (Bird nest example) How many df for the LRT addressing whether Length (as a categorical variable) belongs in the model?52
    1. 0
    2. 1
    3. 2
    4. 33
    5. 34

  1. (Bird nest example) How many df for the LRT addressing whether Incubate and Color belong in the model (given Length is determined to be in the model)?53
    1. 0
    2. 1
    3. 2
    4. 3
    5. 4

  1. An interaction term in a multiple logistic regression model may be used when:54
    1. the model fit is poor.
    2. there is a quadratic relationship between the response and explanatory variables.
    3. neither one of two explanatory variables contribute significantly to the regression model.
    4. the relationship between X1 and P(success) changes for differing values of X2.

  1. The interpretations of the main effects (on their own) make sense only when the interaction component is not significant.55
    1. TRUE
    2. FALSE

  1. If the interaction is significant but the main effects aren’t:56
    1. report on the significance of the main effects
    2. remove the main effects from the model
    3. avoid talking about main effects on their own
    4. test whether the main effects are significant without interaction in the model

  1. With two variables of interest, what should you test first?57
    1. Variable 1.
    2. Variable 2.
    3. The interaction between variables 1 and 2.
    4. None of the above.

  1. Consider variable 1 is continuous and variable 2 has 4 levels. How many degrees of freedom are associated with the drop in deviance test (LRT) of their overall interaction?58
    1. 1
    2. 2
    3. 3
    4. 4
    5. 5
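
A simulated R example (hypothetical data) of where the 3 degrees of freedom come from: one continuous variable crossed with a 4-level factor adds \(1 \times (4-1) = 3\) interaction coefficients.

```r
set.seed(3)
x1 <- rnorm(200)
x2 <- factor(sample(letters[1:4], 200, replace = TRUE))
y  <- rbinom(200, 1, plogis(0.3 * x1))

fit_add <- glm(y ~ x1 + x2, family = binomial)     # additive model
fit_int <- glm(y ~ x1 * x2, family = binomial)     # adds the 3 interaction terms
anova(fit_add, fit_int, test = "Chisq")            # drop-in-deviance (LRT); the Df column is 3
```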

  1. When selecting variables, it is important that59
    1. The model predicts training data well
    2. The model predicts test data well
    3. The coefficients on the variables are all significant
    4. The relationships between the variables make sense

  1. To get a sense of the true accuracy of the model, the test data should be assessed (for accuracy)60
    1. on the first model only.
    2. on the last model only.
    3. on every model in the process.

  1. If I am using all features of my dataset and I achieve 100% accuracy on my training set, but ~70% on the testing set, what should I look out for?61
    1. Underfitting
    2. Nothing, the model is perfect
    3. Overfitting

  1. If I am picking and choosing between features of my dataset and I achieve 30% accuracy on my training set, and ~30% on the testing set, what should I look out for?62
    1. Underfitting
    2. Nothing, the model is perfect
    3. Overfitting

  1. Cross validating will guarantee that the model does not overfit.63
    1. TRUE
    2. FALSE

  1. Suppose we want to compute 10-Fold Cross-Validation error on 200 training examples. We need to compute a model error rate N1 times, and the Cross-Validation error is the average of the errors. To compute each error, we need to train a model with data of size N2, and test the model on the data of size N3. What are the numbers for N1, N2, N3?64
    1. N1 = 1, N2 = 180, N3 = 20
    2. N1 = 10, N2 = 180, N3 = 20
    3. N1 = 10, N2 = 200, N3 = 20
    4. N1 = 10, N2 = 200, N3 = 200
    5. N1 = 20, N2 = 180, N3 = 20
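
A sketch of 10-fold cross-validation on 200 made-up observations: 10 fits, each trained on 180 rows and scored on the held-out 20, with the CV error being the average of the 10 fold errors.

```r
set.seed(4)
dat   <- data.frame(x = rnorm(200))
dat$y <- rbinom(200, 1, plogis(dat$x))
fold  <- sample(rep(1:10, each = 20))              # assign each row to one of 10 folds

errs <- sapply(1:10, function(k) {
  fit  <- glm(y ~ x, family = binomial, data = dat[fold != k, ])        # train on 180 rows
  phat <- predict(fit, newdata = dat[fold == k, ], type = "response")   # test on 20 rows
  mean((phat > 0.5) != dat$y[fold == k])                                # fold error rate
})
mean(errs)                                          # the 10-fold CV error
```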

  1. You are reviewing papers for Fancy Conference, and you see submissions with the following claims. Which ones would you consider accepting?65
    1. My method achieves a training error lower than all previous methods!
    2. My method achieves a test error lower than all previous methods! (Footnote: When variables are chosen so as to min test error.)
    3. My method achieves a test error lower than all previous methods! (Footnote: When variables are chosen so as to min CV error.)
    4. My method achieves a CV error lower than all previous methods! (Footnote: When variables are chosen so as to min CV error.)

  1. Which model is better (according to ROC)?66
    1. pink because it goes closer to (1,1)
    2. pink because it is closer to y=x
    3. blue because it is farther from y=x
    4. blue because it is steeper
    5. neither


  1. In an ROC curve, the x-axis measures67
    1. True Pos Rate which we want high
    2. False Pos Rate which we want low
    3. True Neg Rate which we want high
    4. False Neg Rate which we want low
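
A small R sketch (simulated data) that builds the ROC axes by hand: for a grid of thresholds, the y-axis is the true positive rate and the x-axis is the false positive rate, with the \(y=x\) line as the “no better than guessing” reference.

```r
set.seed(5)
x    <- rnorm(300)
y    <- rbinom(300, 1, plogis(1.5 * x))
phat <- fitted(glm(y ~ x, family = binomial))       # predicted probabilities

cuts <- seq(0, 1, by = 0.01)
tpr  <- sapply(cuts, function(cc) mean(phat[y == 1] > cc))   # true positive rate
fpr  <- sapply(cuts, function(cc) mean(phat[y == 0] > cc))   # false positive rate
plot(fpr, tpr, type = "l", xlab = "False positive rate", ylab = "True positive rate")
abline(0, 1, lty = 2)                               # the y = x reference line
```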

  1. Quiz on 11 topics (you know nothing). Your friends know topics:
    A: {1, 2, 3, 4, 5, 6, 7}
    B: {8, 9, 10}
    C: {1, 2, 3, 4, 8, 10}
    D: {5, 6, 7, 9, 11}
    Who should you choose to help you answer the questions?68
    1. A
    2. B
    3. C
    4. D
    5. can’t tell

  1. Who do you want to choose next?69
    A: {1, 2, 3, 4, 5, 6, 7}
    B: {8, 9, 10}
    C: {1, 2, 3, 4, 8, 10}
    D: {5, 6, 7, 9, 11}
    1. A
    2. B
    3. C
    4. D
    5. can’t tell

  1. If you can pick two people, who do you pick?70
    A: {1, 2, 3, 4, 5, 6, 7}
    B: {8, 9, 10}
    C: {1, 2, 3, 4, 8, 10}
    D: {5, 6, 7, 9, 11}
    1. A, B
    2. A, C
    3. A, D
    4. C, B
    5. C, D

  1. The variables in the k-variable model identified by forward selection are a subset of the variables in the (k+1)-variable model identified by forward selection.71
    1. TRUE (always TRUE)
    2. FALSE (not always TRUE)

  1. The variables in the k-variable model identified by backward selection are a subset of the variables in the (k+1)-variable model identified by backward selection.72
    1. TRUE (always TRUE)
    2. FALSE (not always TRUE)

  1. The variables in the k-variable model identified by backward selection are a subset of the variables in the (k+1)-variable model identified by forward selection.73
    1. TRUE (always TRUE)
    2. FALSE (not always TRUE)

  1. The variables in the k-variable model identified by forward selection are a subset of the variables in the (k+1)-variable model identified by backward selection.74
    1. TRUE (always TRUE)
    2. FALSE (not always TRUE)

  1. The variables in the k-variable model identified by best-subsets selection are a subset of the variables in the (k+1)-variable model identified by best-subsets selection.75
    1. TRUE (always TRUE)
    2. FALSE (not always TRUE)

  1. In a drop-in-deviance test (LRT), the reduced model corresponds to the null hypothesis being true.76
    1. TRUE
    2. FALSE

  1. In a drop-in-deviance test (LRT), the full model corresponds to the alternative hypothesis being true.77
    1. TRUE
    2. FALSE

  1. With model building:78
    1. There are many ways to find a good model.
    2. There is always one right answer.
    3. There is no end to the fun.
    4. Can we take a pure math class yet?

  1. When the probability of being able to buy a candy bar is modeled as a function of the number of coins, the coefficient on the number of coins is:79
    1. positive
    2. negative
    3. zero
    4. no intuition exists for being able to answer this question

  1. When the probability of being able to buy a candy bar is modeled as a function of the number of low coins, the coefficient on the number of low coins is:80
    1. positive
    2. negative
    3. zero
    4. no intuition exists for being able to answer this question

  1. When the probability of being able to buy a candy bar is modeled as a function of the number of coins and the number of low coins, the coefficient on the number of coins is:81
    1. positive
    2. negative
    3. zero
    4. no intuition exists for being able to answer this question

  1. When the probability of being able to buy a candy bar is modeled as a function of the number of coins and the number of low coins, the coefficient on the number of low coins is:82
    1. positive
    2. negative
    3. zero
    4. no intuition exists for being able to answer this question

  1. If we consider the censored times to be event times, the empirical survival curve will (on average)83
    1. underestimate the parameter
    2. overestimate the parameter
    3. sometimes under and sometimes overestimate the parameter

  1. \(n_i - d_i = n_{i+1}\) when:84
    1. there are no deaths at time \(t_i\)
    2. there is no censoring at time \(t_i\)
    3. there are no deaths at time \(t_{i+1}\)
    4. there is no censoring at time \(t_{i+1}\)
    5. there is no censoring at time \(t_{i-1}\)

  1. \(\frac{(n_i - d_i)}{n_i} = 1\) when:85
    1. there are no deaths at time \(t_i\)
    2. there is no censoring at time \(t_i\)
    3. there are no deaths at time \(t_{i+1}\)
    4. there is no censoring at time \(t_{i+1}\)
    5. there is no censoring at time \(t_{i-1}\)
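
A tiny Kaplan-Meier example in R (the event times and censoring below are invented) showing where the \(n_i\) and \(d_i\) in the two questions above live in the output:

```r
library(survival)
time   <- c(3, 5, 5, 8, 10, 12, 15, 18)
status <- c(1, 1, 0, 1,  0,  1,  0,  1)   # 1 = event, 0 = censored (made-up data)
fit <- survfit(Surv(time, status) ~ 1)
summary(fit)   # n.risk is n_i, n.event is d_i; survival multiplies the (n_i - d_i)/n_i factors
```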

  1. Proportion surviving > 50 days, treated group (turquoise line)86
    1. ~0.65
    2. ~0.35
    3. ~0.45
    4. we only know it’s bigger than red
    5. we only know it’s smaller than red



  1. Kaplan-Meier curves (log-rank p-value):87
    1. blue is clearly better
    2. red is clearly better
    3. can’t tell because they cross
    4. can’t tell because the p-value is big
    5. can’t tell because the p-value is small



  1. In the log-rank test, why is it okay to consider only one cell of the 2x2 table at time \(t_j\)?88
    1. Because the row totals are fixed.
    2. Because the column totals are fixed.
    3. Because the row and column totals are fixed.
    4. Because the total number of observations is fixed.

  1. What does it mean for the log rank test to be more powerful than the Wilcoxon test?89
    1. log rank is more likely to reject \(H_0\) when \(H_0\) is true.
    2. log rank is more likely to reject \(H_0\) when \(H_0\) is false.
    3. log rank is less likely to reject \(H_0\) when \(H_0\) is true.
    4. log rank is less likely to reject \(H_0\) when \(H_0\) is false.

  1. The hazard at time \(t\) represents:90
    1. the probability of the event
    2. the instantaneous rate of the event
    3. the relative risk of the event
    4. the odds ratio of the event

  1. The last entry in the table for the h(t) column is NA because:91
    1. the last observation was a death
    2. the last observation was censored
    3. the time interval is too big
    4. the time interval is too small

Table 9.6 [@KuiperSklar]


  1. Censored observations\(\ldots\)?92
    1. are more important than non-censored ones in survival analysis
    2. are assumed to be normally distributed over time
    3. are assumed to have the same survival chances as uncensored observations
    4. are essential to allow calculation of the Kaplan-Meier plot
    5. are allocated to the baseline survival curve

  1. Survival analysis: for a one-unit change in an explanatory variable, the exponentiated coefficient \(e^\beta\) represents:93
    1. baseline survival
    2. survival ratio
    3. baseline hazard
    4. hazard ratio

  1. In survival analysis, the closest interpretation of the value \(e^\beta\) is:94
    1. odds
    2. probability
    3. time to event
    4. relative risk
    5. odds ratio
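
A hedged Cox-model sketch with simulated data, illustrating that `exp(coef)` is the estimated hazard ratio for a one-unit increase in the explanatory variable (interpreted roughly like a relative risk of the event):

```r
library(survival)
set.seed(6)
x      <- rnorm(100)
time   <- rexp(100, rate = exp(0.5 * x))   # larger x -> larger hazard -> earlier events
status <- rbinom(100, 1, 0.8)              # some observations censored (non-informatively)
fit <- coxph(Surv(time, status) ~ x)
exp(coef(fit))                             # estimated hazard ratio, near exp(0.5), about 1.65
```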

  1. Let the event be death. If larger values of the explanatory variable are associated with higher likelihood of survival, the coefficient \((\beta)\) should be95
    1. bigger than 1
    2. smaller than 1
    3. positive
    4. negative
    5. zero

  1. Let the event be death. If larger values of the variable are NOT associated with higher (or lower) likelihood of survival, the coefficient \((\beta)\) should be96
    1. bigger than 1
    2. smaller than 1
    3. positive
    4. negative
    5. zero

  1. BP violates the “linear HR” condition if:97
    1. the ln ratio of the hazard curves is not linear with respect to BP
    2. the ln ratio of the survival curves is not linear with respect to BP
    3. the effect of BP is to increase the hazard
    4. the effect of BP is to decrease the hazard
    5. there is no effect due to BP

  1. A Cox regression analysis:98
    1. Is used to analyze survival data when individuals in the study are followed for varying lengths of time.
    2. Can only be used when there are censored data
    3. Assumes that the relative hazard for a particular variable is always constant
    4. Uses the logrank statistic to compare two survival curves
    5. Relies on the condition that the explanatory variables (covariates) in the model are normally distributed.

  1. The effect of weight could violate PH if:99
    1. people of different weights are in control vs treatment group
    2. people tend to weigh less over time
    3. the hazard function for weight is not monotonic
    4. the hazard function changes as a function of weight which is also changing over time

  1. The effect of treatment could violate PH if:100
    1. the treatment has no effect
    2. the treatment produces short term benefits only
    3. the treatment effect interacts with a different variable, like gender
    4. there is more than one treatment group

  1. AIC, BIC, model validation, and stepwise regression are methods for101
    1. parameter estimation
    2. variable selection

  1. If \(\alpha = 0.05\), I would expect 5% of all hypotheses to be rejected.102
    1. TRUE
    2. FALSE

  1. Power is:103
    1. P(type I error)
    2. P(type II error)
    3. 1 – P(type I error)
    4. 1 – P(type II error)

type I = \(H_0\) true, but we reject
type II = \(H_0\) false, but we fail to reject
power = P(rejecting when \(H_0\) false)
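
A quick simulation sketch of power (the sample sizes and effect size below are arbitrary): simulate many studies in which \(H_0\) is false and record how often the test rejects.

```r
set.seed(7)
reject <- replicate(5000, {
  t.test(rnorm(30, mean = 0), rnorm(30, mean = 0.5))$p.value < 0.05
})
mean(reject)   # estimated power = P(reject H0 | H0 false) = 1 - P(type II error)
```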


  1. The p-value is104
    1. P(\(H_0\) is true | data)
    2. P(\(H_a\) is true | data)
    3. P(data | \(H_0\) is true)
    4. P(data | \(H_a\) is true)
    5. 1 – P(data | \(H_0\) is true)

RA Fisher (1929):

> “… An observation is judged significant, if it would rarely have been produced, in the absence of a real cause of the kind we are seeking. It is a common practice to judge a result significant, if it is of such a magnitude that it would have been produced by chance not more frequently than once in twenty trials. This is an arbitrary, but convenient, level of significance for the practical investigator, but it does not mean that he allows himself to be deceived once in every twenty experiments. The test of significance only tells him what to ignore, namely all experiments in which significant results are not obtained. He should only claim that a phenomenon is experimentally demonstrable when he knows how to design an experiment so that it will rarely fail to give a significant result. Consequently, isolated significant results which he does not know how to reproduce are left in suspense pending further investigation.”


  1. For hypothesis testing, the problem of multiple comparisons (also known as the multiple testing problem) results from the increase in ________ that occurs when statistical tests are used repeatedly.105
    1. Type I errors
    2. Type II errors
    3. Null hypothesis
    4. Statistical hypothesis testing

  1. If \(H_0\) is true, the p-values should be distributed:106
    1. Uniformly (equal prob) on 0 to 1
    2. Uniformly on -1 to 1
    3. Unimodal on 0 to 1
    4. Skewed left on 0 to 1
    5. Skewed right on 0 to 1

  1. Given many many tests (presumably some are null and some are “true”), a good estimate of the number of null tests is:107
    1. (# p-values > 0.5) / 2
    2. (# p-values > 0.5) * 2
    3. (# p-values < 0.5) / 2
    4. (# p-values < 0.5) * 2

  1. What do I do if the adjusted p-value is bigger than 1?108
    1. Leave it unadjusted
    2. Assign the value of the previous (“smaller”) p-value
    3. Round it to 1
    4. Divide by 2

  1. With Holm’s method, what do I do if the (m+1)^th adjusted p-value is smaller than the m^th adjusted p-value?109
    1. Leave it unadjusted
    2. Assign the value of the m^th adjusted p-value to the (m+1)^th adjusted p-value
    3. Round it to 1
    4. Divide by 2
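
R’s `p.adjust()` carries out these bookkeeping rules automatically (capping adjusted p-values at 1, and enforcing monotonicity for Holm); the p-values below are made up:

```r
pvals <- c(0.001, 0.008, 0.012, 0.04, 0.20, 0.65)   # hypothetical p-values
p.adjust(pvals, method = "bonferroni")
p.adjust(pvals, method = "holm")
p.adjust(pvals, method = "BH")                      # Benjamini-Hochberg (FDR control)
```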

  1. The false discovery rate represents110
    1. the proportion of true discoveries out of the total tests
    2. the proportion of true discoveries out of the total discoveries
    3. the ratio of the number of true discoveries divided by the number of null discoveries
    4. the number of null discoveries out of the total tests
    5. the number of null discoveries out of the total discoveries

  1. FDR and FWER differ in that111
    1. FDR is a rate and FWER is a probability
    2. FDR controls the rate of false positives
    3. FWER controls the probability of getting a false positive
    4. some of the above
    5. all of the above

  1. Which multiple comparisons adjustment gives the highest power?112
    1. Bonferroni
    2. Holm
    3. Benjamini-Hochberg
    4. Storey (q-values)

Footnotes

  1. preferably d or e. maybe c on some of them.↩︎

  2. these are the topics we will be covering. Would be nice if you have heard of them.↩︎

  3. The distribution of the sample average (statistic) converges to a normal distribution↩︎

  4. of data as or more extreme than the observed data given that the null hypothesis is true.↩︎

  5. we don’t know the true standard deviation parameter↩︎

  6. the p-value isn’t actually the probability of our data or more extreme if H0 is true.↩︎

  7. \(n_2\)↩︎

  8. \(n_2 \cdot \overline{y}_2\)↩︎

  9. The residuals are normally distributed (which induces a., d., and e.). There is nothing in the technical conditions about the distribution of X (remember, X can be binary!).↩︎

  10. FALSE. We can always minimize the sums of squares, regardless of whether or not the model is any good.↩︎

  11. so that the inference is valid (and also for fun). Note that d. so that the confidence level is right is also a correct answer because confidence intervals are all part of the “inference” paradigm.↩︎

  12. due to estimation and average↩︎

  13. happiness and longer life are correlated↩︎

  14. FALSE, there is no reason that the statistic will equal the parameter.↩︎

  15. FALSE, there is no reason that the statistic will equal the parameter.↩︎

  16. decreases the variability of \(b_1\).↩︎

  17. increases the variability of \(b_1\).↩︎

  18. increases the variability of \(b_1\).↩︎

  19. so that the technical conditions are met.↩︎

  20. random allocation↩︎

  21. 9 or more↩︎

  22. 5.3 because (15/31)*11 = 5.3↩︎

  23. Strong evidence that Botox is more effective than the placebo.↩︎

  24. Close to 50% (the point estimate is 0.6)↩︎

  25. P(NBA if 6’ tall) (cohort: cannot measure the probability of the explanatory variable given the response)↩︎

  26. P(6’ tall if in the NBA) (case-control: cannot measure the probability of the response variable given a level of the explanatory variable)↩︎

  27. both (cross-classification: can measure all the probabilities)↩︎

  28. case-control (they selected based on people who had died or not)↩︎

  29. cross-classification (they selected all uses of catheters)↩︎

  30. the proportion of people in the population in each explanatory category (tbh, we can’t measure b either, but we can measure the proportion of people in each response group, separated by the explanatory variable)↩︎

  31. the ratio of two proportions↩︎

  32. which variable is called the explanatory variable does not change the value of the OR↩︎

  33. Because if ln(p1/p2) is in the original interval, p1/p2 will be in the exponentiated interval.↩︎

  34. because the \(\ln\) transformation makes the sampling distribution almost normal↩︎

  35. The warm-up solutions and clicker questions are on the main course website. The HW solutions are on Canvas under Files.↩︎

  36. 0.5↩︎

  37. \(\frac{1}{1+ e^{\beta_0+ \beta_1 x}}\)↩︎

  38. \(e^{\beta_0+ \beta_1 x}\)↩︎

  39. \(e^{-(\beta_0+ \beta_1 x)}\)↩︎

  40. a non-linear function of X (which depends on X)↩︎

  41. log-linear↩︎

  42. 0.25↩︎

  43. Bernoulli Y given X↩︎

  44. \(\beta_0\) and \(\beta_1\)↩︎

  45. Find the parameters which make the data most likely under the model.↩︎

  46. Some of the above (a. It gives a principled approach for estimating the parameters. and b. The estimates are asymptotically normally distributed.)↩︎

  47. \(L(b_0,b_1) \geq L(b_0,\beta_1=0)\) always↩︎

  48. 4 parameter estimates: \(b_0, b_1, b_2, b_3\)↩︎

  49. 7 parameter estimates: \(b_0, b_1, b_2, b_3, b_4, b_5, b_6\)↩︎

  50. 3 (7 - 4 = 3)↩︎

  51. 34↩︎

  52. 33 (34 - 1 = 33)↩︎

  53. 2 (4 - 2 = 2)↩︎

  54. the relationship between X1 and P(success) changes for differing values of X2.↩︎

  55. TRUE↩︎

  56. avoid talking about main effects on their own↩︎

  57. The interaction between variables 1 and 2. (probably… although there are many schools of thought on how to build models)↩︎

  58. 3 (1 * (4-1) = 3)↩︎

  59. The model predicts test data well↩︎

  60. on the last model only.↩︎

  61. overfitting↩︎

  62. underfitting↩︎

  63. FALSE. CV reduces the effect of overfitting, but at the end of the day, you are still building a model on the dataset at hand, and it is possible that you will overfit that dataset.↩︎

  64. N1 = 10, N2 = 180, N3 = 20↩︎

  65. My method achieves a test error lower than all previous methods! (Footnote: When variables are chosen so as to min CV error.)↩︎

  66. blue because it is farther from the line y=x↩︎

  67. False Pos Rate which we want low↩︎

  68. A↩︎

  69. B↩︎

  70. C and D↩︎

  71. TRUE↩︎

  72. TRUE↩︎

  73. FALSE↩︎

  74. FALSE↩︎

  75. FALSE↩︎

  76. TRUE (the coefficient values are forced to be zero)↩︎

  77. FALSE (the null model can exist within the full model because there is flexibility in the values of the coefficients)↩︎

  78. There are many ways to find a good model. Also, c. there is no end to the fun.↩︎

  79. positive↩︎

  80. positive↩︎

  81. positive↩︎

  82. negative↩︎

  83. underestimate the parameter↩︎

  84. there is no censoring at time \(t_i\)↩︎

  85. there are no deaths at time \(t_i\)↩︎

  86. ~0.65↩︎

  87. can’t tell because they cross (and also because d. the p-value is big)↩︎

  88. Because the row and column totals are fixed.↩︎

  89. log rank is more likely to reject \(H_0\) when \(H_0\) is false.↩︎

  90. the instantaneous rate of the event↩︎

  91. the last observation was censored. The hazard cannot be computed because the width of the last time interval is unknown; that is, we don’t know when the last event time would be.↩︎

  92. are assumed to have the same survival chances as uncensored observations↩︎

  93. hazard ratio↩︎

  94. relative risk↩︎

  95. negative↩︎

  96. zero↩︎

  97. the ln ratio of the hazard curves is not linear with respect to BP↩︎

  98. a. Is used to analyze survival data when individuals in the study are followed for varying lengths of time, and c. Assumes that the relative hazard for a particular variable is always constant.↩︎

  99. the hazard function changes as a function of weight which is also changing over time↩︎

  100. the treatment produces short term benefits only↩︎

  101. variable selection↩︎

  102. FALSE, we’d expect 5% of all null hypotheses to be rejected↩︎

  103. 1 – P(type II error)↩︎

  104. P(data | \(H_0\) is true)↩︎

  105. Type I errors↩︎

  106. Uniformly (equal prob) on 0 to 1↩︎

  107. (# p-values > 0.5) * 2↩︎

  108. Round it to 1↩︎

  109. Assign the value of the m^th adjusted p-value to the (m+1)^th adjusted p-value↩︎

  110. the number of null discoveries out of the total discoveries↩︎

  111. all of the above↩︎

  112. Storey (q-values)↩︎
