Homework 2

# note that you can load ISCAM.RData to get the iscambinomprob function
load(url("http://www.rossmanchance.com/iscam3/ISCAM.RData"))

Note:
Homework (HW) problems are posted at: http://www.rossmanchance.com/iscam3/instructors.html
Practice Problems (PP) are in the textbook at the end of the investigations

Practice Problem 1.8A (no technology except calculator)

For which of the earlier studies we have investigated would a normal approximation be appropriate? Explain.

Friend or Foe
Do Names Match Faces
Hospital Mortality Rates (10 cases)
Hospital Mortality Rates (361 patients)
Kissing the Right Way
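Although this problem is meant to be done without technology, the check behind it can be sketched in R. The helper below is hypothetical (it is not part of ISCAM), and the threshold of 10 expected successes and 10 expected failures is an assumed rule of thumb; use whatever criterion your text states. The sample sizes and probabilities are made up and do not correspond to the studies listed above.

```r
# Hypothetical helper (not an ISCAM function): checks the assumed rule of
# thumb that the expected numbers of successes and failures are both >= 10.
normal_approx_ok <- function(n, pi0, threshold = 10) {
  expected_successes <- n * pi0
  expected_failures  <- n * (1 - pi0)
  expected_successes >= threshold && expected_failures >= threshold
}

# Made-up values, not the studies from the investigations
small_study <- normal_approx_ok(n = 12,  pi0 = 0.5)   # expects 6 and 6
large_study <- normal_approx_ok(n = 200, pi0 = 0.25)  # expects 50 and 150
c(small_study = small_study, large_study = large_study)
```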

In the investigation, you took samples of 25 candies. Between what two values should 95% of the sample proportions fall when \(\pi=0.5\)?
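As a rough sketch of the arithmetic involved, the lines below apply the "within 2 standard deviations" rule to the sample proportion, assuming the normal approximation is reasonable. The values of `n` and `pi0` are made up on purpose (they are not the candy numbers), so substitute your own.

```r
# Made-up values for illustration (not the candy investigation)
n   <- 100
pi0 <- 0.4

# Standard deviation of the sample proportion under the normal approximation
sd_phat <- sqrt(pi0 * (1 - pi0) / n)

# Roughly 95% of sample proportions fall within 2 SDs of pi0
lower <- pi0 - 2 * sd_phat
upper <- pi0 + 2 * sd_phat
c(lower = lower, upper = upper)
```

Replacing the multiplier 2 with `qnorm(0.975)` gives the slightly narrower interval based on the exact normal quantile.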

Chapter 1 HW 7

For informal sports events, players often play “rock-paper-scissors” to decide who serves first or who is the home team etc. Two players simultaneously show one of the three objects. The player showing the dominant object (e.g., rock beats scissors) wins. The optimal strategy is to alternate among the three objects. Play the game rock/paper/scissors against the computer using the website: http://www.nytimes.com/interactive/science/rock-paper-scissors.html. Select the novice version of the computer to play against. Play for at least 30 rounds, but keep going for as long as you’d like. Keep track of which option you choose (rock or paper or scissors) for every round that you play (the computer will record this for you but that information will soon scroll off the screen, so make your own notes). Try to recreate how you would play against a person and don’t view your prior results when making your next selection.

Identify the observational units in this study.
Identify the variable of interest.

An article published in College Mathematics Journal (Eyler, Shalla, Doumaux, & McDevitt, 2009) found that players tend to not prefer scissors, choosing it less than 1/3 of the time. We will investigate whether your data suggest that you tend to choose scissors less than one-third of the time.

Calculate the statistic in this study and create a bar graph of your results. (In R, you can do this with just the “summarized” data; you don’t have to enter the individual outcomes. See page 33 in your text.) Are your results in the direction conjectured by these researchers (choosing scissors less than 1/3 of the time)?
Define the parameter of interest in this study.
State appropriate null and alternative hypotheses about this parameter according to the theory suggested in the CMJ article.
Explain how you could use an ordinary six-sided die to simulate a what-if distribution under this null hypothesis. Be sure to indicate what each possible outcome of the die (1, 2, 3, 4, 5, 6) would represent.
Based on your sample results (each student will be different), are you convinced that you choose scissors less than one-third of the time in the long run? Clearly explain your reasoning.
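The questions above can be explored with a short base R sketch. Everything here is an assumption for illustration: the counts are made up (not real game results), the seed is arbitrary, and the simulation uses `rbinom` on the computer in place of physically rolling a die. It shows a bar graph built from summarized counts and an approximate p-value from a simulated what-if distribution.

```r
set.seed(58)  # arbitrary seed so the sketch is reproducible

# Made-up summarized counts for 40 rounds (not real data)
counts   <- c(rock = 17, paper = 14, scissors = 9)
n        <- sum(counts)
observed <- counts[["scissors"]]
p_hat    <- observed / n          # observed proportion of scissors

# Bar graph straight from the summarized counts
barplot(counts / n, ylab = "Proportion of rounds",
        main = "Choices (made-up data)")

# What-if distribution: number of scissors in n rounds when pi = 1/3
sim_counts <- rbinom(10000, size = n, prob = 1/3)

# Approximate p-value: how often a simulated count is this small or smaller
sim_p_value <- mean(sim_counts <= observed)
sim_p_value
```

Because the p-value comes from simulation, it changes slightly with a different seed or number of repetitions.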

# Remember this function? The numbers below are placeholders, not the values for the problem at hand.
# See page 29 in your text.
iscambinomprob(10, 200, .1, lower.tail=TRUE)

## Probability 10 and below = 0.00807125

## [1] 0.00807125

Just for fun… let’s do a little bit of this by hand (in case that type of question comes up on the exam). Assume that you only choose scissors twice out of 30 trials. What is your p-value? (Perform the calculation by hand. Feel free to use R as a calculator, but don’t use a binomial function. Note that in R, * means multiply and ^ means “to the power of”.)
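To show the shape of such a by-hand calculation, here is a sketch with deliberately different numbers: an assumed 1 scissors out of 20 rounds, testing \(\pi = 1/3\) against \(\pi < 1/3\). The p-value is the probability of 0 or 1 scissors, built from the binomial formula with R used only as a calculator. The final `pbinom` line is there purely to check the arithmetic.

```r
# Made-up numbers (not the 2-out-of-30 case from the problem)
n   <- 20
pi0 <- 1/3

# P(X = 0): every one of the n rounds is "not scissors"
p0 <- (1 - pi0)^n

# P(X = 1): n ways to place the single scissors among the n rounds
p1 <- n * pi0 * (1 - pi0)^(n - 1)

# Lower-tail p-value: P(X <= 1)
p_value <- p0 + p1
p_value

# Check only: should agree with the built-in binomial function
all.equal(p_value, pbinom(1, n, pi0))
```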

Chapter 1 HW 8

A statistics class at Cal Poly collected data on a well-known campus legend. Each student was asked which of the four tires they would name if they had to make up which tire on their car had recently gone flat. The prior conjecture is that more students than would be expected due to chance alone would pick the right front tire. In this class, 24 of 54 students chose the right front tire (identified in advance as the tire that people tend to pick out of the four). You will conduct a test of whether these data provide evidence that Cal Poly students tend to choose the right front tire more often than would be expected if the four tire choices were equally likely. [Complete data: 15 = left front, 8 = left rear, 24 = right front, 7 = right rear]

(a) Identify the observational units and variable in this study. Also classify the variable as categorical or quantitative. If the variable is categorical, also indicate whether it is binary.
(b) State the appropriate null and alternative hypotheses, in symbols and in words.
(c) Use technology to produce a bar graph of the student responses. Submit this graph, and comment on what it reveals.
(d) Use R to determine the (exact binomial) p-value for the test of your hypotheses in (b).
(e) Write a sentence describing what this p-value is the probability of.
(f) Write a couple of sentences summarizing the conclusion that you would draw from this analysis and also explaining the reasoning process that underlies your conclusion.
(g) Suppose that another statistics class conducts this same study in their own class, which has exactly half as many students. Suppose further that this class obtains the same proportion of students choosing the right front tire. Determine the exact p-value in this case. Describe how the p-value and your conclusion would differ for this class of 27 students compared to the first class of 54 students, and comment on why this makes intuitive sense.
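An exact binomial p-value for an upper-tail test can be computed in base R without the ISCAM workspace. The sketch below uses made-up counts (30 of 80, then 15 of 40, against a null value of 0.25), not the class data, and the helper name `upper_tail_p` is hypothetical. It also shows how the same sample proportion gives a different p-value when the sample is half the size.

```r
# Hypothetical helper: exact P(X >= k) when X ~ Binomial(n, pi0)
upper_tail_p <- function(k, n, pi0) {
  # P(X >= k) is the same as P(X > k - 1)
  pbinom(k - 1, size = n, prob = pi0, lower.tail = FALSE)
}

pi0 <- 0.25  # null value when four choices are equally likely

# Made-up counts with the same sample proportion (0.375) at two sample sizes
p_large <- upper_tail_p(k = 30, n = 80, pi0 = pi0)
p_small <- upper_tail_p(k = 15, n = 40, pi0 = pi0)
c(n80 = p_large, n40 = p_small)

# binom.test() reports the same one-sided exact p-value
p_check <- binom.test(30, 80, p = pi0, alternative = "greater")$p.value
```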

Chapter 1 HW 9

Reconsider the previous exercise, where you tested whether sample data suggest that more than 25% of a population would answer “right front” when asked to name a tire that had gone flat. Suppose that you read of a study in which 30% of a random sample answered “right front.”

(a) What further information would you require to assess whether this sample result constitutes strong evidence that more than 25% of the population would answer “right front”? Also explain why this information is needed.
(b) Determine the p-value of the test for the following sample sizes, in each case supposing that the sample proportion answering “right front” is 0.3: n = 10, 50, 250, 500. (Feel free to use technology, but explain what you ask the technology to do in each case.)
(c) Determine the smallest sample size for which this sample result would be statistically significant at the 0.05 level.
(d) Repeat (c) for the 0.01 level.
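One way to search for a smallest significant sample size is to compute the exact p-value over a grid of sample sizes and take the first one at or below the significance level. The sketch below uses made-up values (a sample proportion of 0.4 against a null value of 0.3), not the numbers in this exercise. It also assumes only sample sizes that make the number of successes a whole number, here multiples of 5, which is a simplification you may want to handle differently.

```r
# Made-up values (not the 0.30 vs 0.25 setting of the exercise)
pi0   <- 0.3
alpha <- 0.05

# Sample sizes for which a sample proportion of 0.4 is a whole count
n <- seq(5, 500, by = 5)
k <- 2 * n / 5              # number of successes when p-hat = 0.4

# Exact upper-tail p-value P(X >= k) for each sample size
p_values <- pbinom(k - 1, size = n, prob = pi0, lower.tail = FALSE)

# Smallest sample size on the grid with p-value at or below alpha
n_min <- min(n[p_values <= alpha])
n_min
```

Changing `alpha` to 0.01 reruns the same search at the stricter level.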

Math 58B - Introduction to Biostatistics

Jo Hardin

Spring 2017
