Stat 230 Summer 2014 Final Exam Page 1: Answer All 30 Questi

Stat 230 Summer 2014 Final Exampage 1please Answer All 30 Q

Stat 230 Summer 2014 – Final Exam Please answer all 30 questions. Make sure your answers are as complete as possible. Show all of your work and reasoning. In particular, when there are calculations involved, you must show how you come up with your answers with critical work and/or necessary tables. Answers that come straight from program software packages will not be accepted. You must include the Honor Pledge on the title page of your submitted final exam. Exam submitted without the Honor Pledge will not be accepted. Honor Pledge: "I have completed this final examination myself, working independently and not consulting anyone except the instructor. I have neither given nor received help on this final examination." Use the information below to answer Questions 1 through 3. Given a sample size of 34, with sample mean 660.3 and sample standard deviation 104.9, we perform the following hypothesis test. Since n>30, this is a Z test. Null Hypothesis 0 : 700H ï  Alternative Hypothesis : 700aH ï  1. What is the test statistic? What is the p-value? 2. At a 5% significance level (95% confidence level), what is the critical value(s) in this test? Do we reject the null hypothesis? 3. What are the border values of x between acceptance and rejection of this hypothesis? Stat 230 Summer 2014 – Final Exam Questions 4 through 7 involve rolling of dice. 4. Given a fair, six-sided die, what is the probability of rolling the die twice and getting a “1” each time? 5. What is the probability of getting a “1” on the second roll when you get a “1” on the first roll? 6. The House managed to load the die in such a way that the faces “2” and “4” show up twice as frequently as all other faces. Meanwhile, all the other faces still show up with equal frequency. What is the probability of getting a “5” when rolling this loaded die? 7. Write the probability distribution for this loaded die, showing each outcome and its probability. Use the data in the table to answer Questions 8 through 9. x y . Determine SSxx, SSxy, and SSyy. 9. Find the equation of the regression line. What is the predicted value when 4?x  Stat 230 Summer 2014 – Final Exam Use the data below to answer Questions 10 through 12. A group of students from three universities were asked to pick their favorite college sport to attend of their choice: The results, in number of students, are listed as follows: Football Basketball Soccer Maryland Duke UCLA Supposed that a student is randomly selected from the group mentioned above. 10. What is the probability that the student is from UCLA or chooses football? 11. What is the probability that the student is from Duke, given that the student chooses basketball? 12. What is the probability that the student is from Maryland and chooses soccer? Use the information below to answer Questions 13 and 15. There are 4000 mangoes in a shipment. It is found that it a mean weight of 15 ounces with a standard deviation of 2 ounces. 13. How many mangoes have weights between 14 ounces and 16 ounces? 14. What is the probability that a randomly selected mango weighs less than 14 ounces? Stat 230 Summer 2014 – Final Exam a. What is the probability that the 100 randomly selected mangoes have a mean weight less than 14 ounces? b. Do you come up with the same result in Question 14? Why or why not? 16. Suppose that in a box of 20 iPhone devices, there are 5 with defective antennas. In a draw without replacement, if 3 iPhone devices are picked, what is the probability that all 3 have defective antennas? Use the information below to answer Questions 17 and 18. Benford's law, also called the first-digit law, states that in lists of numbers from many (but not all) real-life sources of data, the leading digit is distributed in a specific, non-uniform way shown in the following table. Leading Digit Distribution of Leading Digit (%) 30.1 17.6 12.5 9.7 7.9 6.7 5.8 5.1 4.6 The owner of a small business would like to audit its account payable over the past year because of a suspicion of fraudulent activities. He suspects that one of his managers is issuing checks to non-existing vendors in order to pocket the money. There have been 790 checks written out to vendors by this manager. The leading digits of these checks are listed as follow: Leading Digits. Suppose you are hired as a forensic accountant by the owner of this small business, what statistical test would you employ to determine if there is fraud committed in the issuing of checks? What is the test statistic in this case? Stat 230 Summer 2014 – Final Exam level)? Do the data provide sufficient evidence to conclude that there is fraud committed? Hypothesis Test versus Confidence Interval – Questions 19 through 21 Random samples of size n1=55 and n2 = 65 were drawn from populations 1 and 2 , respectively. The samples yielded Ì‚ðŸ =. 🕠ð’‚ð’ð’… Ì‚ðŸ =. ðŸ”. Test Ho: (p1-p2) = 0 against Ha: (p1-p2) >0 using α = .05. 19. Perform a hypothesis test of p1 = p2 with a 5% significance level (95% confidence level). 20. Obtain a 95% confidence interval estimate of p1 - p2. 21. Do you come up with the same conclusion for Question 19 and Question 20? Why or why not? Hardness of Gem – Questions 22 and 23 Listed below are measured hardness indices from three different collections of gemstones. Collection Hardness Indices i x 2 i s A 9.3 9.3 9.3 8.6 8.7 9.3 9..91 0.10 B 8.7 7.7 7.7 8.7 8.2 9.0 7.4 7..03 0.60 C 7.2 7.9 6.8 7.4 6.5 6.6 6.7 6.5 6.5 7.1 6.7 5.5 7.3 6.82 0.34 You are also given that 7.99x  . 22. What is the test statistic? Stat 230 Summer 2014 – Final Exam collections have the same mean hardness. 24. The probability that an individual egg in a carton of eggs is cracked is 0.03. You have picked out a carton of 1 dozen eggs (that’s 12 eggs) at the grocery store. Determine the probability that at most one of the eggs in the carton are cracked. 25. In a group lineup of 7 models in a commercial, 3 are male and 4 are female. In how many ways can you arrange 3 models in a lineup if the first and the third must be a male but the second one must be a female? 26. Pair Sample from Population 1 (observation 1) Sample from Population 2 (observation The data for a random sample of six paired observations are shown in the table above. Stat 230 Summer 2014 – Final Exam a) Compute ð‘‘ Ì… and Sd b) Express µd in terms of µ1 and µ2. c) Form a 95% confidence interval for µd. d) Test Ho: µd = 0 against Ha: µd ≠0. Use α = .. Peter, Paul, Mary, John and Martha are members of the pastoral council at a local church. They are to be seated at one side of a long conference table in a pastoral council meeting. a) How many possible ways can these 5 council members be seated? b) How many possible sitting arrangements are there if only gender is considered in the process? 28. How many social robots would need to be sampled in order to estimate the proportion of robots designed with legs, no wheels to within .075 of its true value with 99% confidence. Given that a random sample of 106 robots showed that 63 were designed with legs, no wheels. 29. Composite sampling is a way to reduce laboratory testing costs. A public health department is testing for possible fecal contamination in public swimming pools. In this case, water samples from 5 pools are combined for one test, and further testing is performed only if the combined sample shows fecal contamination. Based on past experience, there is a 3% chance of finding fecal contamination in a public swimming area. What is the probability that a combined sample from 5 swimming pools has fecal contamination? Recall that P(A) +P(not A) = 1.0. 30 A random sample of five accidents resulted in the following number of persons injured: 18, 15, 12, 19, & 21. Using the .01 significance level, can we conclude the population mean is less than 20 for all accidents? a) State Ho and Ha? b) Test statistic = ? c) Critical value = ? d) Reject Ho: (yes or no)

Paper For Above instruction

This comprehensive statistical analysis addresses multiple facets of inferential statistics, probability, regression, hypothesis testing, and sampling methods based on the questions provided. The paper aims to synthesize these concepts through detailed calculations, explanations, and applications relevant to real-world scenarios, maintaining academic rigor throughout.

1. Hypothesis Testing for Mean Weight of a Sample

Given a sample size of 34, sample mean of 660.3, and standard deviation of 104.9, the goal is to test if the population mean differs from 700. Because the sample size exceeds 30, a z-test is appropriate. The test statistic formula is:

\[ z = \frac{\bar{x} - \mu_0}{s/\sqrt{n}} \]

Plugging in the values:

\[ z = \frac{660.3 - 700}{104.9/\sqrt{34}} \approx \frac{-39.7}{17.97} \approx -2.21 \]

The p-value associated with z = -2.21 (two-tailed) is approximately 0.027. Since 0.027

2. Critical Values at 5% Significance Level

For a two-tailed test at α=0.05, the critical values are ±1.96. Because the computed z = -2.21 is less than -1.96, the decision is to reject the null hypothesis.

3. Border Values of the Sample Mean

The acceptance region for the sample mean is between:

\[ \bar{x} = \mu_0 \pm z_{crit} \times \frac{s}{\sqrt{n}} = 700 \pm 1.96 \times 17.97 \]

\[ \Rightarrow 700 \pm 35.23 \]

Thus, the acceptance region for the sample mean is approximately between 664.77 and 735.23.

4-7. Probabilities and Distribution of a Loaded Die

For a fair six-sided die, the probability of rolling a 1 twice is:

\[ P = \left(\frac{1}{6}\right)^2 = \frac{1}{36} \approx 0.0278. \]

The probability of getting a 1 on the second roll, given that a 1 was rolled first (independent events), remains:

\[ P(Second=1 | First=1) = P(Second=1) = \frac{1}{6}. \]

For the loaded die where faces 2 and 4 are twice as likely, the total probability sums to 1:

\[ 5p + 2p (for face 2) + 2p (for face 4) + p (for others) = 1. \]

Assuming all other faces have probability p, we set:

\[ p + p + 2p + 2p + p + p = 1 \Rightarrow 8p = 1 \Rightarrow p = \frac{1}{8}. \]

Consequently:

- Face 2 and 4 probability: \( 2 \times \frac{1}{8} = \frac{1}{4} \).

- Other faces probability: \( \frac{1}{8} \).

Therefore, probability of rolling a 5:

\[ P(5) = \frac{1}{8} = 0.125. \]

The distribution:

| Outcome | Probability |

|---|---|

| 1 | 1/8 = 0.125 |

| 2 | 1/4 = 0.25 |

| 3 | 1/8 = 0.125 |

| 4 | 1/4 = 0.25 |

| 5 | 1/8 = 0.125 |

| 6 | 1/8 = 0.125 |

8 & 9. Regression Analysis

Using the summarized table data with variables x and y, the sums of squares are calculated as follows:

- Sum of \( x \), sum of \( y \), sum of \( xy \), and sum of \( x^2 \) are first computed.

- The formulas for sums of squares:

\[ SS_{xx} = \sum (x_i - \bar{x})^2 \]

\[ SS_{xy} = \sum (x_i - \bar{x})(y_i - \bar{y}) \]

\[ SS_{yy} = \sum (y_i - \bar{y})^2 \]

Assuming these are executed, the regression line is derived:

\[ \hat{y} = b_0 + b_1 x \]

where:

\[ b_1 = \frac{SS_{xy}}{SS_{xx}}, \quad b_0 = \bar{y} - b_1 \bar{x} \]

Predicted value at \( x=4 \):

\[ \hat{y} = b_0 + b_1 \times 4. \]

10-12. Probability Calculations for University Sports

Total students: sum across universities and sports.

- UCLA or football:

\[ P(UCLA \cup Football) = P(UCLA) + P(Football) - P(UCLA \cap Football). \]

Assuming independence:

\[ P(UCLA) = \frac{\text{UCLA students}}{\text{total}}, \quad P(Football) = \frac{\text{Maryland students}}{\text{total}}. \]

- Given that a student chooses basketball, probability from Duke:

\[ P(Duke | Basketball) = \frac{P(Duke \cap Basketball)}{P(Basketball)}. \]

- Maryland and soccer:

\[ P(Maryland \cap Soccer) = \frac{\text{Maryland students who chose soccer}}{\text{total}}. \]

13 & 14. Mango Weight Analysis

Number of mangoes with weights between 14 and 16 ounces:

\[ Z = \frac{14.5 - 15}{2} = -0.25, \quad Z=0.25 \]

Probability using standard normal table:

\[ P(14 \leq X \leq 16) = P(-0.25 \leq Z \leq 0.25) \approx 0.5987. \]

Probability less than 14 ounces:

\[ Z = -0.5, \quad P(Z

Mean of 100 mangoes:

\[ \bar{X} \sim N(\mu, \sigma/\sqrt{n}) \]

\[ \Rightarrow P(\bar{X}

\[

Z = \frac{14 - 15}{2/\sqrt{100}} = \frac{-1}{0.2} = -5,

\]

whose probability is practically zero.

15. Comparing Probabilities

Similarly, the probability that the mean weight of 100 mangoes is less than 14 approaches zero, consistent with the individual mango probability due to the Central Limit Theorem.

16. Probability of All Defective in Sample

Number of defective antennas:

\[ P(\text{all 3 are defective}) = \frac{\binom{5}{3} \times \binom{15}{0}}{\binom{20}{3}} = \frac{ \binom{5}{3} }{\binom{20}{3}} \approx 0.014. \]

17 & 18. Benford’s Law Testing

Applying chi-square goodness-of-fit test between observed leading digits and expected distribution per Benford's law:

\[

\chi^2 = \sum \frac{(O_i - E_i)^2}{E_i}

\]

where \( O_i \) are observed counts and \( E_i \) are expected counts based on Benford's proportions. Comparing the test statistic to the critical value determines whether fraudulent activity exists.

19-21. Comparing Two Population Proportions

- Testing: \( H_0: p_1 = p_2 \); \( H_a: p_1 > p_2 \).

- Standard error:

\[ SE = \sqrt{p(1-p)(1/n_1 + 1/n_2)} \]

with pooled proportion:

\[ p = \frac{x_1 + x_2}{n_1 + n_2} \]

- Confidence interval:

\[ (p_1 - p_2) \pm z_{0.975} \times SE. \]

Consistent conclusions are obtained because confidence intervals that exclude zero coincide with significance test rejections.

22 & 23. Gem Hardness Test

Using the mean hardness indices and standard deviations, the t-test compares the means of the three collections. Given the data, the test statistic calculates as:

\[

t = \frac{\bar{x}_A - \mu_{pop}}{s / \sqrt{n}}

\]

24. Eggs Cracked Probability

Number of eggs cracked follows a binomial distribution:

\[

P(\text{at most 1 cracked}) = P(0) + P(1) = \