16 mayΒ

1

Questions 1-4 are based on the following information:

An insurance company analyst is interested in analyzing the dollar value of damage in

automobile accidents. She collects data from 115 accidents, and records the amount of

damage as well as the age of the driver. The results of her regression analysis are listed

below.

1. The regression equation is:

A) π¦= 10725.802 + 69.964x

B) π¦= 114 + 113x

C) π¦ = 1535.215 + 34.625x

D) π¦ = 113 + 114x

2. How would you best explain the y-intercept in this situation?

A) For each additional 1-year increase in the age of the driver, we would expect

damage to increase by $10,726.

B) For each additional 1-year increase in the age of the driver, we would expect

damage to increase by $70.

C) It makes no sense to explain the intercept in this situation, since we cannot

have a driver with an age of zero.

D) The average amount of damage was $10,726.

3. What would be the dollar value of an accident involving a 25-year-old driver?

A) $11,836.56

B) $10,795.47

C) $13,372.58

D) $12,474.90

Practice Final Exam

2

4. Which of the following statements is the best explanation of the R2?

A) 3.5% of the accident damage can be explained by the age of the driver.

B) 3.5% of the variation in accident damage can be explained by variation in the

age of the driver.

C) 3.5% of the coefficients, t-stat, and p-value can be explained by the age of the

driver.

D) 3.5% of the total error can be explained by the SSE.

5. The residual is defined as the difference between the actual value of:

A) y and the estimated value of y.

B) x and the estimated value of x.

C) y and the estimated value of x.

D) x and the estimated value of y.

Questions 6-10 are based on the following information:

A sales manager is interested in determining the relationship between the amount spent

on advertising and total sales. The manager collects data for the past 24 months and runs

a regression of sales on advertising expenditures. The results are presented below but,

unfortunately, some values identified by asterisks are missing.

6. What are the degrees of freedom for residuals?

A) 21

B) 22

C) 23

D) 24

7. The value of mean square error (residual) is:

A) 1678.9

B) 1,554.25

C) 1,493.63

D) 1,407.35

Practice Final Exam

3

8. The total degrees of freedom is:

A) 21

B) 22

C) 23

D) 24

9. A regression analysis between sales (in $1000) and advertising (in $100) resulted in

the following least squares line: π¦= 75 + 5x. This implies that if advertising is $800,

then the predicted amount of sales (in dollars) is:

A) $4075

B) $115,000

C) $164,000

D) $179,000

10. In a regression problem, a coefficient of determination 0.90 indicates that:

A) 90% of the y values are positive.

B) 90% of the variation in y can be explained by the regression line.

C) 90% of the x values are equal.

D) 90% of the variation in x can be explained by the regression line.

11. If the least squares regression line π¦= -2.88 + 1.77x and the coefficient of

determination is 0.81, the coefficient of correlation is:

A) -0.88

B) +0.88

C) +0.90

D) -0.90

12. In a simple regression analysis (where Y is a dependent and X an independent

variable), if the Y intercept is positive, then

A) there is a positive correlation between X and Y

B) if X is increased, Y must also increase

C) if Y is increased, X must also increase

D) None of these alternatives is correct.

13. In regression analysis, the independent variable is

A. used to predict other independent variables

B. used to predict the dependent variable

C. called the intervening variable

D. the variable that is being predicted

Practice Final Exam

4

14. In a regression analysis if SSE = 200 and SSR = 300, then the coefficient of

determination is

A. 0.6667

B. 0.6000

C. 0.4000

D. 1.5000

15. Regression analysis was applied between sales (in β¬1000) and advertising (in β¬100)

and the following regression function was obtained.

π¦= 500 + 4 x

Based on the above estimated regression line if advertising is β¬10,000, then the

point estimate for sales (in euros) is

A. β¬900

B. β¬900,000

C. β¬40,500

D. β¬505,000

16. A regression analysis between demand (Y in 1000 units) and price (X in euros)

resulted in the following equation

π¦= 9 – 3 x

The above equation implies that if the price is increased by β¬1, the demand is

expected to

A. increase by 6 units

B. decrease by 3 units

C. decrease by 6,000 units

D. decrease by 3,000 units

17. Regression analysis was applied between sales (Y in β¬1,000) and advertising (X in

β¬100), and the following estimated regression equation was obtained.

π¦= 80 + 6.2 x

Based on the above estimated regression line, if advertising is β¬10,000, then the

point estimate for sales (in euros) is

A. β¬62,080

B. β¬142,000

C. β¬700

D. β¬700,000

Questions 18 and 19 are based on the following information:

A large milling machine produces steel rods to certain specifications. The machine is

considered to be running normally if the standard deviation of the diameter of the rods is

at most 0.15 millimeters. The line supervisor needs to test the machine is for normal

Practice Final Exam

5

functionality. The quality inspector takes a sample of 25 rods and finds that the sample

standard deviation is 0.19.

18. What are the null and alternative hypotheses for the test?

a. H0 : Ο2 > 0.15 and H1 : Ο2 β€ 0.15

b. H0 : Ο β₯ 0.15 and H1 : Ο < 0.15
c. H0 : Ο = 0.0225 and H1 : Ο β 0.0225
d. H0 : Ο2 β€ 0.0225 and H1 : Ο2 > 0.0225

19. Which of the following statements is the most accurate?

a. Reject null hypothesis at Ξ± = 0.025

b. Fail to reject null hypothesis at Ξ± β€ 0.10

c. Reject null hypothesis at Ξ± = 0.05

d. Reject null hypothesis at Ξ± = 0.01

20. Suppose you have the following null and alternative hypotheses: H0 : Ο2 = 34.5 and

H1 : Ο2 > 34.5. If you take a random sample of 15 observations from a normally

distributed population and find that s2 = 48.1, what is the most accurate statement

that can be made about the p-value for this test?

a. 0.01 < p-value < 0.025
b. 0.025 < p-value < 0.05
c. p-value < 0.01
d. 0.05 < p-value < 0.10

21. A sample of 20 cans of tomato juice showed a standard deviation of 10.0 grams. A

95% confidence interval estimate of the variance for the population is

a. 144.6 to 533.3

b. 139.0 to 495.3

c. 55.6 to 198.1

d. 57.8 to 213.3

22. The producer of bottling equipment claims that the variance of all their filled bottles

is 0.027 or less. A sample of 30 bottles showed a standard deviation of 0.2. The p-

value for the test is

a. between 0.025 to 0.05

b. between 0.05 to 0.01

c. 0.05

d. 0.025

Practice Final Exam

6

23. The chi-squared value for a one-tailed (upper tail) hypothesis test of the population

standard deviation at 95% confidence and a sample size of 25 is

a. 33.196

b. 36.415

c. 39.364

d. 37.652

24. We are interested in testing a claim that the standard deviation of a population is no

more than 5.0. The null hypothesis for this test is

a. Ho: Ο2 > 25.0

b. Ho: Ο2 β€ 25.0

c. Ho: Ο2 β€ 5.0

d. Ho: Ο2 β₯ 5.0

25. The value of F0.025 with 15 numerator and 10 denominator degrees of freedom is

a. 3.06

b. 2.54

c. 2.85

d. 3.52

Questions 26 and 27 are based on the following information:

The standard deviation of the ages of a sample of 16 executives from large companies

was 8.2 years; while the standard deviation of the ages of a sample of 26 executives from

small companies was 12.8 years.

26. What is the test statistic to test if there any difference in the standard deviations of the

ages of all the large-company and small-company executives.

a. 2.32

b. 2.25

c. 2.21

d. 3.19

27. We would like to test to see if there is any difference in the standard deviations of the

ages of all the large-company and small-company executives. What is the critical

value of F at 10% significance level?

a. 2.12

b. 2.10

c. 2.21

d. 3.10

Practice Final Exam

7

Questions 28-30 are based on the following information:

Suppose two food preservatives are extensively tested and determined safe for use in

meats. A processor wants to compare the preservatives for their effects on retarding

spoilage. Suppose 16 cuts of fresh meat are treated with preservative A and another 12

cuts of meat are treated with preservative B. The number of hours until spoilage begins is

recorded for each of the 28 cuts of meat. The results are summarized in the table below.

Preservatives A Preservatives B

Sample means 95.25 100.50

Sample standard deviation 13.45 10.55

28. The value of the test statistic for determining if there is a difference in the population

variances for preservatives A and B is equal to:

e. 0.784

f. 1.625

g. 1.275

h. 1.129

29. The numerator and denominator degrees of freedom associated with the test statistic

for determining if there is a difference in the population variances for preservatives A

and B are, respectively:

i. 16 and 12

j. 12 and 16

k. 15 and 11

l. 11 and 15

30. The most accurate statement that can be made about the p-value for testing whether

there is a difference in the population variances for preservatives A and B is:

a. p-value > 0.05

b. p-value = 0.00

c. p-value < 0.01
d. 0.01 < p-value < 0.05

31. An ANOVA procedure is used for data that was obtained from four sample groups

each comprised of five observations. The degrees of freedom for the critical value of

F are

a. 3 and 20

b. 3 and 16

c. 4 and 17

d. 3 and 19

32. The analysis of variance is a procedure that allows statisticians to compare three or

more population:

a. Means.

b. Proportions.

c. Variances.

d. Standard deviations.

Practice Final Exam

8

33. In analysis of variance, between-treatments variation is measured by the:

a. SSE

b. SSTR

c. SST

d. All of the above

Questions 34-38 are based on the following information:

Consider the following ANOVA table:

Source of Variation SS df MS F

Treatments SSTR=4 2 ? ?

Error SSE= ? ? ?

Total SST=34 14

34. What are the degrees of freedom for the Errors?

a. 3

b. 4

c. 13

d. 12

35. What is the value of SSE?

a. 29

b. 20

c. 30

d. 33

36. What is the Mean-Square for Treatments?

a. 2

b. 1

c. 4

d. 0

37. What is Mean-Square Errors?

a. 3

b. 4

c. 3.5

d. 2.5

38. What is the value of the F-statistic?

a. 0.5

b. 0.8

c. 1.2

d. 1.6

Practice Final Exam

9

Answer Key

1 A

2 C

3 D

4 B

5 A

6 B

7 A

8 C

9 B

10 B

11 C

12 D

13 B

14 B

15 B

16 D

17 D

18 D

19 C

20 A

21 D

22 A

23 B

24 B

25 D

26 B

27 C

28 B

29 C

30 A

31 B

32 A

33 B

34 D

35 C

36 A

37 D

38 B

1

Chapter11

β’ Interval Estimation of a Population Variance

π β 1 π !

π!

!

! β€ π

! β€

π β 1 π !

π

!!!!

!

where π! values are based on a chi-squared distribution with (n-1) degrees of freedom.

β’ Hypothesis Testing about the Variances of Two Populations

Test Statistic

π =

πππ

πππ

The critical value πΉ! is based on an F distribution with π! β 1 (numerator) and π! β 1

(denominator) degrees of freedom

Chapter 13

Test for the Equality of k Population Means

ππππ πππ’πππ πππππ‘ππππ‘π =

ππ’π ππ πππ’πππ πππππ‘ππππ‘π

πππππππ ππ πππππππ πππππ‘ππππ‘

ππππ πππ’πππ πΈππππ =

ππ’π ππ πππ’πππ πΈππππ

πππππππ ππ πππππππ πΈππππ

πΉ =

ππππ πππ’πππ πππππ‘ππππ‘π

ππππ πππ’πππ πΈππππ

The critical value πΉ!is based on an F distribution with k-1 numerator degrees of freedom and n-k

denominator degrees of freedom.

Chapter 14 (Simple Linear Regression)

b 1 =

!!! ! (!!! !)

!!! ! Β²

b1: slope coefficient in a simple regression line

b0: intercept coefficient (coefficient of the constant) in a simple regression line

π¦: estimated value of the dependent variable y

b0 = π¦ β b!x

π¦ = b0 + b 1 x (estimated equation)

2

πππΈ = (π! β π)Β²

SST = SSR + SSE βΉ

(π! β π)Β² = SST = ( π βπ)Β²+ (π! β π)Β²

π π’π ππ π ππ’ππππ ππ₯πππππππ ππ¦ ππππππ π πππ = πππ = ( π βπ)Β²

total variation in Y = SST = (π! β π)Β²

R2 = !!”

!!”

=coefficient of determination,

sample correlation coefficient = π!,! = π Β² (carry the sign of b1)

tstat = b1 / se(b1), se(b1):standard error of the slope coefficient,

residual = error = Y-π

1

Chapter11

β’ Interval Estimation of a Population Variance

π β 1 π !

π!

!

! β€ π

! β€

π β 1 π !

π

!!!!

!

where π! values are based on a chi-squared distribution with (n-1) degrees of freedom.

β’ Hypothesis Testing about the Variances of Two Populations

Test Statistic

π =

πππ

πππ

The critical value πΉ! is based on an F distribution with π! β 1 (numerator) and π! β 1

(denominator) degrees of freedom

Chapter 13

Test for the Equality of k Population Means

ππππ πππ’πππ πππππ‘ππππ‘π =

ππ’π ππ πππ’πππ πππππ‘ππππ‘π

πππππππ ππ πππππππ πππππ‘ππππ‘

ππππ πππ’πππ πΈππππ =

ππ’π ππ πππ’πππ πΈππππ

πππππππ ππ πππππππ πΈππππ

πΉ =

ππππ πππ’πππ πππππ‘ππππ‘π

ππππ πππ’πππ πΈππππ

The critical value πΉ!is based on an F distribution with k-1 numerator degrees of freedom and n-k

denominator degrees of freedom.

Chapter 14 (Simple Linear Regression)

b 1 =

!!! ! (!!! !)

!!! ! Β²

b1: slope coefficient in a simple regression line

b0: intercept coefficient (coefficient of the constant) in a simple regression line

π¦: estimated value of the dependent variable y

b0 = π¦ β b!x

π¦ = b0 + b 1 x (estimated equation)

2

πππΈ = (π! β π)Β²

SST = SSR + SSE βΉ

(π! β π)Β² = SST = ( π βπ)Β²+ (π! β π)Β²

π π’π ππ π ππ’ππππ ππ₯πππππππ ππ¦ ππππππ π πππ = πππ = ( π βπ)Β²

total variation in Y = SST = (π! β π)Β²

R2 = !!”

!!”

=coefficient of determination,

sample correlation coefficient = π!,! = π Β² (carry the sign of b1)

tstat = b1 / se(b1), se(b1):standard error of the slope coefficient,

residual = error = Y-π