Session:10 Hypothesis Testing with Two Samples

Solutions

Introductory Business Statistics | Leadership Development – Micro-Learning Session

Rice University 2020 | Michael Laverty, Colorado State University Global Chris Littel, North Carolina State University| https://openstax.org/details/books/introductory-business-statistics

1

two proportions

3

matched or paired samples

5

single mean

7

independent group means, population standard deviations and/or variances unknown

9

two proportions

11

independent group means, population standard deviations and/or variances unknown

13

independent group means, population standard deviations and/or variances unknown

15

two proportions

17

The random variable is the difference between the mean amounts of sugar in the two soft drinks.

19

means

21

two-tailed

23

the difference between the mean life spans of White and non-White people

25

This is a comparison of two population means with unknown population standard deviations.

27

Check student’s solution.

28

  1. Cannot accept the null hypothesis
  2. p-value < 0.05
  3. There is not enough evidence at the 5% level of significance to support the claim that life expectancy in the 1900s is different between White and non-White people.

31

POS1 – POS2 = difference in the proportions of phones that had system failures within the first eight hours of operation with OS1 and OS2.

34

proportions

36

right-tailed

38

The random variable is the difference in proportions (percents) of the populations that are of two or more races in Nevada and North Dakota.

40

Our sample sizes are much greater than five each, so we use the normal for two proportions distribution for this hypothesis test.

42

  1. Cannot accept the null hypothesis.
  2. p-value < alpha
  3. At the 5% significance level, there is sufficient evidence to conclude that the proportion (percent) of the population that is of two or more races in Nevada is statistically higher than that in North Dakota.

44

The difference in mean speeds of the fastball pitches of the two pitchers

46

–2.46

47

At the 1% significance level, we can reject the null hypothesis. There is sufficient data to conclude that the mean speed of Rodriguez’s fastball is faster than Wesley’s.

49

Subscripts: 1 = Food, 2 = No Food
H0:μ1μ2

0:12
Ha:μ1>μ2

:1>2 

51

Subscripts: 1 = Gamma, 2 = Zeta
H0:μ1=μ2

0:1=2
Ha:μ1μ2

:12 

53

There is sufficient evidence so we cannot accept the null hypothesis. The data support that the melting point for Alloy Zeta is different from the melting point of Alloy Gamma.

54

the mean difference of the system failures

56

With a p-value 0.0067, we can cannot accept the null hypothesis. There is enough evidence to support that the software patch is effective in reducing the number of system failures.

60

H0μd ≥ 0

Haμd < 0

63

We decline to reject the null hypothesis. There is not sufficient evidence to support that the medication is effective.

65

Subscripts: 1: two-year colleges; 2: four-year colleges

  1. H0:μ1μ2
    0:12
     
  2. Ha:μ1<μ2
    :1<2
     
  3. X¯¯¯1X¯¯¯2
    ¯1¯2
     

    is the difference between the mean enrollments of the two-year colleges and the four-year colleges.

  4. Student’s-t
  5. test statistic: -0.2480
  6. p-value: 0.4019
  7. Check student’s solution.
    1. Alpha: 0.05
    2. Decision: Cannot reject
    3. Reason for Decision: p-value > alpha
    4. Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the mean enrollment at four-year colleges is higher than at two-year colleges.

67

Subscripts: 1: mechanical engineering; 2: electrical engineering

  1. H0:μ1μ2
    0:12
     
  2. Ha:μ1<μ2
    :1<2
     
  3. X¯¯¯1X¯¯¯2
    ¯1¯2
     

    is the difference between the mean entry level salaries of mechanical engineers and electrical engineers.

  4. t108
  5. test statistic: t = –0.82
  6. p-value: 0.2061
  7. Check student’s solution.
    1. Alpha: 0.05
    2. Decision: Cannot reject the null hypothesis.
    3. Reason for Decision: p-value > alpha
    4. Conclusion: At the 5% significance level, there is insufficient evidence to conclude that the mean entry-level salaries of mechanical engineers is lower than that of electrical engineers.

69

  1. H0:μ1=μ2
    0:1=2
     
  2. Ha:μ1μ2
    :12
     
  3. X¯¯¯1X¯¯¯2
    ¯1¯2
     

    is the difference between the mean times for completing a lap in races and in practices.

  4. t20.32
  5. test statistic: –4.70
  6. p-value: 0.0001
  7. Check student’s solution.
    1. Alpha: 0.05
    2. Decision: Cannot accept the null hypothesis.
    3. Reason for Decision: p-value < alpha
    4. Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the mean time for completing a lap in races is different from that in practices.

71

  1. H0:μ1=μ2
    0:1=2
     
  2. Ha:μ1μ2
    :12
     
  3. is the difference between the mean times for completing a lap in races and in practices.
  4. t40.94
  5. test statistic: –5.08
  6. p-value: zero
  7. Check student’s solution.
    1. Alpha: 0.05
    2. Decision: Cannot accept the null hypothesis.
    3. Reason for Decision: p-value < alpha
    4. Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the mean time for completing a lap in races is different from that in practices.

74

c

76

Test: two independent sample means, population standard deviations unknown.

μ1

1 = the mean price of a sociology text on the selected site.

μ2

2 = the mean price of a math/science text on the selected site.

Random variable: X1¯¯¯¯X1¯¯¯¯

1¯1¯ = the difference in the sample mean textbook price between sociology texts and math/science texts.

Hypotheses: H0 : μ1μ2 = 0, Ha : μ1  μ2 < μ2

0 : 12 = 0,  : 1  2 < 2 which can be expressed as H0s: μ1μ2, Ha μ1 < μ2

H0s: μ1μ2, Ha μ1 < μ2.

Distribution for the test: Use tdf

; because each sample has more than 30 observations, df=n1+n22=33+332=64

=1+22=33+332=64.

Estimate the critical value on the t

-table using the nearest available degrees of freedom, 60. The critical value, 2.660, is found in the .0005 column.

Calculate the test statistic: tc=(X¯¯¯1X¯¯¯2)0s12n2+s22n2=(74.64111.56)049.36233+66.90233=2.55

=(¯1¯2)0122+222=(74.64111.56)049.36233+66.90233=2.55.

Using a calculator with tc=2.55

=2.55 and df=64

=64, the left-tailed p

-value: Decision: Reject H0

0. Conclusion: At the 1% level of significance, from the sample data, there is sufficient evidence to conclude that the mean price of sociology textbooks is less than the mean price of textbooks for math/science.

78

d

80

  1. H0PW = PB
  2. HaPW ≠ PB
  3. The random variable is the difference in the proportions of White and Black suicide victims, aged 15 to 24.
  4. normal for two proportions
  5. test statistic: –0.1944
  6. p-value: 0.8458
  7. Check student’s solution.
    1. Alpha: 0.05
    2. Decision: Cannot accept the null hypothesis.
    3. Reason for decision: p-value > alpha
    4. Conclusion: At the 5% significance level, there is insufficient evidence to conclude that the proportions of White and Black female suicide victims, aged 15 to 24, are different.

82

Subscripts: 1 = Cabrillo College, 2 = Lake Tahoe College

  1. H0:p1=p2
    0:1=2
     
  2. Ha:p1p2
    :12
     
  3. The random variable is the difference between the proportions of Hispanic students at Cabrillo College and Lake Tahoe College.
  4. normal for two proportions
  5. test statistic: 4.29
  6. p-value: 0.00002
  7. Check student’s solution.
    1. Alpha: 0.05
    2. Decision: Cannot accept the null hypothesis.
    3. Reason for decision: p-value < alpha
    4. Conclusion: There is sufficient evidence to conclude that the proportions of Hispanic students at Cabrillo College and Lake Tahoe College are different.

84

a

85

Test: two independent sample proportions.

Random variable: p1 – p2

Distribution:
H0:p1=p2

0:1=2
Ha:p1p2

:12 

The proportion of eReader users is different for the 16- to 29-year-old users from that of the 30 and older users.

Graph: two-tailed

87

Test: two independent sample proportions

Random variable: p′1 − p′2

Distribution:

H0:p1=p2

0:1=2
Ha:p1>p2

:1>2 

A higher proportion of tablet owners are aged 16 to 29 years old than are 30 years old and older.

Graph: right-tailed

Do not reject the H0.

Conclusion: At the 1% level of significance, from the sample data, there is not sufficient evidence to conclude that a higher proportion of tablet owners are aged 16 to 29 years old than are 30 years old and older.

89

Subscripts: 1: men; 2: women

  1. H0:p1p2
    0:12
     
  2. Ha:p1>p2
    :1>2
     
  3. P1P2
    12
     

    is the difference between the proportions of men and women who enjoy shopping for electronic equipment.

  4. normal for two proportions
  5. test statistic: 0.22
  6. p-value: 0.4133
  7. Check student’s solution.
    1. Alpha: 0.05
    2. Decision: Cannot reject the null hypothesis.
    3. Reason for Decision: p-value > alpha
    4. Conclusion: At the 5% significance level, there is insufficient evidence to conclude that the proportion of men who enjoy shopping for electronic equipment is more than the proportion of women.

91

  1. H0:p1=p2
    0:1=2
     
  2. Ha:p1p2
    :12
     
  3. P1P2
    12
     

    is the difference between the proportions of men and women that have at least one pierced ear.

  4. normal for two proportions
  5. test statistic: –4.82
  6. p-value: zero
  7. Check student’s solution.
    1. Alpha: 0.05
    2. Decision: Cannot accept the null hypothesis.
    3. Reason for Decision: p-value < alpha
    4. Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the proportions of males and females with at least one pierced ear is different.

92

  1. H0µd = 0
  2. Haµd > 0
  3. The random variable Xd is the mean difference in work times on days when eating breakfast and on days when not eating breakfast.
  4. t9
  5. test statistic: 4.8963
  6. p-value: 0.0004
  7. Check student’s solution.
    1. Alpha: 0.05
    2. Decision: Cannot accept the null hypothesis.
    3. Reason for Decision: p-value < alpha
    4. Conclusion: At the 5% level of significance, there is sufficient evidence to conclude that the mean difference in work times on days when eating breakfast and on days when not eating breakfast has increased.

94

Subscripts: 1 = boys, 2 = girls

  1. H0:μ1μ2
    0:12
     
  2. Ha:μ1>μ2
    :1>2
     
  3. The random variable is the difference in the mean auto insurance costs for boys and girls.
  4. normal
  5. test statistic: z = 2.50
  6. p-value: 0.0062
  7. Check student’s solution.
    1. Alpha: 0.05
    2. Decision: Cannot accept the null hypothesis.
    3. Reason for Decision: p-value < alpha
    4. Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the mean cost of auto insurance for teenage boys is greater than that for girls.

96

Subscripts: 1 = non-hybrid sedans, 2 = hybrid sedans

  1. H0:μ1μ2
    0:12
     
  2. Ha:μ1<μ2
    :1<2
     
  3. The random variable is the difference in the mean miles per gallon of non-hybrid sedans and hybrid sedans.
  4. normal
  5. test statistic: 6.36
  6. p-value: 0
  7. Check student’s solution.
    1. Alpha: 0.05
    2. Decision: Cannot accept the null hypothesis.
    3. Reason for decision: p-value < alpha
    4. Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the mean miles per gallon of non-hybrid sedans is less than that of hybrid sedans.

98

  1. H0µd = 0
  2. Haµd < 0
  3. The random variable Xd is the average difference between husband’s and wife’s satisfaction level.
  4. t9
  5. test statistic: t = –1.86
  6. p-value: 0.0479
  7. Check student’s solution
    1. Alpha: 0.05
    2. Decision: Cannot accept the null hypothesis, but run another test.
    3. Reason for Decision: p-value < alpha
    4. Conclusion: This is a weak test because alpha and the p-value are close. However, there is insufficient evidence to conclude that the mean difference is negative.

99

p-value = 0.1494

At the 5% significance level, there is insufficient evidence to conclude that the medication lowered cholesterol levels after 12 weeks.

103

Test: two matched pairs or paired samples (t-test)

Random variable: Xd

 

Distribution: t12

H0μd = 0 Haμd > 0

The mean of the differences of new female breast cancer cases in the south between 2013 and 2012 is greater than zero. The estimate for new female breast cancer cases in the south is higher in 2013 than in 2012.

Graph: right-tailed

p-value: 0.0004

Decision: Cannot accept H0

Conclusion: At the 5% level of significance, from the sample data, there is sufficient evidence to conclude that there was a higher estimate of new female breast cancer cases in 2013 than in 2012.

105

Test: matched or paired samples (t-test)

Difference data: {–0.9, –3.7, –3.2, –0.5, 0.6, –1.9, –0.5, 0.2, 0.6, 0.4, 1.7, –2.4, 1.8}

Random Variable: Xd

 

Distribution: H0μd = 0 Haμd < 0

The mean of the differences of the rate of underemployment in the northeastern states between 2012 and 2011 is less than zero. The underemployment rate went down from 2011 to 2012.

Graph: left-tailed.

Decision: Cannot reject H0.

Conclusion: At the 5% level of significance, from the sample data, there is not sufficient evidence to conclude that there was a decrease in the underemployment rates of the northeastern states from 2011 to 2012.

107

e

109

d

111

f

113

e

115

f

117

a

LEARN | GROW | LEAD

Access Your Leadership Academy!

Evolutionary

Leadership Academy

Leadership

Excellence Academy

Leadership

On the Go

Audiobooks

Leadership

On the Go

Courses

Go

LEARN | GROW | LEAD

Access Your Leadership Academy!

Evolutionary

Leadership Academy

Leadership

Excellence Academy

Leadership

On the Go

Audiobooks

Leadership

On the Go

Courses