Sidebar Menu

Paired t-test ( paired t-test ) is one method of testing the hypothesis where the data used is not independent which is characterized by the existence of a value relationship in each of the same samples ( pairs ).). The characteristics that are most often found in paired cases are that one individual (object of research) is subjected to 2 different treatments. Although using the same individual, the researchers still obtained 2 kinds of sample data, namely data from the first treatment and data from the second treatment. The first treatment may be in the form of control, which is not giving any treatment at all to the object of research. 

For example, in research on the effectiveness of a particular drug, in the first treatment, the researcher applies control, while in the second treatment, then the object of research is subjected to a certain action, such as drug administration. Thus, the performance of the drug can be known by comparing the condition of the object of research before and after being given the drug. Another example is a diet program where weight measurements are weighed before and after the diet. Another example that can be considered a pair even though there are 2 objects of research, for example the difference between the height of the father and his son.

A full discussion of the t-student test for paired data can be read in the following document.

Hyphothesis

 

Test Type:

 

Two-Tailed

Right Tailed

Left Tailed

 

H0 : μd = 0

H0 : μd = 0

H0 : μd = 0

 

HA : μd ≠ 0

HA : μd > 0

HA : μd < 0

Decision:

     

Reject H0 if:

|t| > tα/2,df

t > t α,df

t < -t α,df

Assumption

  1. The sample data contains paired elements
  2. Samples taken randomly
  3. Normally Distributed

Statistical Test

 $$ t=\frac{\bar{d}-\mu_d}{\frac{s_d}{\sqrt n}}$$

or if μd = 0, then:
 $$ t=\frac{\bar{d}}{\frac{s_d}{\sqrt n}}$$

Where is the degree of freedom (df) = n-1

d = difference between each individual/object in pairs

μd = the average value of the difference d of the population from the entire data pair, usually 0.

 $\bar{d}$ = the average value of d

sd = standard deviation value from d

n = number of data pairs

Before performing data analysis with paired t-tests, we first test whether the two data spread normally or not. The test statistics used are the Lilliefors (Kolmogorov-Smirnov) normality test.

Normality test hypothesis:

  • H0 : Data spread normally
  • H1 : Data does not spread normally

On paired observations:

  • Inter-sample or unit struggles were performed before the experiment began on the basis of expectation that if there was no effect of treatment then both groups gave the same response, and
  • Source of variance from the outside is omitted, so the calculation of the value of the critique is based on the variance of differences between groups and not on the variance between individuals in each sample.

Example 1

The National Health Survey and Nutrition Testing, organized by the Department of Health and Public Services, examined the difference between self-reported and directly measured heights of some women between the ages of 12-16. Self-reported height and measured height data are presented in the Table below.

No.

Reported height

Measured height

d

1

53

58.1

-5.1

2

64

62.7

1.3

3

61

61.1

-0.1

4

66

64.8

1.2

5

64

63.2

0.8

6

65

66.4

-1.4

7

68

67.6

0.4

8

63

63.5

-0.5

9

64

66.8

-2.8

10

64

63.9

0.1

11

64

62.1

1.9

12

67

68.5

-1.5

average

-0.475

sd

1.980874

  1. Is there enough evidence to support the allegation that there is a difference between the reported height and the measured height of women between the ages of 12-16? Use a significant level of 0.05.
  2. Create a confidence interval with a 95% confidence level between the average difference in reported height and the measured height. 

Manual Calculation (Classical/Traditional Method)

 

Statistics

n

12

 $\bar{{d}}$

-0.475

sd

1.980874

  1. Step 1: Claim: the alleged difference between the reported height and the measured height can be denoted by: μd ≠ 0, if the conjecture is false, then μ1 = μ2
  2. Step 2: From the two equations above, the μd = 0 contains an element of equality, thus becoming a null hypothesis and alternative hypothesis (H1) μ1 ≠ μ2.
  3. H0: μd = 0
  4. H1: μd ≠ 0
  5. Significant level: α = 0.05
  6. Determine the statistical test: the sample is randomly taken from the paired object so that the appropriate statistical test is a paired test.
  7. Calculate tstat and determine tcritical:
  8.  $ t=\frac{\bar{d}-\mu_d}{\frac{s_d}{\sqrt n}}=\frac{-0.475-0}{\frac{1.981}{\sqrt{12}}}=-0.83$
  9. Determine thetcritica with df = 12 and α = 0.05. From the t-student table obtained a critical t-value for a two-tailed test = ±2. 201 (sign due to Two-Tailed test±)
  10. Because |tstat| < |tcritical|= |-0.83| < | ±2.201|, then H0 is welcome!
  11. From the results of the t-test, it was concluded that at a significant level of 5%, there was not enough evidence to state a difference between the self-reported height and the height measured by the Department.

  1. 95% confidence interval for average difference: Determine its Margin of Error Value (E)

 $$ E=t_{\alpha/2}\frac{s_d}{\sqrt n}=2.201.\frac{1.981}{\sqrt{12}}=1.258596$$

The confidence interval is calculated based on the following formula:

 $$\begin{matrix}\bar{d}-E<\mu_d<\bar{d}+E\\-0.475-1.259<\mu_d<-0.475+1.259\\-1.734<\mu_d<0.784\\\end{matrix}$$

That is, 95% of the average difference in the sample will lie in those intervals that in fact contain the average difference from the actual population.

Since the limit of the trust interval contains a value of 0, there is not enough evidence to support the allegation that there is a difference between the two heights.

Modern Methods:

The above method is a statistical test by traditional methods. Tests with modern methods use p-value in determining whether or not a statistical test is significant.

If: P-Value < Significant Level then the significant test or H0 is rejected

If: P-Value > Significant Level then the non-significant test or H0 is accepted

If the P-Value value < 0.01, then the test is very significant!

Calculations by using the SmartstatXL Excel Add-In

Graphical user interface, application Description automatically generated

Analysis Results

Table Description automatically generated

Example 2

The effectiveness of a Tutoring in the face of a Test is assessed based on a comparison between the grades obtained by students before and after implementing the course. The score was obtained from 10 students who took the Entrance Preparatory Test Tutoring (based on data from the College Board and "An Analysis of the Impact of Commercial Test Preparation Courses on SAT Scores," by Sesnowitz, Bernhardt, and Knain, American Educational Research Journal, Vol. 19, No. 3.)

  1. Is there enough evidence to conclude that Bimbel is effective in improving scores (test scores)? Test at a significant level of 0.05.
  2. Determine the 95% confidence interval for the average difference between before and after following the Tutoring. Write down the statement and interpret the result.

Student

A

B

C

D

E

F

G

H

I

J

before

700

840

830

860

840

690

830

1180

930

1070

after

720

840

820

900

870

700

800

1200

950

1080

d

-20

0

10

-40

-30

-10

30

-20

-20

-10

average d

-11

sd

20.24846

Manual Calculation (Classical/Traditional Method)

 

Statistics

n

10

 $\bar{{d}}$

-11

sd

20.24846

  1. Step 1: Claim: the allegation that after attending the Bimbel program can increase the score of the exam result can be denoted by: μd < 0, if the allegation is false, then μ1 ≥ 0. (Why is the claim not symbolized μd > 0? If the value afterbimbel is greater, then the difference (d) must be negative, so it is symbolized by μd < 0 )
  2. Step 2: From the two equations above, the μd ≥ 0 contains an element of equality, thus becoming a null hypothesis and alternative hypothesis(H1) μd < 0.
  3. H0: μd = 0
  4. H1: μd < 0
  5. Significant level: α = 0.05
  6. Determine the statistical test: the sample is randomly taken from the paired object so that the appropriate statistical test is a paired test.
  7. Calculate tstat and determine tcritical:
  8.  $ t=\frac{\bar{d}-\mu_d}{\frac{s_d}{\sqrt n}}=\frac{-11-0}{\frac{20.248}{\sqrt{10}}}=-1.718$
  9. Determine tcritica with df = 9 and α = 0.05. From the t-student table obtained the tcritica value for the one tailed test = -1.833 (minus sign due to the left-tailed test)
  10. Because |tstat| < |tcritical|= |-1.718| < |-1.833|, then H0 is accepted!
  11. From the results of the t-test, it was concluded that at a significant level of 5%, there was not enough evidence to state that test preparation by taking a course (Bimbel) would be effective in improving test scores.

  1. 95% confidence interval for the average difference:

Specify the Margin of Error Value (E)

 $$ E=t_{\alpha/2}\frac{s_d}{\sqrt n}=2.262\times\frac{20.248}{\sqrt{10}}=14.484$$

The confidence interval is calculated based on the following formula:

 $$\begin{matrix}\bar{d}-E<\mu_d<\bar{d}+E\\-11-14.484<\mu_d<-11+14.484\\-25.484<\mu_d<3.484\\\end{matrix}$$

That is, 95 % we believe that the average value of the difference from the actual population lies at an interval between 25.48 to 3.48.

Exercise 1

A researcher studied the influence of lighting on a Lucerne flower plant on different environmental conditions. Researchers took 10 plants that were fresh with flowers that were illuminated without hindrance at the top and hidden flowers at the bottom. Then, data on the number of seeds at each location were collected (Torrie, 1980). 

Sample Number

Upper flowers

Lower flowers

1

4.0

4.4

2

5.2

3.7

3

5.7

4.7

4

4.2

2.8

5

4.8

4.2

S

3.9

4.3

7

4.1

3.5

8

3.0

3.7

9

4.6

3.1

10

6.8

1.9

Hypothesis test there is no difference between the average population (H0) and its counterpart (the flowers located at the top produce more seeds, H1). Use a significant level of 0.05.

 

Calculations by using the SmartstatXL Excel Add-In:

Graphical user interface, application Description automatically generated

Analysis Results

Table Description automatically generated

Calculations using MINITAB Software v.19:

Graphical user interface, text, application, email Description automatically generated

Interpretation:

Hypothesis:

  1. H0: μd = 0
  2. H1: μd > 0

Classic Method

tstat = tstat = T-Value = 1.98

tcritica = ttable = t(0.05.9) = 1,833 (obtained from the table value t-student)

Because |tstat| > |tcritical| 1.98 > 1,833 then H0 is rejected!

That is to say: 95% of us believe that flowers located at the top produce more seeds compared to the flowers that are at the bottom of them.

Modern Methods:

Tests with modern methods use p-value in determining whether or not a statistical test is significant.

If: P-Value < Significant Level then the significant test or H0 is rejected

If: P-Value > Significant Level then the non-significant test or H0 is accepted

In the above case, P-Value = 0.040 which is a smaller value than the value of α = 0.05. This indicates that the test is significant or H0 is rejected.

If the P-Value value < 0.01, then the test is very significant!

Exercise 2

As tested at a significant level of 0.01, is there a difference in the concentration of sugar in red clover nectar stored for 8 hours at two different pressures (4.4 mmHg and 9.9 mmHg)? Sugar concentration data are presented in the following Table.

Table of Sugar Content of red clover nectar (Torrie, 1980)

Sample Number

Pressure 4.4 mmHg

Pressure 9.9 mmHg

1

62.5

51.7

2

65.2

54.2

3

67.6

53.3

4

69.9

57.0

5

69.4

56.4

S

70.1

61.5

7

67.8

57.2

8

67.0

56.2

9

68.5

58.4

10

62.4

55.8

  1. Hypothesis
  2. H0: μ1 = μ2
  3. H1: μ1 ≠ μ2
  4. Significant level: α = 0.01

Calculations by using the SmartstatXL Excel Add-In:

Graphical user interface, application Description automatically generated

Analysis Results

Table Description automatically generated

Calculations using MINITAB Software v.19:

Graphical user interface, text, application, email Description automatically generated

Interpretation:

Hypothesis:

  1. H0: μd = 0
  2. H1: μd ≠ 0

Classic Method

tstat = tstat = T-Value = 15.46

tcritica = ttable = t(0.0 1.9) = 3,250 (Two-Tailed at significant level 1%)

Because |tstat| > |tcritical| 15.46 > 3,250 then H0 is rejected!

This shows that there is a very significant difference between the two pressures on sugar levels.

Modern Methods:

Tests with modern methods use p-value in determining whether or not a statistical test is significant.

If: P-Value < Significant Level then the significant test or H0 is rejected

If: P-Value > Significant Level then the non-significant test or H0 is accepted

In the above case, P-Value = 0.000 which is a smaller value than the value of α = 0.01. This indicates that the test is very significant or H0 is rejected (P-Value value < 0.01)