Independent T-Test for Two Samples (Equal Variance)

Details: Written by Ade Setiawan; Category: Student's t-Test; 13 Jan

In one sample t-test we only compare a population with a certain value, but in reality cases using this type of test are very rare. Researchers, particularly in agriculture, are more concerned with cases that require comparisons between two conditions or two population averages.

Before we do the analysis, we must first consider whether the two populations come from a normal distribution, are the variance of two populations equal? This will guide us in choosing the right method and formula for conducting a t-test analysis to compare the two population mean values.

A full discussion of the t-student test can be read in the following document.

Hypothesis

	Test Type:
	Two-Tailed	Right Tailed	Left Tailed
	H₀ : μ₁ = μ₂	H₀ : μ₁ = μ₂	H₀ : μ₁ = μ₂
	H_A : μ₁ ≠ μ₂	H_A : μ₁ > μ₂	H_A : μ₁ < μ₂
Decision:
Reject H₀ if:	\|t\| > t_α/2,df	t > t _α,df	\|t\| > \|-t _α,df \|

Description: |t| = absolute value of t

When the variance of the two populations is equal (σ₁ = σ₂ = σ) and the value of the σ is not known, the value is approached with its approximate value, that is, the average deviation of the example, s. Since both populations have an s value, the s used is a combined s (s_p) of the two populations:

Standard deviation of the population:

$$\sigma_{{\bar{y}}_1-{\bar{y}}_2}=\sigma_p\sqrt{\frac{1}{n_1}+\frac{1}{n_2}}$$

Standard deviation of sample population (Approximate):

$$s_{{\bar{y}}_1-{\bar{y}}_2}=s_p\sqrt{\frac{1}{n_1}+\frac{1}{n_2}}$$

Where is the combined standard deviation:

$$s_p=\sqrt{\frac{(n_1-1)s_1^2+(n_2-1)s_2^2}{n_1+n_2-2}}$$

Statistical Test

$$ t=\frac{{\bar{y}}_1-{\bar{y}}_2}{s_{{\bar{y}}_1-{\bar{y}}_2}}$$

$$ t=\frac{{\bar{y}}_1-{\bar{y}}_2}{s_p\sqrt{\frac{1}{n_1}+\frac{1}{n_2}}}$$

with $ df=n_1+n_2-2$

Example 1: One Tailed test

Computer Programming courses are given to two different classes of students. Class A, which consists of 12 students, is taught by the usual method. Meanwhile, class B consisting of 10 students is taught with a new teaching method. At the end of the semester, classes A and B are given the same exam materials. In class A, the average score of students is 85 with a standard deviation of 4, and in class B the average score is 81 with a standard deviation of 5. Are you sure that the usual teaching method remains better than the new teaching method with a significant degree of 0.01? It is assumed that two populations are close to the normal distribution with the equal variance.

Answer

Sample A: n₁ = 12; ${\bar{y}}_1$= 85; s₁ = 4; and Sample B: n₂ = 10; ${\bar{y}}_2$ = 81; s₂ = 5.

Step 1: Claim: The old teaching method remains better, symbolically it can be expressed by μ₁ > μ₂ vs μ₁ = μ₂
Step 2: From the two equations above, the μ₁ = μ₂ contains an element of equality, thus becoming the hypothesis of null and alternative hypothesis (H₁) μ₁ > μ₂.
H₀: μ₁ = μ₂
H₁: μ₁ > μ₂
Significant level : α = 0.01
Determine the statistical test: the sample is taken randomly; n < 30 (but assumed to be normally distributed) and σ₁ = σ₂ = σ. From these conditions, the appropriate statistical test is an independent t-test with a homogeneous variance.
Calculate t_stat and determine t_critical:
The combined standard deviation:

$$s_p=\sqrt{\frac{\left(n_1-1\right){s_1}^2+\left(n_2-1\right){s_2}^2}{n_1+n_2-2}}=\sqrt{\frac{\left(11\right)4^2+\left(9\right)5^2}{20}}=\sqrt{20,05}=4.478$$

$$ t=\frac{{\bar{y}}_1-{\bar{y}}_2}{s_p\sqrt{\frac{1}{n_1}+\frac{1}{n_2}}}=\frac{85-80}{4.478\sqrt{\frac{1}{12}+\frac{1}{10}}}=2.09$$

df = n₁ + n₂ -2 = 12+10-2 = 20; α = 0.01
t_critica = t_(0.01.20) = 2.528
Since t stats < t_critical then H₀ is accepted
From the results of the t-test, it was concluded that at a significant level of 1%, the ordinary teaching method is no different from the new teaching method. If the cost of the teaching method ischeaper, we can choose this method, because the quality is no different compared to ordinary teaching methods.

Example 2: Two Tailed Test [Weight gain of Holstein calves (Torrie, 1980)]

A total of 28 Holstein cows, grouped into two, the first group of 14 cows were given vitamin A, while the second group was not given vitamin A as a control. To find out the difference in the weight of the two groups of cows, a statistical test t with 2 free (independent) samples was used. Data on the weight gain of cows treated with vitamin A are presented in Table 1:

Table 1.1. Weight gain of Holstein cows due to the administration of vitamin A

Number	Cow weight(lb)
Number	Control	Vitamin A
1	175	142
2	132	311
3	218	337
4	151	262
5	200	302
6	219	195
7	234	253
8	149	199
9	187	236
10	123	216
11	248	211
12	206	176
13	179	249
14	206	214

Manual Calculation (Classical/Traditional Method)

Sample A: n₁ = 14; ${\bar{y}}_1$ =187.6429; s₁ = 38.09827; and

Sample B: n₂ = 14; ${\bar{y}}_2$ =235.928685; s₂ = 54.28623.

Step 1: Claim: There is a difference between the two treats, symbolically it can be expressed by μ₁ ≠ μ₂ vs μ₁ = μ₂
Step 2: From the two equations above, the μ₁ = μ₂ contains an element of equality, thus becoming the null hypothesis and alternative hypothesis (H₁) μ₁ ≠ μ₂.
H₀: μ₁ = μ₂
H₁: μ₁ ≠ μ₂
Significant level : α = 0.05
Determine the statistical test: the sample is taken randomly; n < 30; normally distributed (Kolmogorov-Smirnov/Shapiro-Wilk Test) and σ₁ = σ₂ = σ (Levene Test). From these conditions, the appropriate statistical test is an independent t-test with a homogeneous variance.
Calculate t_stat and determine t_critical:
The combined standard deviation:

$$\begin{matrix}s_p=\sqrt{\frac{\left(n_1-1\right){s_1}^2+\left(n_2-1\right){s_2}^2}{n_1+n_2-2}}=\sqrt{\frac{\left(13\right)38.10^2+\left(13\right)54.29^2}{26}}\\=\sqrt{2199.2365}=46.896\\\end{matrix}$$ $$\begin{matrix}t=\frac{{\bar{y}}_1-{\bar{y}}_2}{s_p\sqrt{\frac{1}{n_1}+\frac{1}{n_2}}}=\frac{187.6429-235.9286}{46.896\sqrt{\frac{1}{14}+\frac{1}{14}}}\\=\frac{-48.2857}{17.725}=-2.724153\\\end{matrix}$$

df = n₁ + n₂ -2 = 14+14-2 = 26; α = 0.05
t_critica = t_(α/2; _df) = t_(0.025.26) = 2.056
Because |t_stat| >|t_critica|= |-2.73| > 2.056, then H₀ is rejected!
From the results of the t-test, it was concluded that at a significant level of 5%, the weight between the group of cows that were not given vitamin A (control) and those given Vitamin A was different.

Modern Methods:

The above method is a statistical test by traditional methods. Tests with modern methods use p-value in determining whether or not a statistical test is significant.

If: P-Value < Significant Level then the significant test or H₀ is rejected

If: P-Value > Significant Level then the non-significant test or H₀ is accepted

If the P-Value value < 0.01, then the test is very significant!

Calculations by using the SmartstatXL Add-In:

Descriptive Statistics and Test for Homogeneity Variance

Normality Test

T-test

Calculation using SPSS software v.16:

Group Statistics
	Group	N	Mean	Std. Deviation	Std. Error Mean
Treatment	Control	14	187.6429	38.09827	10.18219
Treatment	Vitamin A	14	235.9286	54.28623	14.50860

Tests of Normality
	Group	Kolmogorov-Smirnov^a			Shapiro-Wilk
	Group	Statistics	Df	Sig.	Statistics	Df	Sig.
Treatment	Control	.127	14	.200*	.964	14	.793
Treatment	Vitamin A	.143	14	.200*	.973	14	.915
a. Lilliefors Significance Correction
*. This is a lower bound of the true significance.

Independent Samples Test
		Levene's Test for Equality of Variances		t-test for Equality of Means
		F	Sig.	t	Df	Sig. (2-tailed)	Mean Difference	Std. Error Difference	95% Confidence Interval of the Difference
		F	Sig.	t	Df	Sig. (2-tailed)	Mean Difference	Std. Error Difference	Lower	Upper
Treatment	Equal variances assumed	1.339	.258	-2.724	26	.011	-48.28571	17.72503	-84.72003	-11.85140
Treatment	Equal variances not assumed			-2.724	23.306	.012	-48.28571	17.72503	-84.92613	-11.64530

Normality Test

Kolmogorov-Smirnov^a or Shapiro-Wilk is used to test the assumption of whether the sample taken is normally distributed or not. This assumption is necessary before performing the t-test. The t-test can only be performed if the sample is normally distributed.

H₀ = normally distributed sample;

H₁ = non-normally distributed sample.

Conclusion:

H₀ rejected: significant (usually p < 0.20) = non-normally distributed sample
H₀ accepted: not significant (usually p > 0.20) = normally distributed sample

The normality test (Kolmogorov-Smirnov^a/ Shapiro-Wilk) for both groups was not significant. Kolmogorov-Smirnov test for the Control group = p > 0.2, and Vitamin A = p > 0.2. This shows that the two samples are normally distributed.

Homogeneity test:

The Levene test was used to test the homogeneity of the variance between the two populations (groups).

H₀ = homogeneous variance.
H_A = not all flavors are the same.

Levene Test Conclusion:

H₀ is rejected which means the Levene Test is significant (usually p < 0.10) = unequal variance (σ₁ ≠ σ₂)
H₀ is accepted which means the Levene Test is not significant (usually p > 0.10) = homogeneous variance (σ₁ = σ₂)

Pay attention to the calculation results using the SPSS software above. There are two variance assumptions in the output of the Independent Samples Test, namely Equal variance assumsed, and not assumses. The results of the Levene Test showed that the two populations had a homogeneous variance (p = 0.258 > 0.10), so the t test used was a test assuming a homogeneous variance (Equal variance assumed). This Hal suggests that both cow samples are from the same population.

Conclusion:

Hypothesis:

H₀: μ₁ = μ₂
H₁: μ₁ ≠ μ₂

Classic Method

t_stat = t_stat = T-Value = -2.724

t_critica= t_table = t_{(0.05.9) = 2,056} (obtained from the table value t-student)

Since |t_stat| > |t_critical| = 2,724 > 2,056 then H0 is rejected!

Modern Methods:

Tests with modern methods use p-value in determining whether or not a statistical test is significant.

If: P-Value < Significant Level then the significant test or H₀ is rejected

If: P-Value > Significant Level then the non-significant test or H₀ is accepted

Since its signification value (p-value or Sig. = 0.011) < 0.05, then H₀ is rejected and H_A is accepted. This states that there are differences between the group of cows given vitamin A and the control group.

In the output the Table also includes the value of the confidence interval.

P(-84.72 < μ₁ - μ₂ < -11.85) = 95%,

That is to say: 95% we are sure that the average difference between the two lies in the range between -84.72 and -11.85.

Last Updated: 23 August 2022

Sidebar Menu

Main Menu EN