Two Sample T Test Calculator

Calculator Inputs

Input mode

Variance method

Alternative hypothesis

Hypothesized difference

Confidence level (%)

Alpha for decision

Decimal places

Sample 1 name

Sample 2 name

Sample 1 size

Sample 1 mean

Sample 1 standard deviation

Sample 2 size

Sample 2 mean

Sample 2 standard deviation

Sample 1 raw data

Sample 2 raw data

Use raw mode for pasted observations. Use summary mode for n, mean, and standard deviation.

Example Data Table

Observation	Program A	Program B
1	82	75
2	85	78
3	79	80
4	91	73
5	88	77
6	84	82
7	86	79
8	90	74
9	87	76
10	83	81
11	89	72
12	81

Formula Used

Mean difference: d = x̄₁ - x̄₂

Welch standard error: SE = sqrt(s₁² / n₁ + s₂² / n₂)

Welch degrees of freedom: df = (A + B)² / (A² / (n₁ - 1) + B² / (n₂ - 1)), where A = s₁² / n₁ and B = s₂² / n₂.

Pooled variance: s_p² = ((n₁ - 1)s₁² + (n₂ - 1)s₂²) / (n₁ + n₂ - 2)

Pooled standard error: SE = s_p sqrt(1 / n₁ + 1 / n₂)

Test statistic: t = (d - d₀) / SE

Confidence interval: d ± t_critical × SE

Effect size: Cohen d = d / s_p. Hedges g applies the small sample correction.

How to Use This Calculator

Select raw data or summary statistics.
Choose Welch when variances may differ.
Choose pooled only when equal variance is reasonable.
Select the tail direction for your hypothesis.
Enter alpha and confidence level values.
Press Calculate to show results above the form.
Use CSV or PDF buttons to save the report.

Two Sample T Test Guide

A two sample t test compares two independent means. It is useful when each group has its own observations. The groups should not contain paired measurements. Common examples include treatment versus control, two separate teams, or two production lines.

What This Test Measures

The calculator estimates the mean difference between group one and group two. It then compares that difference with the variation inside both samples. A larger absolute t value gives stronger evidence against the null difference. A small p value suggests the observed gap is unlikely under the null model.

Welch Or Pooled Method

Welch's method is the safer default. It does not assume equal population variances. It also adjusts degrees of freedom with the Welch Satterthwaite equation. The pooled method assumes both populations have the same variance. Use it only when that assumption is justified by design, history, or clear evidence.

Tails And Decisions

Choose two tailed when any difference matters. Choose right tailed when group one is expected to be larger. Choose left tailed when group one is expected to be smaller. The decision uses the selected alpha level. When p is less than alpha, reject the null hypothesis. Otherwise, do not reject it.

Effect Size And Confidence

Statistical significance does not show practical size. Cohen's d and Hedges' g describe the difference in standard deviation units. The confidence interval gives a likely range for the true mean difference. Wide intervals warn that more data may be needed.

Data Quality Notes

The test works best with independent sampling, numeric measurements, and roughly normal data. Moderate non normality is often acceptable for larger samples. Extreme outliers can distort means and standard deviations. Always inspect the raw data when possible.

Reporting Results

Report the method, t value, degrees of freedom, p value, confidence interval, and effect size. Also include sample sizes, means, and standard deviations. This gives readers enough information to judge both statistical evidence and practical meaning.

Interpreting Limits

The calculator supports raw entries and summary statistics. Raw mode is best when observations are available. Summary mode is useful for published studies. The result remains an estimate. It should support judgment, not replace study design, subject knowledge, or careful expert review alone.

FAQs

What is a two sample t test?

It compares the means of two independent groups. It checks whether the observed mean difference is large compared with sample variation.

Should I use Welch or pooled?

Use Welch for most work. It allows unequal variances. Use pooled only when equal variance is a clear and defensible assumption.

What does a small p value mean?

A small p value means the observed result is unlikely if the null difference is true. It does not measure practical importance.

Can I paste raw data?

Yes. Choose raw observations. Then paste numbers separated by commas, spaces, semicolons, tabs, or line breaks.

Can I use summary statistics?

Yes. Choose summary statistics. Enter sample size, mean, and sample standard deviation for each group.

What is the hypothesized difference?

It is the mean difference stated by the null hypothesis. Most tests use zero, meaning no difference between population means.

What does Cohen d show?

Cohen d shows the mean difference in pooled standard deviation units. It helps judge the practical size of the difference.

Are the groups allowed to be paired?

No. This calculator is for independent groups. Use a paired test when values are matched by person, item, or time.

Observation	Program A	Program B
1	82	75
2	85	78
3	79	80
4	91	73
5	88	77
6	84	82
7	86	79
8	90	74
9	87	76
10	83	81
11	89	72
12	81

Observation	Program A	Program B
1	82	75
2	85	78
3	79	80
4	91	73
5	88	77
6	84	82
7	86	79
8	90	74
9	87	76
10	83	81
11	89	72
12	81