Two Sample Unequal Variance T Test Calculator

Calculator

Input mode

Alternative hypothesis

Alpha

Sample 1 raw values

Separate values with commas, spaces, or new lines.

Sample 2 raw values

Raw mode ignores the summary boxes below.

Hypothesized mean difference

Usually 0, which tests whether the population means are equal.

Sample 1 size

Sample 1 mean

Sample 1 spread

Sample 2 size

Sample 2 mean

Sample 2 spread

Summary spread type

Confidence level

Example Data Table

Group	Values	Sample Size	Mean	Sample Variance
Sample 1	12.4, 13.1, 11.9, 14.2, 12.8, 13.5, 12.7, 14.0	8	13.0750	0.6221
Sample 2	10.8, 11.3, 12.1, 10.5, 11.7, 12.4, 10.9	7	11.3857	0.5014

Formula Used

Welch standard error: SE = √(s1² / n1 + s2² / n2)

Test statistic: t = [(x̄1 − x̄2) − Δ0] / SE

Degrees of freedom: df = (s1²/n1 + s2²/n2)² / {[(s1²/n1)²/(n1−1)] + [(s2²/n2)²/(n2−1)]}

Confidence interval: (x̄1 − x̄2) ± t critical × SE

The p value comes from the Student t distribution with Welch degrees of freedom.
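As a rough illustration, the formulas above can be sketched in Python using only the standard library. The function name welch_stats and its dictionary keys are assumptions made for this example, not the calculator's actual code. The data are the raw values listed in the Example Data Table.

```python
import math
import statistics

def welch_stats(sample1, sample2, null_diff=0.0):
    """Welch standard error, t statistic, and degrees of freedom (illustrative helper)."""
    n1, n2 = len(sample1), len(sample2)
    mean1, mean2 = statistics.fmean(sample1), statistics.fmean(sample2)
    # statistics.variance is the sample variance (divides by n - 1).
    v1 = statistics.variance(sample1) / n1   # s1^2 / n1
    v2 = statistics.variance(sample2) / n2   # s2^2 / n2
    se = math.sqrt(v1 + v2)
    t = ((mean1 - mean2) - null_diff) / se
    df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    return {"mean_diff": mean1 - mean2, "se": se, "t": t, "df": df}

sample1 = [12.4, 13.1, 11.9, 14.2, 12.8, 13.5, 12.7, 14.0]
sample2 = [10.8, 11.3, 12.1, 10.5, 11.7, 12.4, 10.9]

result = welch_stats(sample1, sample2)
for name, value in result.items():
    print(f"{name}: {value:.4f}")
```

For this data the degrees of freedom come out fractional (close to 13), which is expected for Welch's method.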

How to Use This Calculator

Select raw data when you have every observation. Select summary statistics when you already know sample size, mean, and spread. Choose the tail that matches your research claim. Enter alpha for the decision rule. Enter a confidence level for the interval. Press calculate. Then download the result as CSV or PDF when needed.

Understanding the Welch Two Sample Test

The Welch two sample t test compares two independent means. It is useful when the two groups may have different variances. This happens often in surveys, lab trials, finance samples, and classroom data. The method does not force a pooled variance. It estimates the standard error from each group separately.

Why Unequal Variance Matters

A pooled test can be misleading when the group spreads are very different, especially when the sample sizes also differ. Welch's method adjusts both the standard error and the degrees of freedom. The adjusted degrees of freedom are often fractional. That is normal: the fractional value gives a better reference distribution for the t statistic. The calculator reports that value so the decision is transparent.
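A small sketch can make the difference concrete. The summary numbers below are invented purely for illustration (a small, high variance group against a large, low variance group), and both function names are hypothetical.

```python
import math

def pooled_se_and_df(n1, var1, n2, var2):
    """Standard error and degrees of freedom under the equal variance (pooled) assumption."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * var1 + (n2 - 1) * var2) / df
    return math.sqrt(pooled_var * (1 / n1 + 1 / n2)), df

def welch_se_and_df(n1, var1, n2, var2):
    """Standard error and degrees of freedom without assuming equal variances."""
    v1, v2 = var1 / n1, var2 / n2
    df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    return math.sqrt(v1 + v2), df

# Made-up summary statistics, chosen only to show the effect.
n1, var1 = 5, 25.0    # small sample, large variance
n2, var2 = 30, 1.0    # large sample, small variance

pooled_se, pooled_df = pooled_se_and_df(n1, var1, n2, var2)
welch_se, welch_df = welch_se_and_df(n1, var1, n2, var2)

print(f"pooled: SE = {pooled_se:.4f}, df = {pooled_df}")
print(f"Welch:  SE = {welch_se:.4f}, df = {welch_df:.2f}")
```

Under these assumed numbers the pooled standard error is less than half the Welch standard error, and the pooled test uses 33 degrees of freedom against roughly 4. A pooled test would therefore overstate the evidence against the null hypothesis.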

What The Result Means

The t statistic measures how far the observed mean difference is from the null difference, expressed in standard errors. A large absolute t value indicates stronger evidence against the null hypothesis. The p value is the probability, assuming the null hypothesis is true, of obtaining a t statistic at least as extreme as the one observed. Compare the p value with alpha. If the p value is below alpha, reject the null hypothesis. Otherwise, do not reject it.
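The decision rule can be sketched in Python without third party libraries. This is only one possible approach: it integrates the Student t density numerically with Simpson's rule, which works for fractional degrees of freedom. The function names and the alternative labels "two-sided", "greater", and "less" are assumptions for this example. The values t = 4.3705 and df = 12.983 are rounded results of applying the Welch formulas to the raw values in the Example Data Table.

```python
import math

def t_pdf(x, df):
    """Density of the Student t distribution (df may be fractional)."""
    log_c = math.lgamma((df + 1) / 2) - math.lgamma(df / 2) - 0.5 * math.log(df * math.pi)
    return math.exp(log_c - (df + 1) / 2 * math.log1p(x * x / df))

def t_upper_tail(t, df, steps=2000):
    """P(T > t) for t >= 0, using Simpson's rule on [0, t] (steps must be even)."""
    h = t / steps
    total = t_pdf(0.0, df) + t_pdf(t, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3

def welch_p_value(t, df, alternative="two-sided"):
    upper = t_upper_tail(abs(t), df)          # P(T > |t|)
    if alternative == "two-sided":
        return 2 * upper
    if alternative == "greater":              # H1: mu1 - mu2 > null difference
        return upper if t >= 0 else 1 - upper
    if alternative == "less":                 # H1: mu1 - mu2 < null difference
        return 1 - upper if t >= 0 else upper
    raise ValueError("alternative must be 'two-sided', 'greater', or 'less'")

alpha = 0.05
p = welch_p_value(4.3705, 12.983)
print(f"p = {p:.5f}")
print("reject the null hypothesis" if p < alpha else "do not reject the null hypothesis")
```

A statistics library would normally supply the t distribution directly. The numerical integration here only keeps the sketch self-contained.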

Confidence Interval

The confidence interval gives a range of plausible values for the true mean difference μ1 − μ2. If a two sided interval excludes the null difference, the two sided test is significant at alpha = 1 − confidence level (for example, a 95% interval matches alpha = 0.05). The interval also shows the direction and practical size of the difference. This is often more useful than a yes or no decision.
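A minimal sketch of the interval arithmetic follows. The mean difference (1.6893) and standard error (0.3865) are rounded values computed from the raw data in the Example Data Table. The critical value 2.160 is the standard table value for a two sided 95% interval with 13 degrees of freedom, used here as an approximation for the Welch degrees of freedom of about 12.98. The function name is hypothetical.

```python
def welch_confidence_interval(mean_diff, se, t_critical):
    """Two sided interval: (x1bar - x2bar) +/- t_critical * SE."""
    margin = t_critical * se
    return mean_diff - margin, mean_diff + margin

mean_diff = 1.6893    # x1bar - x2bar for the example data (rounded)
se = 0.3865           # Welch standard error for the example data (rounded)
t_critical = 2.160    # approximate 95% two sided critical value, about 13 df
null_diff = 0.0

lower, upper = welch_confidence_interval(mean_diff, se, t_critical)
excludes_null = not (lower <= null_diff <= upper)

print(f"95% CI: ({lower:.3f}, {upper:.3f})")
print("interval excludes the null difference:", excludes_null)
```

Because the whole interval lies above 0 for this data, it agrees with a significant two sided test at alpha = 0.05 and indicates that Sample 1 has the larger mean.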

Using Raw Or Summary Data

With raw data, the calculator computes each sample size, mean, and variance directly from the observations. Summary data is faster when you already know sample size, mean, and spread. Enter the spread as either the sample variance or the sample standard deviation, and set the spread type to match. The calculator squares a standard deviation to get the variance when needed. Keep the same units for both groups.
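The two input modes could be handled along these lines. This is a sketch under assumptions, not the calculator's own code: the function names and the spread type labels "sd" and "variance" are invented for the example.

```python
import re
import statistics

def parse_values(text):
    """Split raw input on commas, spaces, or new lines and convert to floats."""
    return [float(token) for token in re.split(r"[,\s]+", text) if token]

def to_variance(spread, spread_type):
    """Return the sample variance for a spread given as 'sd' or 'variance' (assumed labels)."""
    if spread_type == "sd":
        return spread ** 2
    if spread_type == "variance":
        return spread
    raise ValueError("spread_type must be 'sd' or 'variance'")

# Raw mode: size, mean, and sample variance come from the observations.
values = parse_values("10.8, 11.3 12.1\n10.5, 11.7 12.4\n10.9")
n, mean, var = len(values), statistics.fmean(values), statistics.variance(values)
print(f"raw mode: n = {n}, mean = {mean:.4f}, variance = {var:.4f}")

# Summary mode: a standard deviation is squared before it enters the formulas.
print("summary mode variance:", to_variance(0.7, "sd"))
```

Choosing the wrong spread type changes the variance by a square or a square root, so it affects the standard error, the degrees of freedom, and the p value.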

Good Statistical Practice

Check that groups are independent. Look for extreme outliers before trusting any test. For very small samples, inspect the shape of the data, because the test assumes roughly normal populations. Welch's test is fairly robust, but it is not magic. Report the sample sizes, means, variances, t value, degrees of freedom, p value, and confidence interval. Add context about practical importance. A tiny p value can still describe a small effect. A non-significant result can still be useful when the interval is narrow, because a narrow interval rules out large differences. Always explain assumptions before presenting a final business recommendation.

FAQs

What does this calculator test?

It tests whether two independent group means differ when the population variances are not assumed equal. This is commonly called Welch's two sample t test.

When should I use unequal variances?

Use it when group spreads differ, sample sizes differ, or you do not trust a pooled variance assumption. It is a safe default for many independent sample comparisons.

Can I enter raw data?

Yes. Choose raw sample data and paste values for both groups. The calculator computes sample size, mean, and sample variance automatically.

Can I use summary statistics?

Yes. Choose summary statistics. Enter each sample size, mean, and either sample variance or sample standard deviation. Use the spread type selector carefully.

What is the null difference?

It is the hypothesized value of μ1 − μ2. Most tests use 0, meaning no difference between population means.

Why are degrees of freedom decimal?

Welch's method estimates degrees of freedom from two separate variances. The formula often returns a decimal value, and that is expected.

What does a low p value mean?

A low p value means the observed difference would be unusual if the null hypothesis were true. If p is below alpha, the calculator rejects the null hypothesis.

Does significance prove practical importance?

No. Significance shows statistical evidence. Practical importance depends on effect size, measurement units, cost, risk, and subject knowledge.