Two Sample T Score Calculator

Calculator

Data source

Variance method

Alternative hypothesis

Group 1 mean

Group 1 standard deviation

Group 1 sample size

Group 2 mean

Group 2 standard deviation

Group 2 sample size

Hypothesized difference

Confidence level percent

Group 1 raw values

Group 2 raw values

Example Data Table

Group	Mean	Standard Deviation	Sample Size	Use Case
Training Group	82	9.5	30	After new method
Control Group	76.4	8.8	28	Standard method
Hypothesized Difference	0		95%	Two tailed comparison

Formula Used

Welch Unequal Variance Test

t = ((x̄1 - x̄2) - Δ0) / √((s1² / n1) + (s2² / n2))

df = ((s1² / n1 + s2² / n2)²) / (((s1² / n1)² / (n1 - 1)) + ((s2² / n2)² / (n2 - 1)))

Pooled Equal Variance Test

sp² = (((n1 - 1)s1²) + ((n2 - 1)s2²)) / (n1 + n2 - 2)

t = ((x̄1 - x̄2) - Δ0) / (sp × √(1 / n1 + 1 / n2))

df = n1 + n2 - 2

Confidence Interval

CI = observed mean difference ± t critical × standard error

Effect Size

Cohen d = observed mean difference / pooled standard deviation. Hedges g applies a small sample correction to Cohen d.

How to Use This Calculator

Select summary statistics or raw sample values.
Choose Welch when variances or sample sizes are different.
Choose pooled only when equal variance is reasonable.
Enter both group means, standard deviations, and sample sizes.
Set the hypothesized difference. Use zero for most comparisons.
Choose the alternative hypothesis and confidence level.
Press Calculate to view the result below the header.
Use CSV or PDF download for records and reports.

Understanding Two Sample T Scores

A two sample t score compares the average value from two independent groups. It helps you decide whether the observed gap is large enough to matter statistically. The calculator can use Welch testing or pooled variance testing. Welch testing is safer when spreads or group sizes differ. Pooled testing is useful when both groups can reasonably share one variance estimate.

What the Result Shows

The t score measures the mean difference in standard error units. A larger absolute value suggests a stronger separation between groups. The degrees of freedom control the reference curve used for the p value. Smaller samples usually give fewer degrees of freedom. That makes the test more conservative. The p value estimates how unusual the result would be if the hypothesized difference were true.

Choosing Inputs Carefully

Good results require clean inputs. Use sample means, sample standard deviations, and sample sizes from each group. You may also paste raw values. Raw values should come from independent observations. Do not mix paired before and after data here. That situation needs a paired t test. Enter a hypothesized mean difference when testing a value other than zero.

Interpreting Practical Size

Statistical significance is not the whole story. The calculator also reports Cohen d and Hedges g. These measures describe effect size using the pooled spread. Hedges g corrects small sample bias. A confidence interval gives a practical range for the true mean difference. If the interval is narrow, the estimate is precise. If it is wide, more data may be needed.

Reporting the Test

A clear report should include the method, t score, degrees of freedom, p value, mean difference, and confidence interval. State whether the test was two tailed or one tailed. Mention why Welch or pooled variance was selected. Add the group summaries so another reader can understand the evidence. Exporting the result helps keep a reproducible record for homework, research notes, audits, and dashboards.

Common Cautions

Check assumptions before trusting the output. Each group should be sampled independently. Extreme outliers can distort means and standard deviations. Very small samples need careful review. The test also assumes measurements are numeric and roughly continuous. Use domain knowledge before making final decisions wisely.

FAQs

What is a two sample t score?

It measures how far the difference between two sample means is from the hypothesized difference, using standard error units.

When should I use Welch testing?

Use Welch testing when group variances differ, sample sizes differ, or you are unsure whether equal variance is reasonable.

When should I use pooled variance?

Use pooled variance only when the two populations can reasonably share the same variance and the study design supports that assumption.

Can I paste raw values?

Yes. Select raw sample values, then paste each independent group into its own box using commas, spaces, or new lines.

What does the p value mean?

The p value estimates how unusual your result is if the null hypothesis is true. Smaller values give stronger evidence.

What is Cohen d?

Cohen d is an effect size. It expresses the mean difference relative to the pooled standard deviation.

What is Hedges g?

Hedges g is a corrected effect size. It adjusts Cohen d to reduce small sample bias.

Can this calculator handle paired data?

No. This page is for independent samples. Paired before and after data needs a paired t score calculator.