Explore sample means, variability, hypotheses, and evidence. Switch test types, enter values instantly, and interpret. Visual summaries and downloadable reports make statistical decisions easier.
| Scenario | Recommended Test | Example Inputs | Use Case |
|---|---|---|---|
| Single class average vs target score | One-sample | Mean 72.4, SD 8.6, n 20, null mean 70 | Checks whether one sample differs from a benchmark. |
| Two independent classes | Independent equal variances | Group 1: 78.2, 9.1, 18; Group 2: 71.5, 10.4, 16 | Compares average scores from different groups. |
| Different spreads between groups | Welch two-sample | Group 1 SD 6.2, Group 2 SD 15.1 | Handles unequal variances more safely. |
| Before and after tutoring scores | Paired | Differences mean 4.2, SD 6.1, pairs 14 | Measures change within matched observations. |
t = (x̄ - μ₀) / (s / √n)
Use this when one sample mean is compared with a known or hypothesized population mean.
sp2 = [((n₁-1)s₁²) + ((n₂-1)s₂²)] / (n₁+n₂-2)
t = [(x̄₁ - x̄₂) - Δ₀] / [sp √(1/n₁ + 1/n₂)]
Use this when two independent groups likely share the same variance.
t = [(x̄₁ - x̄₂) - Δ₀] / √(s₁²/n₁ + s₂²/n₂)
Welch’s version is preferred when group variances or sizes differ noticeably.
t = (d̄ - Δ₀) / (sd / √n)
Here, d̄ is the average paired difference and sd is the standard deviation of differences.
Confidence intervals are built with the estimate ± critical t × standard error. The calculator also reports effect size to show practical importance, not only statistical significance.
Use it when one sample is compared against a known or hypothesized mean. It checks whether the sample average is significantly different from that benchmark.
The equal-variance Student test assumes both groups have similar population variances. Welch’s test relaxes that assumption and is usually safer when spreads or sample sizes differ.
Use a paired test when observations are matched, such as before-and-after scores or repeated measurements on the same subjects.
The p value shows how likely your result would be if the null hypothesis were true. Smaller values indicate stronger evidence against the null.
It gives a plausible range for the sample mean or mean difference. It helps you judge magnitude and precision, not just significance.
For independent tests, different lengths are fine. For paired tests, each value must have a matching partner, so both lists must contain the same count.
Yes. It reports Cohen’s d for one-sample and paired designs, and Hedges’ g for two-sample designs, with a simple magnitude label.
Yes. After calculation, use the CSV button for spreadsheet-friendly output or the PDF button for a printable result summary.
Important Note: All the Calculators listed in this site are for educational purpose only and we do not guarentee the accuracy of results. Please do consult with other sources as well.