Score Test Calculator

Score Test Inputs

Choose a score-test family, add your numbers, and review the decision, p value, interval, and assumptions.

Test type

Alternative hypothesis

Confidence level (%)

Observed successes (x)

Sample size (n)

Null proportion (p₀)

Group 1 successes

Group 1 size

Group 2 successes

Group 2 size

Sample mean

Null mean (μ₀)

Known sigma (σ)

Sample size

Example Data Table

Scenario	Inputs	Illustrative output
One-sample proportion	x = 62, n = 100, p₀ = 0.50, two-sided, 95%	Z ≈ 2.400, p ≈ 0.016, reject H₀
Two-sample proportions	x₁ = 70, n₁ = 120, x₂ = 48, n₂ = 110, two-sided, 95%	Z ≈ 2.269, p ≈ 0.023, reject H₀
One-sample mean	x̄ = 53.4, μ₀ = 50, σ = 8, n = 36, right-tailed, 95%	Z ≈ 2.550, p ≈ 0.005, reject H₀

Formula Used

One-sample proportion: z = (p̂ − p₀) / √[p₀(1 − p₀) / n]

Two-sample proportions: z = (p̂₁ − p̂₂) / √[p̂(1 − p̂)(1/n₁ + 1/n₂)] where p̂ is the pooled proportion under the null.

One-sample mean with known sigma: z = (x̄ − μ₀) / (σ / √n)

The p value comes from the standard normal distribution using the selected tail. Confidence intervals are included for practical interpretation.

How to Use This Calculator

Choose the score test type that matches your study design.
Select a two-sided, left-tailed, or right-tailed alternative.
Enter the relevant sample counts, means, sizes, and null value.
Set the confidence level used for the interval output.
Click the calculate button to view results above the form.
Review the z statistic, p value, interval, and decision.
Use the CSV or PDF buttons to export your result table.
Confirm assumptions before using the result in reports.

Frequently Asked Questions

1. What does a score test measure?

A score test checks whether observed data are compatible with a null parameter value. It uses information evaluated under the null hypothesis and reports a z statistic and p value.

2. When should I use a one-sample proportion score test?

Use it when you want to compare an observed success rate against a benchmark proportion, such as a defect rate, click rate, approval rate, or response rate.

3. Why does the two-sample proportion test use a pooled proportion?

Under the null hypothesis of equal proportions, both groups are assumed to share one common underlying probability. The pooled estimate reflects that shared value when computing the score statistic.

4. Is this the same as a Wald test?

No. A score test evaluates curvature and slope under the null. Wald tests center on the sample estimate instead. Score tests often behave better near boundaries or smaller samples.

5. What assumptions matter most?

You need independent observations, a suitable design, and adequate sample size for normal approximation. For mean tests, the population sigma must be known and the sampling model should be appropriate.

6. How do I interpret a p value here?

The p value is the probability of getting a test statistic at least as extreme as the observed one, assuming the null hypothesis is true.

7. Why include confidence intervals if the test already gives a decision?

A decision tells you whether evidence crosses a threshold. The interval adds magnitude and uncertainty, helping you judge whether the estimated effect is practically important.

8. Can I use this calculator for very small samples?

Be careful. Very small samples can weaken the normal approximation used by score tests. In such cases, exact methods or alternative modeling approaches may be more appropriate.