Propensity Score Calculator

Calculator inputs

Mode

Estimation fits a logistic model: T ~ X1..X5.

Estimand for weights

Weights are shown for both the dataset and the single case.

Select covariates

Unselected covariates are ignored.

Use X1

Use X2

Use X3

Use X4

Use X5

Single-case values

Treatment flag (T)

X1

X2

X3

X4

X5

Single-case propensity is computed using the chosen model.

Coefficients (logit scale)

Intercept (b0)

b1 for X1

b2 for X2

b3 for X3

b4 for X4

b5 for X5

This section is used only in “Use your coefficients” mode.

CSV data (for estimation)

Paste data with header columns: T,X1,X2,X3,X4,X5. Use 0/1 for T.

Rows with missing selected covariates are skipped.
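The row-skipping rule can be sketched in a few lines of Python. This is only an illustration of the stated behavior, not the calculator's actual parser: the function name parse_rows and the sample text are hypothetical, and the third sample row has had its X3 value blanked out to show a skip.

```python
import csv
import io

def parse_rows(text, selected):
    """Return (treatment, covariates) lists from CSV text.

    Rows are skipped when T is not 0/1 or when any selected
    covariate is missing or non-numeric. Unselected columns are ignored.
    """
    reader = csv.DictReader(io.StringIO(text.strip()))
    treatment, covariates = [], []
    for row in reader:
        try:
            t = int(row["T"])
            x = [float(row[name]) for name in selected]
        except (KeyError, TypeError, ValueError):
            continue  # missing column, short row, or blank/non-numeric value
        if t not in (0, 1):
            continue
        treatment.append(t)
        covariates.append(x)
    return treatment, covariates

# Hypothetical sample: the third data row has a blank X3.
sample = """T,X1,X2,X3,X4,X5
1,35,1,0.42,120,0
1,40,1,0.30,115,0
0,29,0,,90,1
0,33,0,0.15,95,1
"""

t, x = parse_rows(sample, ["X1", "X3"])
print(len(t), "rows kept when X1 and X3 are selected")
```

Because only selected covariates are checked, the same row is kept when X3 is not selected.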

Example data table

T	X1	X2	X3	X4	X5
1	35	1	0.42	120	0
1	40	1	0.30	115	0
0	29	0	0.10	90	1
0	33	0	0.15	95	1

Use any covariate meanings you prefer, such as age, baseline score, or risk indicators.

Formula used

A propensity score is the probability of receiving treatment given observed covariates: e(x) = P(T=1 | X=x).

This calculator uses logistic regression: logit(e(x)) = b0 + b1·X1 + b2·X2 + b3·X3 + b4·X4 + b5·X5. Writing z for this linear predictor, the score is e(x) = 1 / (1 + exp(-z)).
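As a minimal sketch of the formula above, the following Python function scores one case from a set of coefficients. The function name propensity and the example coefficient values are hypothetical; only the selected covariates and their coefficients should be passed in.

```python
import math

def propensity(intercept, coefs, values):
    """Return e(x) = 1 / (1 + exp(-z)) with z = b0 + sum(b_k * X_k)."""
    z = intercept + sum(b * x for b, x in zip(coefs, values))
    # Branch on the sign of z so exp() never overflows for large |z|.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)

# Hypothetical coefficients for two selected covariates.
score = propensity(-1.0, [0.5, 0.0], [2.0, 7.0])
print(score)
```

In this example z = -1.0 + 0.5·2.0 + 0.0·7.0 = 0, so the score is 0.5.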

For weighting, common choices are: ATE weights (treated 1/e, control 1/(1-e)) and ATT weights (treated 1, control e/(1-e)).
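The two weighting schemes translate directly into code. The sketch below uses a hypothetical function name, ip_weight, and rejects scores of exactly 0 or 1, where the weights are undefined.

```python
def ip_weight(t, e, estimand="ATE"):
    """Inverse-probability weight for one unit with treatment flag t and score e."""
    if not 0.0 < e < 1.0:
        raise ValueError("propensity must lie strictly between 0 and 1")
    if estimand == "ATE":
        # Treated: 1/e, control: 1/(1-e).
        return 1.0 / e if t == 1 else 1.0 / (1.0 - e)
    if estimand == "ATT":
        # Treated: 1, control: e/(1-e).
        return 1.0 if t == 1 else e / (1.0 - e)
    raise ValueError("estimand must be 'ATE' or 'ATT'")

for flag in (1, 0):
    print(flag, ip_weight(flag, 0.25, "ATE"), ip_weight(flag, 0.25, "ATT"))
```

With e = 0.25, a treated unit gets an ATE weight of 4, while a control unit gets 1/0.75 under ATE and 0.25/0.75 under ATT.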

How to use

Select the covariates you want included in the model.
Choose Estimate from CSV data and paste your dataset, or choose Use your coefficients.
Enter single-case covariate values to score one individual or scenario.
Pick an estimand to view inverse-probability weights for rebalancing.
Press Calculate to view propensity, diagnostics, and balance checks.
Use the download buttons to export the computed table and report.

Why propensity scores matter

Propensity scores summarize the probability of receiving treatment given observed covariates. In this calculator, treatment is T=1 and the score e(x) is computed from selected X1–X5 inputs. Using a single number helps compare treated and control units on a common scale, supporting matching, stratification, or weighting. When e(x) is well estimated, downstream effect estimation can reduce bias from measured confounding, while keeping modeling assumptions explicit.

Model specification and diagnostics

The tool fits a logistic model, logit(e)=b0+b1X1+b2X2+b3X3+b4X4+b5X5, using iteratively reweighted least squares (IRLS). After estimation, it reports log-likelihood, McFadden R², and AUC. Log-likelihood tracks overall fit, McFadden R² compares the fitted model to an intercept-only baseline, and AUC summarizes discrimination between treated and control observations. These diagnostics guide whether additional covariates, transformations, or interaction terms may be justified.
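For readers who want to see the mechanics, here is a dependency-free Python sketch of the fit and the three diagnostics. It is not the calculator's own code. It uses Newton-Raphson updates, which for logistic regression are algebraically equivalent to IRLS; all function names and the toy data are hypothetical, and the AUC is computed by brute-force pair comparison with ties counted as one half.

```python
import math

def sigmoid(z):
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)

def solve(a, b):
    """Solve a.x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular system (separation or collinear covariates?)")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x

def fit_logistic(t, xs, max_iter=50, tol=1e-8):
    """Return [b0, b1, ...] maximizing the logistic log-likelihood."""
    rows = [[1.0] + list(x) for x in xs]  # prepend intercept column
    k = len(rows[0])
    beta = [0.0] * k
    for _ in range(max_iter):
        p = [sigmoid(sum(b * v for b, v in zip(beta, r))) for r in rows]
        # Score X'(t - p) and information X'WX with W = diag(p(1-p)).
        grad = [sum(r[j] * (ti - pi) for r, ti, pi in zip(rows, t, p)) for j in range(k)]
        info = [[sum(r[i] * r[j] * pi * (1 - pi) for r, pi in zip(rows, p))
                 for j in range(k)] for i in range(k)]
        step = solve(info, grad)
        beta = [b + s for b, s in zip(beta, step)]
        if max(abs(s) for s in step) < tol:
            break
    return beta

def predict(beta, xs):
    return [sigmoid(beta[0] + sum(b * v for b, v in zip(beta[1:], x))) for x in xs]

def log_likelihood(t, p):
    return sum(math.log(pi) if ti == 1 else math.log(1.0 - pi) for ti, pi in zip(t, p))

def mcfadden_r2(t, p):
    base_rate = sum(t) / len(t)  # intercept-only model predicts the treated share
    return 1.0 - log_likelihood(t, p) / log_likelihood(t, [base_rate] * len(t))

def auc(t, p):
    treated = [pi for ti, pi in zip(t, p) if ti == 1]
    control = [pi for ti, pi in zip(t, p) if ti == 0]
    wins = sum(1.0 if a > b else 0.5 if a == b else 0.0 for a in treated for b in control)
    return wins / (len(treated) * len(control))

# Hypothetical toy data: one binary covariate, 8 rows.
T = [1, 0, 0, 0, 1, 1, 1, 0]
X = [[0.0], [0.0], [0.0], [0.0], [1.0], [1.0], [1.0], [1.0]]
beta = fit_logistic(T, X)
scores = predict(beta, X)
print(beta, log_likelihood(T, scores), mcfadden_r2(T, scores), auc(T, scores))
```

In the toy data one of four units is treated when the covariate is 0 and three of four when it is 1, so the fitted scores should come out at 0.25 and 0.75. The pairwise AUC is quadratic in sample size; a rank-based formula is preferable for large datasets.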

Overlap and positivity checks

A key validity condition is positivity: for each covariate pattern, both treatment states should be plausible. The calculator reports propensity ranges separately for treated and control groups to highlight overlap. Limited overlap often produces extreme weights, unstable estimates, and sensitivity to minor model changes. Practical steps include trimming non-overlapping regions, restricting to a common support interval, or redefining the target population to where comparisons are credible.
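One simple way to act on this is to compute the per-group propensity ranges and keep only units inside their intersection. The Python sketch below assumes that min/max definition of common support, which is one of several in use; the function names and example scores are hypothetical.

```python
def propensity_ranges(t, e):
    """Return ((min, max) for treated, (min, max) for control)."""
    treated = [ei for ti, ei in zip(t, e) if ti == 1]
    control = [ei for ti, ei in zip(t, e) if ti == 0]
    return (min(treated), max(treated)), (min(control), max(control))

def common_support(t, e):
    """Return (lo, hi, keep) where keep flags units inside the overlap interval."""
    (t_lo, t_hi), (c_lo, c_hi) = propensity_ranges(t, e)
    lo, hi = max(t_lo, c_lo), min(t_hi, c_hi)
    keep = [lo <= ei <= hi for ei in e]
    return lo, hi, keep

# Hypothetical scores for three treated and three control units.
T = [1, 1, 1, 0, 0, 0]
E = [0.9, 0.6, 0.4, 0.5, 0.3, 0.1]
lo, hi, keep = common_support(T, E)
print(propensity_ranges(T, E), lo, hi, keep)
```

Here the treated scores span 0.4 to 0.9 and the control scores span 0.1 to 0.5, so only the two units scoring between 0.4 and 0.5 remain. If lo exceeds hi, the groups do not overlap at all and no unit is kept.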

Weighting and balance metrics

For causal estimation, weights depend on the estimand. For ATE, treated units receive 1/e and controls receive 1/(1−e). For ATT, treated units receive 1 and controls receive e/(1−e). The calculator computes standardized mean differences (SMD) before and after weighting for each selected covariate. As a rule of thumb, absolute SMD below 0.10 indicates good balance, though tighter thresholds may be preferred in high-stakes analyses.
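The balance check can be sketched as follows. The calculator's exact SMD formula is not stated here, so this example assumes one common convention: the difference in (weighted) group means divided by the pooled standard deviation of the unweighted groups, so the denominator is the same before and after weighting. Function names and data are hypothetical.

```python
import math

def weighted_mean(values, weights):
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)

def sample_variance(values):
    m = sum(values) / len(values)
    return sum((v - m) ** 2 for v in values) / (len(values) - 1)

def smd(x, t, weights=None):
    """Standardized mean difference of covariate x between treated and control.

    Assumes each group has at least two units and non-zero pooled variance.
    """
    if weights is None:
        weights = [1.0] * len(x)  # unweighted (before-weighting) SMD
    xt = [v for v, ti in zip(x, t) if ti == 1]
    wt = [w for w, ti in zip(weights, t) if ti == 1]
    xc = [v for v, ti in zip(x, t) if ti == 0]
    wc = [w for w, ti in zip(weights, t) if ti == 0]
    pooled_sd = math.sqrt((sample_variance(xt) + sample_variance(xc)) / 2.0)
    return (weighted_mean(xt, wt) - weighted_mean(xc, wc)) / pooled_sd

# Hypothetical covariate values and weights.
X1 = [2.0, 4.0, 1.0, 3.0]
T = [1, 1, 0, 0]
before = smd(X1, T)
after = smd(X1, T, weights=[3.0, 1.0, 1.0, 3.0])
print(before, after, abs(after) < 0.10)
```

In this toy case the weights pull both group means to 2.5, so the SMD after weighting is zero.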

Reporting and reproducibility

To support auditability, exportable CSV output includes each row’s T, covariates, propensity, and weight, while the PDF summary captures diagnostics, overlap, coefficients, and balance results. Record the covariate set, any trimming decisions, and the estimand used, because these choices define the causal question. When results change materially across reasonable specifications, treat conclusions as fragile and consider sensitivity analyses or alternative adjustment strategies.

FAQs

What inputs should I include as covariates?

Include variables measured before treatment that affect both treatment assignment and outcome. Avoid post-treatment variables. Start with key drivers, then refine using balance checks and domain expertise.

Does a higher propensity score mean a better outcome?

No. The score only reflects treatment likelihood given covariates. It is not a risk score or outcome prediction. Use it to improve comparability between treated and control groups.

How do I interpret extreme weights?

Extreme weights usually indicate limited overlap or very small e(x) or 1−e(x). They can inflate variance and make estimates unstable. Consider trimming, stabilizing weights, or restricting to common support.
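To make the variance cost concrete, the sketch below computes stabilized ATE weights and Kish's effective sample size, (sum of w)² / (sum of w²). Both are standard techniques, but nothing here confirms that the calculator implements them; the function names and data are hypothetical.

```python
def stabilized_ate_weights(t, e):
    """Scale ATE weights by the marginal treatment share: P(T=1)/e or P(T=0)/(1-e)."""
    p_treated = sum(t) / len(t)
    return [p_treated / ei if ti == 1 else (1.0 - p_treated) / (1.0 - ei)
            for ti, ei in zip(t, e)]

def effective_sample_size(weights):
    """Kish's approximation: equals n for equal weights, shrinks as weights diverge."""
    return sum(weights) ** 2 / sum(w * w for w in weights)

# Hypothetical data: the third unit is treated despite a very low score.
T = [1, 0, 1, 0]
E = [0.5, 0.5, 0.02, 0.5]
w = stabilized_ate_weights(T, E)
print(w, effective_sample_size(w))
```

A single treated unit with e = 0.02 gets a weight about 25 times larger than the others, and the effective sample size falls from 4 to a little over 1.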

What balance target should I aim for?

Many analyses target absolute SMD below 0.10 after weighting or matching. For sensitive decisions, aim lower and also inspect overlap and weight distributions, not just a single threshold.

When should I choose ATE vs ATT weights?

ATE targets the average effect in the full population represented by your data. ATT targets the effect among treated units. Choose the estimand that matches the decision context and reporting audience.

Is a good AUC required for a valid analysis?

Not necessarily. A very high AUC can even signal poor overlap, because it means the model separates treated and control units almost completely. Focus on achieving covariate balance and adequate overlap, then report diagnostics transparently.

Notes and good practice

Look for overlap between treated and control propensity ranges.
Large weights can signal limited overlap or model instability.
After weighting, aim for small absolute SMD values across covariates.
Propensity scores do not prove causality; assumptions still matter.