Measure model quality using diagnostic statistics instantly. Review confusion counts, balance, and predictive strength clearly. Make smarter classification decisions with clear statistical evidence today.
Provide the confusion matrix values and optional labels. The tool calculates core and advanced diagnostic statistics instantly.
This sample table shows how predicted probabilities convert into binary predictions using a 0.50 threshold.
| ID | Actual Class | Predicted Probability | Predicted Class | Outcome |
|---|---|---|---|---|
| 1 | Positive | 0.93 | Positive | TP |
| 2 | Positive | 0.81 | Positive | TP |
| 3 | Positive | 0.41 | Negative | FN |
| 4 | Negative | 0.62 | Positive | FP |
| 5 | Negative | 0.12 | Negative | TN |
| 6 | Negative | 0.28 | Negative | TN |
From record-level predictions, aggregate TP, TN, FP, and FN values, then enter them into the tool.
Accuracy = (TP + TN) / (TP + TN + FP + FN)
Shows overall classification correctness across all observations.
Precision = TP / (TP + FP)
Measures how many predicted positives are truly positive.
Recall / Sensitivity = TP / (TP + FN)
Measures the share of actual positives correctly identified.
Specificity = TN / (TN + FP)
Measures the share of actual negatives correctly identified.
F1 Score = 2 × Precision × Recall / (Precision + Recall)
Balances precision and recall into one harmonic metric.
Balanced Accuracy = (Recall + Specificity) / 2
Useful when class sizes are uneven.
Matthews Correlation = (TP×TN − FP×FN) / √[(TP+FP)(TP+FN)(TN+FP)(TN+FN)]
A robust summary metric for imbalanced binary classification.
Cohen's Kappa = (Observed Accuracy − Expected Accuracy) / (1 − Expected Accuracy)
Adjusts agreement by removing agreement expected by chance.
Likelihood Ratios = Recall / FPR and FNR / Specificity
Helpful in diagnostic testing and evidence-based model interpretation.
It measures how well a model separates two classes using confusion matrix counts. It reports accuracy, precision, recall, specificity, F1 score, balanced accuracy, likelihood ratios, kappa, and more.
Accuracy can look strong even when a model misses many rare positives. Precision, recall, specificity, and Matthews correlation give a more reliable view, especially when classes are imbalanced.
Focus on recall when missing a positive case is costly. Medical screening, fraud detection, and safety alerts often prioritize catching as many true positives as possible.
Precision matters when false alarms are expensive. It is useful in tasks where each positive prediction triggers time, money, or manual review.
Balanced accuracy averages recall and specificity. It helps when one class is much larger than the other and prevents majority-class dominance from hiding weak minority detection.
Matthews correlation summarizes all four confusion matrix cells in one score. It is often more informative than accuracy for imbalanced binary classification problems.
Kappa adjusts observed agreement by accounting for chance agreement. It is useful when you want a stricter agreement measure than raw accuracy.
The CSV export saves the calculated metrics in a structured table. The PDF export captures the visible results section for reporting, sharing, or client documentation.
Important Note: All the Calculators listed in this site are for educational purpose only and we do not guarentee the accuracy of results. Please do consult with other sources as well.