Measure pairwise mistakes across positives, negatives, and margins. Visualize loss behavior with detailed comparative outputs. Export summaries fast for audits, tuning, validation, and reporting.
Use comma, space, semicolon, or line breaks between values. Positive scores should outrank negative scores.
| Label Group | Scores | Weights | Notes |
|---|---|---|---|
| Positive Items | 2.8, 2.1, 1.9, 1.4 | 1, 1, 1, 1 | Expected to rank above all negative items. |
| Negative Items | 1.7, 1.1, 0.9, 0.3 | 1, 1, 1, 1 | Used to create all positive-negative pair combinations. |
| Margin | 0.50 | Not applicable | Pairs below this gap receive a penalty. |
| Suggested Loss Type | Hinge | Not applicable | Good for direct margin-based interpretation. |
Each positive score is compared with each negative score. For a pair, let the gap be g = spositive - snegative. Let the target margin be m. The shifted gap becomes z = g - m.
If weights are provided, each pair receives the product of its positive and negative weights. Weighted loss equals the sum of pair loss × pair weight. Weighted average loss divides that weighted sum by total pair weight.
Ranking loss measures how often higher-priority items fail to score above lower-priority items. Lower values indicate better ordering performance and fewer pairwise mistakes.
Use hinge loss when you want a direct margin-based penalty. It is easy to interpret because only pairs failing the target margin contribute loss.
Logistic loss gives smooth gradients around the margin boundary. That makes it helpful when you want stable optimization and softer penalties for near-correct pairs.
The margin defines the desired separation between positive and negative scores. Larger margins demand stronger ranking confidence and usually increase penalties for close pairs.
Weights let you emphasize important observations. A pair built from larger weights contributes more to the weighted loss and weighted accuracy metrics.
For positive-negative comparisons, pair accuracy closely matches empirical AUC interpretation. It represents the share of pairs ordered correctly by the scoring model.
Yes. Any numeric scores work if larger values indicate stronger relevance. They can be raw model scores, margins, logits, or calibrated probabilities.
Export CSV when you need spreadsheet review or auditing. Export PDF when you need a compact summary for presentations, validation notes, or client reporting.
Important Note: All the Calculators listed in this site are for educational purpose only and we do not guarentee the accuracy of results. Please do consult with other sources as well.