Refine datasets through percentile controls and instant diagnostics. Review capped values, spread, and center quickly. Build stronger analyses with exports, notes, and reusable tables.
This example uses 12.5% lower and 12.5% upper winsorization. One lowest and one highest value are capped.
| Position | Original Sorted Value | Winsorized Value | Status |
|---|---|---|---|
| 1 | 12 | 13 | Lower cap |
| 2 | 13 | 13 | Unchanged |
| 3 | 14 | 14 | Unchanged |
| 4 | 15 | 15 | Unchanged |
| 5 | 16 | 16 | Unchanged |
| 6 | 18 | 18 | Unchanged |
| 7 | 19 | 19 | Upper cap reference |
| 8 | 100 | 19 | Upper cap |
Raw mean: 25.875
Winsorized mean: 15.875
Interpretation: The extreme outlier no longer dominates the average, so the central tendency becomes more stable and representative.
The winsorized mean replaces extreme tail observations with cutoff values based on the chosen lower and upper percentages.
Step 1: Sort the dataset from smallest to largest.
Step 2: Compute replacement counts.
kL = floor(n × pL / 100)
kU = floor(n × pU / 100)
Here, n is the number of observations, pL is the lower percentage, and pU is the upper percentage.
Step 3: Replace the smallest kL values with the next retained lower value.
Step 4: Replace the largest kU values with the next retained upper value.
Winsorized Mean = (Σ winsorized values) / n
This calculator also reports the raw mean, trimmed mean, median, variance, standard deviation, and the exact change caused by winsorization.
A winsorized mean replaces extreme observations with nearby cutoff values before averaging. It reduces outlier influence while still keeping every record in the dataset.
A trimmed mean removes tail observations entirely. A winsorized mean keeps the same sample size, but caps extreme values at selected boundary positions.
Use it when a few extreme values distort the average, but you still want every observation represented in the final calculation and summary statistics.
The calculator uses floor counts. In small samples, low percentages may produce zero capped values because the tail count rounds down.
Yes. Asymmetric winsorization is useful when one tail contains stronger outliers than the other or when domain rules justify uneven treatment.
Yes. Capping extreme values usually reduces spread, so the winsorized variance and standard deviation are often smaller than the raw statistics.
Yes. The calculator accepts integers, decimals, and negative numbers, provided every entry is numeric and separated clearly.
It lists each sorted value, its winsorized replacement, the numeric adjustment, and whether the point stayed unchanged or was capped.
Important Note: All the Calculators listed in this site are for educational purpose only and we do not guarentee the accuracy of results. Please do consult with other sources as well.