Upload CSV, pick settings, and view scree curve. See cumulative variance and suggested components quickly. Download results as PDF or CSV in seconds securely.
This sample has four correlated variables. Try it using “Load Example”.
| A | B | C | D |
|---|---|---|---|
| 12 | 18 | 25 | 40 |
| 15 | 20 | 22 | 38 |
| 14 | 19 | 24 | 41 |
| 16 | 21 | 23 | 39 |
| 13 | 17 | 26 | 42 |
| 18 | 24 | 28 | 45 |
| 17 | 23 | 27 | 44 |
| 19 | 25 | 29 | 46 |
| 20 | 26 | 30 | 47 |
| 21 | 27 | 31 | 48 |
The scree plot graphs eigenvalues \(\lambda_i\) by component index. Larger eigenvalues indicate components explaining more variability.
A scree plot summarizes how much structure each principal component captures. This calculator forms a covariance or correlation matrix from your numeric columns, computes its eigenvalues, then orders them from largest to smallest. The first points typically fall sharply because early components absorb most variance. The explained-variance column reports 100×λi/Σλ, while the cumulative column tracks Σk≤i 100×λk/Σλ. For a correlation matrix, Σλ equals the number of variables.
Variance targets translate the curve into an actionable cutoff. Many applied studies retain enough components to reach 70%–95% cumulative variance, depending on noise, measurement precision, and interpretability needs. Enter a target such as 0.90 to keep the smallest k where cumulative variance ≥ 90%. If your variables are highly correlated, k may be small; if features are diverse, expect a larger k. Use the table to justify the trade-off.
Two diagnostics help confirm the choice. The elbow heuristic here uses second differences in adjacent eigenvalues to locate the sharpest change in slope, flagging a point after which additional components add diminishing returns. When you choose the correlation matrix, the Kaiser rule is also shown: keep components with eigenvalue > 1, meaning they explain more variance than an average standardized variable. Treat Kaiser as a guide, not a strict rule.
Preprocessing affects eigenvalues, so check scale and missing data carefully. Mean-centering is appropriate when all columns share comparable units. Z-scoring standardizes columns to unit variance and is preferred for mixed units or different magnitudes; it is automatically applied for correlation analysis. Columns with near-zero variance are dropped because they cannot contribute meaningful directions. For missing cells, you can drop incomplete rows or impute with column means to preserve sample size.
Use downloads to keep results reproducible. The CSV export includes component index, eigenvalue, explained variance percent, and cumulative percent, which can be plotted in spreadsheets. The PDF export places the scree chart and table on an A4 page for reports. Pair the suggested k values with domain constraints, then document your selection and threshold so future datasets can be compared consistently.
Use a rectangular numeric table: rows are observations and columns are variables. Keep units consistent per column, avoid text inside the data area, and include a header only if you enable the header option.
Use correlation when variables have different units or ranges, because it standardizes each column. Use covariance when all variables share comparable scales and you want components influenced by absolute variability.
If a column has near-zero variance, it adds no meaningful direction and can cause numerical issues. The calculator removes such columns and notifies you in Notes.
Elbow detection looks for the largest curvature change using second differences across eigenvalues. It highlights where the curve transitions from steep decline to a flatter tail.
Set “Max Components” to show only the first k components in the plot and table. Computations still use all retained variables; the limit only affects display and exports.
Yes. Scaling changes the variance structure, so eigenvalues differ between centered-only and z-scored data. If you select correlation, z-scoring is applied automatically to make results scale-free.
Important Note: All the Calculators listed in this site are for educational purpose only and we do not guarentee the accuracy of results. Please do consult with other sources as well.