Cluster observations with maximum pairwise distance logic. Review merges, compare metrics, and export polished results for deeper statistical grouping decisions today.
Use one row per observation in the format: Label, x1, x2, x3.
This sample shows six observations with two variables.
| Label | Variable 1 | Variable 2 |
|---|---|---|
| A | 2 | 3 |
| B | 3 | 4 |
| C | 8 | 7 |
| D | 9 | 8 |
| E | 3 | 2 |
| F | 8 | 9 |
Complete linkage defines the distance between two clusters as the maximum distance between any point in cluster A and any point in cluster B.
Complete Linkage Distance:
D(A,B) = max { d(i,j) } for all points i ∈ A and j ∈ B.
Euclidean distance: d(i,j) = √Σ(xik - xjk)²
Manhattan distance: d(i,j) = Σ|xik - xjk|
Chebyshev distance: d(i,j) = max|xik - xjk|
The algorithm begins with each observation as its own cluster, then merges the pair with the smallest complete linkage distance until the requested cluster count is reached.
It measures the farthest pairwise distance between two clusters. This creates compact clusters and avoids merging groups when any two members are very far apart.
Standardize when variables use different units or scales. Without standardization, large-scale variables can dominate the distance calculation and distort clustering results.
Euclidean works well for geometric similarity. Manhattan is useful for grid-like movement. Chebyshev fits problems where the largest coordinate gap matters most.
Because it uses the maximum inter-cluster distance, it penalizes wide or stretched clusters. That tends to keep merged groups relatively compact and well separated.
It lists each clustering step, the two clusters merged, the merge distance, the resulting cluster, and how many clusters remain afterward.
Large jumps in merge distance often suggest natural separation. A sharp increase can indicate that merging beyond that point forces dissimilar clusters together.
Yes. Each row can contain a label followed by any consistent number of numeric variables. Every row must have the same dimensionality.
They include the merge history summary. This makes it easy to share clustering steps, review merge distances, and keep a record of the analysis.
Important Note: All the Calculators listed in this site are for educational purpose only and we do not guarentee the accuracy of results. Please do consult with other sources as well.