Silhouette analysis can be used to study the separation distance between the resulting clusters. The silhouette plot displays a measure of how close each point in one cluster is to points in the neighboring clusters and thus provides a way to assess parameters like number of clusters visually. This measure has a range of [-1, 1].
Silhouette coefficients (as these values are referred to as) near +1 indicate that the sample is far away from the neighboring clusters. A value of 0 indicates that the sample is on or very close to the decision boundary between two neighboring clusters and negative values indicate that those samples might have been assigned to the wrong cluster.
In the field of machine learning, one of the key objectives is to achieve accurate predictions. These predictions can be achieved through various learning algorithms, categorized as supervised and unsupervised.
K-means clustering is one such unsupervised algorithm that groups data points based on their similarity. In this blog, we will explore the use cases, advantages, and working principles of the K-means clustering algorithm.
How to Run Cluster Analysis in Excel
This is a step by step guide on how to run k-means cluster analysis on an Excel spreadsheet from start to finish. Please note that there is an Excel template that automatically runs cluster analysis available for free download on this website. But if you want to know how to run a k-means clustering on Excel yourself, then this article is for you.