PACKAGE: MTU CLUSTER
ANALYSIS: 1. AGGLOMERATIVE HIERARCHICAL CLUSTERING
2. DIVISIVE HIERARCHICAL CLUSTERING
AUTHOR: - Revanth Chappidi, Dhairya Kothari, Akshay Gaonkar, Bharath Inti
CONTACT AT: - chappidi@mtu.edu, dmkothar@mtu.edu, adgaonka@mtu.edu, binti@mtu.edu
MANAGED BY: - Dr. Shane T. Mueller (shanem@mtu.edu ) ; Revanth Chappidi, Dhairya Kothari, Akshay Gaonkar, Bharath Inti
HIERARCHICAL CLUSTERING
Hierarchical clustering is an alternative approach to k-means clustering for identifying groups in the dataset. It does not require to pre-specify the number of clusters to be generated. The result is a tree-based representation of the observations which is called a dendrogram. It uses pairwise distance matrix between observations as clustering criteria. Below are the two types of Hierarchical clustering :
1. Agglomerative clustering:
It is a bottom up process in which pairs of clusters having minimum intraclass distance are merged successively until all the clusters have been merged into a single cluster containing all the data. agnes is the agglomerative nesting method which can be deployed for executing agglomerative clustering in R. The function prototype is as follows:
x is the distance among the observations and it can be calculated using following options:
agnes(x, method = “single”, …)
method is the types of linkages. The linkages differ in the way they merge the clusters.
2. Divisive Clustering:
It is a top down process in which the data is splitted continuously until all the data tuples are segregated. diana is the divisive hierarchical clustering method in R. The function prototype is as follows:
x is the distance among the observations and it can be calculated using following options:
diana(x,)
USE CASE:
Consider five points: A, B, C, D, E. Each of these points are having some similarities between them - W (Weight between the components). The following image explains the workflow of clustering the components with hierarchical clustering:
The above (representative) image explains the clustering of components: A, B, C, D, E with some linkage technique. The steps also shows how the formation of dendrogram takes place.
Dendrogram is shown in the image below.
Steps to use Hierarchical clustering (in Jamovi).