SSK4604-Data Mining - Agglomerative Clustering In Rapidminer

Introduction To Dataset

This tutorial uses the StudentEvent dataset from the local repository. The values in this dataset have already been standardized. The dataset contains 35 rows and 11 columns.
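The tutorial does not say how the values were standardized. As a rough illustration only, the sketch below assumes z-score standardization (subtract the mean, divide by the standard deviation). The `standardize` helper and the `raw` values are hypothetical and are not taken from StudentEvent.

```python
from statistics import mean, pstdev


def standardize(values):
    """Return z-scores: (x - mean) / population standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # A constant column carries no spread; map every value to 0.
        return [0.0 for _ in values]
    return [(x - mu) / sigma for x in values]


# Hypothetical raw column (assumption, not real StudentEvent data).
raw = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
z = standardize(raw)
print(z)
```

After this transformation every column has mean 0 and standard deviation 1, so no single attribute dominates the distance calculation used by the clustering.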

Step 1: Choose The Dataset

Create an empty process and drag the StudentEvent dataset onto the blank process. This creates a Retrieve operator.

Step 2: Select Attribute

Use the Select Attributes operator to choose which attributes are analyzed. Connect the Retrieve operator to the Select Attributes operator and select the attributes. In this analysis, the Marks and MarksBin attributes are not analyzed.

Setting Parameter

Choose the subset filter type and enable the invert selection option.

Select Attributes

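Move the unwanted attributes (Marks and MarksBin) to the list on the right side. Because invert selection is enabled, the attributes in this list are the ones removed from the data.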
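The effect of subset plus invert selection can be sketched in plain Python. This is only an illustration of the idea, not how RapidMiner implements it. The `select_attributes` helper, the row values, and the `Feature1` and `Feature2` columns are hypothetical; only Student ID, Grade, Marks, and MarksBin are attribute names from this tutorial.

```python
# Hypothetical rows (assumption): real StudentEvent values are not shown here.
rows = [
    {"Student ID": "S01", "Grade": "A", "Marks": 85, "MarksBin": "high",
     "Feature1": 0.4, "Feature2": -1.2},
    {"Student ID": "S02", "Grade": "B", "Marks": 62, "MarksBin": "mid",
     "Feature1": -0.7, "Feature2": 0.3},
]


def select_attributes(rows, subset, invert=False):
    """Keep the attributes named in `subset`, or drop them when invert is True."""
    if invert:
        return [{k: v for k, v in row.items() if k not in subset} for row in rows]
    return [{k: v for k, v in row.items() if k in subset} for row in rows]


# Subset = the unwanted attributes, inverted so that they are removed.
selected = select_attributes(rows, {"Marks", "MarksBin"}, invert=True)
print(list(selected[0].keys()))
```

Listing the two unwanted attributes and inverting is shorter than listing every attribute to keep, which is why the tutorial uses the inverted subset.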

Step 3: Set Role

Add the Set Role operator and connect it to the Select Attributes operator. Use the Set Role operator to set Student ID as id (identifier) and Grade as label. With these roles, the two attributes are no longer treated as regular features, so they are excluded from the clustering.

Setting Parameter

Set Grade as label.

Select Attributes

Set Student ID as id.
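A minimal sketch of what the role assignment achieves: the id and label values are set aside, and only the remaining regular attributes are used as features. The `split_by_role` helper, the sample rows, and the `Feature1` and `Feature2` columns are hypothetical names introduced for illustration.

```python
# Hypothetical rows (assumption), after Marks and MarksBin were removed.
rows = [
    {"Student ID": "S01", "Grade": "A", "Feature1": 0.4, "Feature2": -1.2},
    {"Student ID": "S02", "Grade": "B", "Feature1": -0.7, "Feature2": 0.3},
]

# Roles as set in this step; every other attribute stays "regular".
roles = {"Student ID": "id", "Grade": "label"}


def split_by_role(rows, roles):
    """Separate id values, label values, and regular feature values."""
    ids, labels, features = [], [], []
    for row in rows:
        regular = {}
        for name, value in row.items():
            role = roles.get(name, "regular")
            if role == "id":
                ids.append(value)
            elif role == "label":
                labels.append(value)
            else:
                regular[name] = value
        features.append(regular)
    return ids, labels, features


ids, labels, features = split_by_role(rows, roles)
print(ids, labels, features)
```

Keeping the id and label alongside the features, rather than deleting them, means each clustered row can still be traced back to a student and compared with the grade afterwards.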

Step 4: Setting Cluster Parameter

Connect the Set Role operator to the Cluster (Agglomerative Clustering) operator. This operator performs agglomerative clustering, a bottom-up strategy of hierarchical clustering: each example starts as its own cluster, and the closest clusters are merged step by step.
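The bottom-up strategy can be sketched in a few lines of plain Python. This is a naive illustration, not RapidMiner's implementation. It assumes single linkage and Euclidean distance, which the tutorial does not specify; the `agglomerate` function and the sample `points` are hypothetical.

```python
from math import dist


def single_link(cluster_a, cluster_b, points):
    """Single-link distance: smallest distance between members of two clusters."""
    return min(dist(points[i], points[j]) for i in cluster_a for j in cluster_b)


def agglomerate(points):
    """Repeatedly merge the two closest clusters; return the merge history.

    Each history entry is (members_a, members_b, merge_distance).
    """
    clusters = [[i] for i in range(len(points))]  # every point starts alone
    history = []
    while len(clusters) > 1:
        # Search all pairs for the closest two clusters.
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = single_link(clusters[a], clusters[b], points)
                if best is None or d < best[0]:
                    best = (d, a, b)
        d, a, b = best
        history.append((clusters[a], clusters[b], d))
        merged = clusters[a] + clusters[b]
        # Delete b first (b > a), so index a still points at the same cluster.
        del clusters[b]
        clusters[a] = merged
    return history


# Hypothetical 2-D points (assumption, not StudentEvent data).
points = [(0.0, 0.0), (0.0, 1.0), (5.0, 5.0), (5.0, 7.0), (20.0, 20.0)]
history = agglomerate(points)
for members_a, members_b, d in history:
    print(members_a, "+", members_b, "at distance", round(d, 3))
```

The merge history is the hierarchical cluster model: it records every merge and its distance, and it has no fixed number of clusters until it is flattened.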

Step 5: Flatten Clustering

The Flatten Clustering operator creates a flat cluster model from the given hierarchical cluster model by expanding nodes in the order of their distance until the desired number of clusters (specified by the number of clusters parameter) is reached. For this operator, we set the number of clusters parameter to 4.
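A comparable flattening step can be sketched with SciPy, which cuts a hierarchy so that at most a given number of flat clusters remain. This is an analogy under stated assumptions, not RapidMiner's own code: single linkage and Euclidean distance are assumed, and the matrix `X` is hypothetical data chosen so that four groups are clearly separated.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage

# Hypothetical feature matrix (assumption): four well-separated pairs of points.
X = np.array([
    [0.0, 0.0], [0.1, 0.2],
    [5.0, 5.0], [5.2, 4.9],
    [10.0, 0.0], [10.1, 0.3],
    [0.0, 10.0], [0.2, 10.1],
])

# Build the hierarchy (one row per merge: cluster a, cluster b, distance, size).
Z = linkage(X, method="single", metric="euclidean")

# Flatten the hierarchy into at most 4 clusters.
labels = fcluster(Z, t=4, criterion="maxclust")
print(labels)
```

The hierarchy is built once and can be flattened to any number of clusters afterwards, so trying a different number only requires changing `t`.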

Step 6: Run Agglomerative Model

Step 7: Results And Visualization

Agglomerative Tree View

This is the tree view of the Agglomerative Clustering model after it has been flattened to 6 clusters. It shows that Cluster 1 is the largest cluster, with 29 members.
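Cluster sizes like the one reported above can be checked by counting the cluster assignment of each example. The sketch below shows the idea; the `cluster_sizes` helper and the `assignments` list are hypothetical and do not reproduce the StudentEvent result.

```python
from collections import Counter


def cluster_sizes(assignments):
    """Count the members of each cluster, largest cluster first."""
    return Counter(assignments).most_common()


# Hypothetical cluster assignments (assumption, one entry per example).
assignments = [
    "cluster_1", "cluster_0", "cluster_1",
    "cluster_2", "cluster_1", "cluster_1",
]
sizes = cluster_sizes(assignments)
for name, count in sizes:
    print(name, count)
```

One cluster holding most of the examples, with the rest nearly empty, is worth noticing when reading the tree view, because it suggests the flat clusters are very unbalanced.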

Next Topic: Agglomerative Clustering in Python

Copyright by 199607-Build using sites.google.com