DATA MINING LAB
(2022-2023)
1. Implement all basic R commands. --- Solution
2. Interact with data through .csv files (import from and export to .csv files). --- Solution
3. Get and clean data using swirl exercises (install the ‘swirl’ package, load the library, and install that topic from swirl). --- Solution
4. Visualize all statistical measures (mean, mode, median, range, interquartile range, etc.) using histograms, boxplots, and scatter plots. --- Solution
5. Implement a data frame and its related operations. --- Solution
6. Write an R program using the ‘apply’ group of functions to create and apply a normalization function to each numeric variable (column) of the iris dataset, transforming it into: (a) a 0 to 1 range with min-max normalization; (b) a value around 0 with z-score normalization. --- Solution
7. Create a data frame with 10 observations and 3 variables, and add new rows and columns to it using the ‘rbind’ and ‘cbind’ functions. --- Solution
8. Write an R program to implement linear and multiple regression on the ‘mtcars’ dataset to estimate the value of the ‘mpg’ variable with the best R², and plot the original values in ‘green’ and the predicted values in ‘red’. --- Solution
9. Implement k-means clustering using R. --- Solution
10. Implement k-medoids clustering using R. --- Solution
11. Implement density-based clustering on the iris dataset. --- Solution
12. Implement decision trees using the ‘readingSkills’ dataset. --- Solution
13. Implement decision trees on the ‘iris’ dataset using the ‘party’ and ‘rpart’ packages. --- Solution
14. Use the Corpus() function to create a data corpus, then build a term matrix and reveal word frequencies. --- Solution
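
As a starting point for exercise 6, the two normalizations it names can be sketched as below. This is a minimal sketch rather than the lab's official solution: the helper names min_max and z_score and the variable names are illustrative assumptions, and it uses only base R with the built-in iris dataset.

```r
# Hypothetical helper: min-max normalization, rescales a numeric vector to [0, 1]
min_max <- function(x) {
  (x - min(x)) / (max(x) - min(x))
}

# Hypothetical helper: z-score normalization, centers on 0 with unit standard deviation
z_score <- function(x) {
  (x - mean(x)) / sd(x)
}

# Select only the numeric columns of iris (this drops the Species factor)
num_cols <- sapply(iris, is.numeric)
iris_num <- iris[, num_cols]

# Apply each normalization column by column with lapply
iris_minmax <- as.data.frame(lapply(iris_num, min_max))
iris_z      <- as.data.frame(lapply(iris_num, z_score))

# Inspect the results: min-max columns span 0 to 1, z-score columns have mean near 0
summary(iris_minmax)
summary(iris_z)
```

lapply returns a list with one normalized vector per column, and as.data.frame turns that list back into a data frame with the original column names.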