Instructor: Dr. Yong-Mi Kim
The objectives of this course were built upon the foundation laid by the Introduction to Data Analytics course I completed earlier in my degree program. By the conclusion of this course, the aim was to enhance our proficiency in selecting analytical tools suitable for the data at hand and also be able to interpret the processed data using the most effective methods available. The assignment presented below offers an illustration of how the Random Forest method operates, generating decision trees based on provided data. I have chosen to highlight this assignment specifically because I believe it has the potential to make some decision-making processes easier within library settings (Personal Goals 1.2 and 1.3).
Analysis
As mentioned in the description, the assignment above demonstrates the application of the Random Forest method in the software "R" for generating decision trees. The Random Forest method involves splitting provided datasets into training and testing sets. It then utilizes one of these datasets to construct decision trees that evaluate each provided factor's importance in determining the outcome. This approach could prove valuable in collection development within academic libraries. In real-world practice, a list of books and their attributes could be compiled into a spreadsheet and analyzed using "R"(OU SLO 5 - Methods and Analysis). Each attribute of the books would be assessed to determine its significance to the collection's needs. Decision trees would be generated to illustrate how each attribute compares to the others. The combined insights from these decision trees might reveal, for instance, that books currently trending in popularity are more crucial additions to the collection than those belonging to the science-fiction genre (Personal Goal 1.3).
Reflection
This course was one of the most significant challenges for me throughout my time in the MLIS program. However, it was because of these challenges that it proved to be incredibly rewarding. I encountered opportunities to push myself beyond my previous limits and truly embraced the concept of learning from my own mistakes. The assignment I've included required a substantial amount of time and involved a trial-and-error process to accurately execute the required tasks. Upon completion, I came to appreciate the potential benefits it could offer to individuals seeking assistance in decision-making. Furthermore, it underscored the ongoing importance and indispensability of data in libraries within an ever-evolving landscape.