Syllabus
Department(s): MCA
Semester: 2
Section(s):A, B
Lectures/week: 04
Subject: Data Mining with Business Intelligence
Code: 22MCA252
Course Instructor(s): Dr.Gnaneswari
Data Mining and Business Intelligence
Course Code 22MCA252
CIE Marks 50
Teaching Hours/Week (L:P:SDA) 2:0:2
SEE Marks 50
Total Hours of Pedagogy 40
Course Learning objectives:
· To introduce the concept of data Mining as an important tool for enterprise data management and as a cutting edge technology for building competitive advantage.
· To enable students to effectively identify sources of data and process it for data mining.
· To impart skills that can enable students to approach business problems analytically by identifying opportunities to derive business value from data.
· Learning how to gather and analyse large sets of data to gain useful business understanding.
Module-1
Overview and concepts Data Warehousing and Business Intelligence: Why reporting and Analysing data, Raw data to valuable information-Lifecycle of Data - What is Business Intelligence - BI and DW in today‟s perspective - What is data warehousing - The building Blocks: Defining Features - Data warehouses and data 1marts - Overview of the components - Metadata in the data warehouse - Need for data warehousing - Basic elements of data warehousing - trends in data warehousing. The Architecture of BI and DW BI and DW architectures and its types - Relation between BI and DW - OLAP (Online analytical processing) definitions - Difference between OLAP and OLTP - Dimensional analysis - What are cubes? Drill-down and roll-up - slice and dice or rotation - OLAP models - ROLAP versus MOLAP - defining schemas: Stars, snowflakes and fact constellations.
Module-2
Introduction to data mining (DM): Motivation for Data Mining - Data Mining-Definition and Functionalities – Classification of DM Systems - DM task primitives - Integration of a Data Mining system with a Database or a Data Warehouse - Issues in DM – KDD Process Data Pre-processing:Why to pre-process data? - Data cleaning: Missing Values, Noisy Data - Data Integration and transformation - Data Reduction: Data cube aggregation, Dimensionality reduction - Data Compression - Numerosity Reduction - Data Mining Primitives - Languages and System Architectures: Task relevant data - Kind of Knowledge to be mined - Discretization and Concept Hierarchy.
Module-3
Concept Description and Association Rule Mining What is concept description? - Data Generalization and summarization-based characterization - Attribute relevance - class comparisons Association Rule Mining: Market basket analysis - basic concepts - Finding frequent item sets: Apriori algorithm - generating rules – Improved Apriori algorithm – Incremental ARM – Associative Classification – Rule Mining.
Module-4
Classification and prediction: What is classification and prediction? – Issues regarding Classification and prediction: Classification methods: Decision tree, Bayesian Classification, Rule based, CART, Neural Network Prediction methods: Linear and nonlinear regression, Logistic Regression. Introduction of tools such as DB Miner /WEKA/DTREG DM Tools.
Module-5
Data Mining for Business Intelligence Applications: Data mining for business Applications like Balanced Scorecard,
Fraud Detection, Click stream Mining, Market Segmentation, retail industry, telecommunications industry, banking & finance and CRM etc., Data Analytics Life Cycle: Introduction to Big data Business Analytics - State of the practice in
analytics role of data scientists Key roles for successful analytic project - Main phases of life cycle - Developing core deliverables for stakeholders.
Course Plan