- Cross Industry Standard Process for Data Mining (CRISP-DM) is a methodology that describes the approach use in tackling data mining problems
- Phases of CRISP-DM
- Phase 1 – Business Understanding
- Determine business objectives
- Assess situation
- Determine data mining goals
- Produce project plan
- Phase 2 – Data Understanding
- Collect initial data
- Describe and explore data
- Verify data quality
- Phase 3 – Data Preparation
- Select data and Clean data
- Construct and integrate data
- Format data
- Phase 4 – Modeling
- Select the modeling technique
- Generate test design
- Build and assess model
- Phase 5 – Evaluation
- Evaluate results
- Review process and determine next steps
- Phase 6 – Deployment
- Plan deployment
- Plan monitoring and maintenance
- Produce final report and presentation
- Review project