Skip to content

almirgouvea/The-Crisp-DM-Methodology

Repository files navigation

CRISP-DM

This project contains guidance on the important steps of the CRISP-DM cycle.

What is the CRISP-DM cycle?

The CRoss-Industry Standard Process for Data Mining (CRISP-DM) is a standard process model that describes common approaches to conducting a data mining project.

CRISP-DM organizes the data mining process into six phases: business understanding, data understanding, data preparation, modeling, evaluation, and deployment.

Phases of the CRISP-DM

Figure: Phases of the CRISP-DM

1. Business Understanding

This phase aims to understand the project objectives and requirements from a business perspective, converting the knowledge into a data mining problem definition and then developing a preliminary plan to achieve the objectives.

2. Data Understanding

This phase aims to increase familiarity with the data, to identify data quality problems, to discover initial insights into the data and detect interesting subsets to form hypotesis about hidden information.

3. Data Preparation

This phase aims to build the final dataset to be used in the modeling tools.

4. Modeling

This phase aims to select and apply modeling techniques to calibrate their parameters to optimal values.

5. Evaluation

This phase aims to evaluate the model and review the model construction steps to ensure it adequately achieves business objectives and also to verify if considered all business issues.

6 Deployment

This phase aims to organize and present the knowledge acquired with the project so that the client can use the created models in the organization’s decision-making processes.

Project

All the steps are explained in detail on the link below.

About

This repository contains the Crisp-DM framework developed on Jupyter book

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published