Introduction

Rule-based classification is a method that assigns observations to classes by applying an explicit, predetermined set of rules to their features. Once the rules are defined, new data can be classified with them directly.
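
To make the idea concrete, here is a minimal sketch of a rule-based classifier. The function name, the thresholds, and the labels are all hypothetical and chosen only for illustration; they are not taken from the project.

```python
def classify_customer(age: int, source: str) -> str:
    """Assign a label using fixed, hand-written rules (hypothetical values)."""
    # Rules are checked in order; the first one that matches wins.
    if age < 18:
        return "minor"
    if source == "ios" and age < 30:
        return "young_ios"
    # Fallback when no rule matches.
    return "other"


print(classify_customer(25, "ios"))
```

The point is that nothing is learned from data here: the rules are written down in advance and then applied to each new record.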
A game company wants to estimate how much income, on average, a new customer will bring in. To do this, the company wants us to create level-based customer definitions from some of the customers' features, group these definitions into segments, and estimate the average income a new customer is likely to generate based on the segment they fall into.
The Persona.csv dataset contains the prices of the products sold by an international gaming company and some demographic information about the users who bought them. Each row records a single sales transaction, so the table is not deduplicated: a user with a given set of demographic characteristics may appear in several rows because they made multiple purchases.
| Column | Description | Data Type |
|---|---|---|
| Price | Amount spent by the customer | Numeric |
| Source | Type of device the customer connected from | Nominal |
| Sex | Gender of the customer | Nominal |
| Country | Country of the customer | Nominal |
| Age | Age of the customer | Numeric |
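
The workflow described above (collapse the transaction rows into level-based customer definitions, split them into segments, then look up a new customer) could be sketched with pandas roughly as follows. This is an illustration under stated assumptions, not the project's actual code: the rows are made-up toy data that only reuse the column names from the table, and the age bands, the number of segments, and the segment labels are hypothetical choices.

```python
import pandas as pd

# Toy rows with the same columns as the table above. The values are invented
# for illustration and are NOT taken from Persona.csv.
df = pd.DataFrame({
    "Price":   [39, 49, 29, 19, 59, 9, 49, 19],
    "Source":  ["android", "android", "ios", "ios", "ios", "android", "ios", "android"],
    "Sex":     ["male", "male", "female", "female", "male", "female", "male", "female"],
    "Country": ["usa", "usa", "bra", "bra", "deu", "tur", "deu", "tur"],
    "Age":     [17, 17, 26, 26, 35, 45, 35, 45],
})

# Assumed age bands; the source does not specify any cut points.
bins = [0, 18, 23, 30, 40, 120]
labels = ["0_18", "19_23", "24_30", "31_40", "41_120"]
df["AgeBand"] = pd.cut(df["Age"], bins=bins, labels=labels)

# Level-based definition: one string per combination of feature levels.
df["Persona"] = (
    df["Country"] + "_" + df["Source"] + "_" + df["Sex"] + "_"
    + df["AgeBand"].astype(str)
).str.upper()

# Several transactions can share a persona, so aggregate to one row each.
personas = df.groupby("Persona", as_index=False)["Price"].mean()

# Split personas into four segments by average price (assumed: quartiles,
# with "A" as the highest-spending segment).
personas["Segment"] = pd.qcut(personas["Price"], 4, labels=["D", "C", "B", "A"])


def estimate(persona: str) -> pd.DataFrame:
    """Return the average price and segment for a given persona string."""
    return personas[personas["Persona"] == persona]


print(estimate("DEU_IOS_MALE_31_40"))
```

Aggregating before segmenting matters here because the raw table has one row per purchase; segmenting the raw rows would weight frequent buyers more heavily than the customer definitions themselves.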