## Categorical Data Types

Understanding data types is crucial for effective data analysis. Data falls into two major types, quantitative and qualitative (categorical), and categorical data is further divided into nominal (no inherent order) and ordinal (ordered categories).

- **Quantitative Data**: Numeric and measurable. Can be discrete (countable values) or continuous (any value within a range).
- **Qualitative Data**: Also called categorical. Values are sorted into groups based on characteristics rather than measured, so they are not always expressible numerically.


### Categorical Data: Nominal and Ordinal

#### Nominal Data

- **Definition**: Nominal data are labels or names that classify observations into categories. The categories have no natural order or ranking.
- **Key Traits**:
  - Cannot be logically ordered
  - Used for classification
  - Examples: Gender (male, female, nonbinary), eye color (brown, blue, green), countries, animal species.
  - Sometimes coded with numbers (e.g., 1=male, 2=female), but these numbers have no arithmetic meaning.
- **Typical Usage**: Often collected in surveys or forms as free-form answers, radio buttons, or multiple-choice groups.
- **Analysis**: Frequency counts, mode, visualization with pie charts and bar charts.
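
The frequency counts and mode mentioned above can be sketched with Python's standard library. This is a minimal illustration, not a prescribed workflow: the `eye_colors` list is made-up sample data, and the variable names are chosen for this example only.

```python
from collections import Counter
from statistics import multimode

# Hypothetical survey responses for a nominal variable (eye color).
eye_colors = ["brown", "blue", "brown", "green", "brown", "blue"]

# Frequency count: how many times each category occurs.
counts = Counter(eye_colors)

# Relative frequency (proportion) of each category.
total = sum(counts.values())
proportions = {color: n / total for color, n in counts.items()}

# Mode: the most frequent category. multimode returns every tied
# category, which matters because nominal data can have several modes.
modes = multimode(eye_colors)

print(counts.most_common())
print(proportions)
print(modes)
```

Note that nothing here sorts or averages the categories: counting and finding the mode are the only summaries that make sense without an order.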

#### Ordinal Data

- **Definition**: Ordinal data are categorical data that can be ranked or ordered. There is a clear hierarchy of levels, but the intervals between categories are unknown or not necessarily equal.
- **Key Traits**:
  - Categories have a logical order (e.g., low < medium < high)
  - No quantifiable distance between categories
  - Examples: Customer satisfaction (low, medium, high), education level (elementary, high school, college), race placement (1st, 2nd, 3rd).
- **Typical Usage**: Commonly used in ratings, rankings, feedback forms, bug severity levels (e.g., minor, major, critical).
- **Analysis**: Median, percentiles, ranking tests, ordinal regression, visuals with ordered bar charts.
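
As a rough sketch of the median and percentile analysis listed above, the example below maps each category to its rank and works on the ranks. The `LEVELS` ordering, the sample `responses`, and the `percentile` helper (a simple nearest-rank method) are assumptions made for illustration.

```python
from math import ceil
from statistics import median_low

# Assumed ordering of the categories, lowest to highest.
LEVELS = ["low", "medium", "high"]
RANK = {level: i for i, level in enumerate(LEVELS)}

# Hypothetical customer-satisfaction responses.
responses = ["high", "low", "medium", "medium", "high", "low", "medium"]

# Sort by rank; a plain alphabetical sort would put "high" before "low".
ordered = sorted(responses, key=RANK.__getitem__)

# median_low always returns an actual data point, so the result maps
# back to a real category instead of a value "between" two categories.
median_level = LEVELS[median_low([RANK[r] for r in responses])]

def percentile(values, p):
    """Nearest-rank percentile: the category at or below which p% of values fall."""
    ranked = sorted(values, key=RANK.__getitem__)
    k = max(1, ceil(p / 100 * len(ranked)))
    return ranked[k - 1]

print(ordered)
print(median_level)
print(percentile(responses, 25), percentile(responses, 75))
```

The ranks are used only to order the values. No step adds or averages them, which keeps the analysis within what ordinal data supports.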

#### Comparing Nominal and Ordinal Data

| Feature           | Nominal Data                     | Ordinal Data                                    |
|-------------------|----------------------------------|--------------------------------------------------|
| Order             | No inherent order                | Ordered/ranked categories                        |
| Numeric Meaning   | None                             | Order only (codes convey rank, not magnitude)    |
| Analysis Method   | Frequency, mode                  | Ranking, median, non-parametric tests            |
| Example           | Gender, colors, types of fruits  | Grades, satisfaction level, economic status      |
| Visualization     | Bar chart, pie chart             | Ordered bar chart, median ranking charts         |


### Additional Points

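- **Both nominal and ordinal data usually call for non-parametric methods**: Categories are not measured on an interval scale, so tests that assume normally distributed numeric values (such as the t-test) generally do not apply. Chi-square tests suit nominal data, and rank-based tests such as Mann-Whitney U or Kruskal-Wallis suit ordinal data.
- **Ordinal data are sometimes assigned numeric codes**: These codes should be used only for ordering and comparison, not for arithmetic such as averaging, because the spacing between codes is arbitrary.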
- **Survey Insights**:
  - Nominal questions allow respondents to express unique, unranked categories.
  - Ordinal questions restrict responses to predefined ordered options, enabling concise and targeted analysis.
- **Graphical Analysis**: Bar and pie charts work for both types; for ordinal data, arrange the bars in category order so the ranking is visible.
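
To illustrate why numeric codes for ordinal data should be used for ordering rather than arithmetic, the sketch below applies two different codings to the same responses. Both codings are hypothetical and both preserve low < medium < high. The mean changes with the coding, while the median category does not.

```python
from statistics import mean, median_low

# Hypothetical ordinal responses.
responses = ["low", "low", "medium", "high", "high", "high"]

# Two codings that both preserve the order low < medium < high.
coding_a = {"low": 1, "medium": 2, "high": 3}
coding_b = {"low": 1, "medium": 2, "high": 10}

def summarize(coding):
    """Return (mean of codes, median category) under the given coding."""
    codes = [coding[r] for r in responses]
    decode = {code: label for label, code in coding.items()}
    return mean(codes), decode[median_low(codes)]

mean_a, median_a = summarize(coding_a)
mean_b, median_b = summarize(coding_b)

print(mean_a, median_a)
print(mean_b, median_b)
```

Because the mean depends on an arbitrary choice of code spacing, it says more about the coding than about the data. The median depends only on the order, so it is the same under any order-preserving coding.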


### Summary

Nominal and ordinal data are foundational for exploratory data analysis, helping data scientists understand, classify, and organize information. Nominal data consists of pure categories with no order, whereas ordinal data has a meaningful order but undefined spacing between categories. Recognizing the difference helps guard against incorrect conclusions and points to the right statistical approach in analysis and machine learning projects.
