Data Complexity Measures

This repository contains code, examples, and datasets related to data complexity measures, specifically focusing on the following two indices:

Separation Index: It shows that how much input data points separate the labels from each others.

Smoothness Index: It shows that how much input data points make the output targets smooth

Introduction

Data complexity measures play a crucial role in various machine learning and data analysis tasks. This repository provides implementations and resources for two important complexity indices: Separation Index and Smoothness Index. These indices help in understanding the structure and characteristics of datasets, which can be useful in feature selection, model selection, and data preprocessing.

Getting Started

To get started with using the code and exploring the data complexity measures, follow these steps:

Clone this repository to your local machine:

git clone https://github.com/Arhosseini77/data_complexity_measures

Install the required dependencies by running:

pip install -r requirements.txt

3 . Explore the code and documentation in the repository to understand how to use the data complexity measures in your projects.

Usage

You can use the provided code and functions to calculate Separation Index and Smoothness Index for your datasets. Detailed usage instructions and examples are available in the Examples section below.

Examples

To see how to use Separation Index and Smoothness Index in practice, refer to the Examples directory. You will find Jupyter Notebook files with step-by-step demonstrations of these measures on sample datasets.

Data (Soon)

Sample datasets for testing the data complexity measures are available in the data directory. You can use these datasets to experiment with the provided code and to understand how the indices work.

Contributing

If you'd like to contribute to this project, please follow our contribution guidelines. We welcome contributions in the form of bug reports, feature requests, code improvements, and more.

License

This project is licensed under the MIT License - see the LICENSE file for details.T

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
example		example
feature_selection		feature_selection
images		images
models		models
relative_density		relative_density
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data Complexity Measures

Table of Contents

Introduction

Getting Started

Usage

Examples

Data (Soon)

Contributing

License

About

Releases

Packages

Languages

Arhosseini77/data_complexity_measures

Folders and files

Latest commit

History

Repository files navigation

Data Complexity Measures

Table of Contents

Introduction

Getting Started

Usage

Examples

Data (Soon)

Contributing

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages