Data Preprocessing Module

This Python module preprocesses a CSV dataset whose last column holds categorical data. It uses scikit-learn, pandas, and NumPy.

The following preprocessing steps are performed in order:

Importing Dataset
Missing Value Treatment
Encoding Categorical Data
Splitting Dataset into Training and Testing Sets
Feature Scaling using Standard Scaler
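
The five steps above can be sketched roughly as follows. This is an illustration built on pandas and scikit-learn, not processdat's own code: the function name preprocess_sketch, the mean-imputation strategy, the 80/20 split, and the assumption that all feature columns are numeric are choices made for this example only.

```python
import pandas as pd
from sklearn.impute import SimpleImputer
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelEncoder, StandardScaler


def preprocess_sketch(csv_path, test_size=0.2, random_state=0):
    """Hypothetical stand-in for the five steps; not processdat's implementation."""
    # 1. Import the dataset: every column but the last is treated as a feature.
    df = pd.read_csv(csv_path)
    X = df.iloc[:, :-1].to_numpy(dtype=float)  # assumes numeric feature columns
    y = df.iloc[:, -1].to_numpy()

    # 2. Missing value treatment: fill NaN with the column mean (assumed strategy).
    X = SimpleImputer(strategy="mean").fit_transform(X)

    # 3. Encode the categorical last column as integer labels.
    y = LabelEncoder().fit_transform(y)

    # 4. Split into training and testing sets.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=test_size, random_state=random_state
    )

    # 5. Feature scaling: fit the scaler on the training set, apply it to both sets.
    scaler = StandardScaler()
    X_train = scaler.fit_transform(X_train)
    X_test = scaler.transform(X_test)
    return X_train, X_test, y_train, y_test
```

Fitting the scaler on the training set only keeps statistics of the test rows out of the scaling parameters.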

Input and Output

The input to the sole function, preprocess, is the path to a CSV dataset.
The return value is a tuple of four values in the style of scikit-learn's train_test_split, i.e. X_train, X_test, y_train, y_test.
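
For readers unfamiliar with that ordering, the short example below calls scikit-learn's train_test_split directly on toy arrays. The data and the test_size of 0.2 are made up for illustration; the split ratio processdat uses is not stated here.

```python
import numpy as np
from sklearn.model_selection import train_test_split

X = np.arange(20).reshape(10, 2)  # 10 rows, 2 feature columns (toy data)
y = np.arange(10) % 2             # 10 integer labels (toy data)

# train_test_split returns the two splits of each input in order:
# both feature arrays first, then both label arrays.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

print(X_train.shape, X_test.shape)  # (8, 2) (2, 2)
print(y_train.shape, y_test.shape)  # (8,) (2,)
```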

Module Installation from PyPI

$ pip install processdat

Usage

import processdat as pro

...
X_train, X_test, y_train, y_test = pro.preprocess('Data.csv')
...
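
To try the call above without a dataset at hand, a small file in the expected layout (feature columns first, categorical column last) could be generated with the standard library. The column names, the values, and the presence of a header row are assumptions made for this sketch; write_sample_csv is a hypothetical helper, not part of processdat.

```python
import csv

# Hypothetical columns: two numeric features, then the categorical target last.
rows = [
    ["Age", "Salary", "Purchased"],
    [44, 72000, "No"],
    [27, 48000, "Yes"],
    [30, "", "No"],  # the empty cell stands for a missing value
    [38, 61000, "No"],
    [35, 58000, "Yes"],
]


def write_sample_csv(path="Data.csv"):
    """Write the sample rows to `path` and return the path."""
    with open(path, "w", newline="") as f:
        csv.writer(f).writerows(rows)
    return path
```

Calling write_sample_csv() creates Data.csv in the current directory, which can then be passed to pro.preprocess.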

Developing processdat

To install processdat in editable mode, along with the tools needed to develop and run tests, run the following in your terminal:

$ pip install -e .[dev]

Created by: Shayan Banerjee (shayanbanerhee96@gmail.com)
