easyds

Personal Python library to make usual data science workflow easy.

I created this package to lessen my time when doing ds task. All of the features that I've established were tested in real world problems. Feel free to contribute and suggest for further improvement.

Installation

pip install easyds

Sample usage

basic_clean

Gives an usual cleaning to your data.

import pandas as pd
from easyds.data_cleaner import basic_clean

# create sample data
df = pd.DataFrame()

str_numbers = ['1', '2', '3']
df['str_Numbers'] = str_numbers

str_date = ['1998/10/29', '1998/10/30', '1998/10/31']
df['STR DaTe'] = str_date

str_float = ['1.0', '1.3', '4.4  ']
df['Str Float'] = str_float

numbers = [1, '2', '3']
df['NuMbeRs'] = numbers

str_Numbers	STR DaTe	Str Float	NuMbeRs
1	1998/10/29	1.0	1
2	1998/10/30	1.3	2
3	1998/10/31	4.4	3

df.dtypes
str_Numbers       object
STR DaTe          object
Str Float         object
NuMbeRs           object

After basic_clean,

str_numbers	str_date	str_float	numbers
1	1998/10/29	1.0	1
2	1998/10/30	1.3	2
3	1998/10/31	4.4	3

df.dtypes
str_numbers       int64
str_date          datetime64[ns]
str_float         float64
numbers           int64

feature_extraction

Clean the corpus input into its most useful form. This is helpful for those interested in text analysis.

Sample usage

import pandas as pd
from easyds.text_cleaner import feature_extraction

# create sample data
df = pd.DataFrame()
comments = ['This is good and easy to install', 'Ang bagal ng internet', 'Sana mas pabilisan pa ang serbisyo']
df['comments'] = comments

# i define my own collection of stop words
stop_words = ['ng', 'ikaw', 'and', 'is', 'pa', 'ang']

comments
This is good and easy to install
Ang bagal ng internet
Sana mas pabilisin pa ang serbisyo

After feature_extraction,

df = feature_extraction(df, 'comments', stop_words)

comments	new_comments
This is good and easy to install	[this, good, easy, install]
Ang bagal ng internet	[bagal, internet]
Sana mas pabilisin pa ang serbisyo	[sana, mas, pabilisin, serbisyo]

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
easyds		easyds
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

easyds

Installation

Sample usage

basic_clean

feature_extraction

About

Releases 2

Packages

Languages

License

king-ds/easyds

Folders and files

Latest commit

History

Repository files navigation

easyds

Installation

Sample usage

basic_clean

feature_extraction

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages