Release v0.1.0 · Rickyy-Sam07/CleansiPy

📦 Puripy v0.1.0 – First Public Release

Welcome to the initial release of Puripy, a modular and powerful Python package designed for cleaning and preprocessing messy data across Text, Categorical, Numerical, and Datetime fields.

Features

Text Cleaner

✅ Contraction expansion, emoji/URL/HTML removal
✅ Stopword removal, stemming, lemmatization
✅ Spelling correction, profanity filtering, n-gram generation
✅ Auto column detection & parallel processing

🏷️ Categorical Cleaner

✅ Fuzzy typo correction with thefuzz
✅ Rare category grouping
✅ OneHot, Ordinal, and Label encoding via sklearn
✅ Text normalization and full reporting

Numerical Cleaner

✅ Missing value imputation (mean, median, mode)
✅ Outlier handling (IQR method)
✅ Type conversion and precision control
✅ Duplicate detection and domain rule enforcement

Datetime Cleaner

✅ Flexible datetime parsing and fuzzy matching
✅ Timezone normalization
✅ Missing date imputation using STL decomposition
✅ Feature extraction (year, month, day, quarter, fiscal, etc.)

What's New in `v0.1.0`?

Initial release with full support for text, categorical, numerical, and datetime cleaning.
Built-in support for parallel processing and logging.
Highly customizable pipelines using configuration dictionaries.
Auto-generated cleaning reports for auditability.

Tech Stack

pandas, numpy, nltk, textblob, sklearn, emoji, contractions, better_profanity, tqdm, joblib, pytz, statsmodels, and more.

Known Notes

This is a pre-1.0 release — APIs and behavior might change in future versions.
Ideal for testing, experimentation, and feedback.

Contribute

Feedback, issues, and pull requests are welcome!
Star ⭐ the repo and help shape Puripy into a go-to tool for data cleaning.

Let me know if you'd also like to generate a sample CHANGELOG.md or GitHub Action workflow for automated releases.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.1.0

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

📦 Puripy v0.1.0 – First Public Release

Features

Text Cleaner

🏷️ Categorical Cleaner

Numerical Cleaner

Datetime Cleaner

What's New in `v0.1.0`?

Tech Stack

Known Notes

Contribute

Uh oh!

v0.1.0

📦 Puripy v0.1.0 – First Public Release

Features

Text Cleaner

🏷️ Categorical Cleaner

Numerical Cleaner

Datetime Cleaner

What's New in v0.1.0?

Tech Stack

Known Notes

Contribute

Uh oh!

What's New in `v0.1.0`?