Skip to content

qltf8/1004_LPL_project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spark-lean

Spark-lean, an interactive PySpark-based Data Cleaning Library

Features

  • Data versioning
  • Missing value detection
  • Text cleaning
  • Featurization
  • String Matching
  • Anomaly detectation

Installation

pip install spark-lean

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages