Spark-lean Spark-lean, an interactive PySpark-based Data Cleaning Library Features Data versioning Missing value detection Text cleaning Featurization String Matching Anomaly detectation Installation pip install spark-lean