IMPORTANT: This package has been renamed to
ride-cli
. Please use the new package for all future installations and updates.
To migrate to the new package:
# Uninstall the old package
pip uninstall prepup-linux
# Install the new package
pip install ride-cli
All functionality remains the same. The only change is the package name and command:
- Old command:
prepup
- New command:
ride
orride-cli
Prepup began in summer 2023 as the Preprocessing Utility Package (PrePUP) with just 5 terminal flagsโa learning project that evolved into a comprehensive data tool. After creating prepup-linux to address cross-platform compatibility issues, we realized the name incorrectly suggested Linux exclusivity, when our vision has always been platform independence. We also tested our first menu-driven approach in prepup-linux. We're now transitioning to RIDE-CLI (Rapid Insights Data Engine), a name that better reflects our tool's capabilities: rapid data preprocessing, meaningful insights generation, and cross-platform functionality. This rebranding represents our growth from a simple utility to a robust data engine, while maintaining our commitment to continuous improvements and expanded features across all platforms.
Prepup is a powerful, user-friendly data preprocessing tool designed to simplify and streamline your data analysis workflow directly from the terminal. Whether you're a data scientist, analyst, or researcher, Prepup provides an intuitive interface for exploring, cleaning, and preparing your datasets.
- ๐ Load datasets from various formats (CSV, Excel, Parquet)
- ๐ Comprehensive data inspection
- ๐ Advanced data exploration
- ๐งน Missing value handling
- ๐ Feature visualization
- ๐ค Automatic Machine Learning (AutoML) model selection
- Data Loading
- Feature Inspection
- Correlation Analysis
- Distribution Checking
- Outlier Detection
- Missing Value Imputation
- Feature Standardization
- Automatic Model Training
โ ๏ธ Important: Creating a virtual environment is highly recommended when installing prepup-linux. As a data processing library, it has various dependencies that may conflict with your existing packages.
# Create virtual environment
python -m venv prepup-env
# Activate virtual environment
prepup-env\Scripts\activate
# Deactivate when done
deactivate
# Create virtual environment
python3 -m venv prepup-env
# Activate virtual environment
source prepup-env/bin/activate
# Deactivate when done
deactivate
# Inside your activated virtual environment
pip install prepup-linux
# Inside your activated virtual environment
git clone https://github.com/sudhanshumukherjeexx/prepup-linux.git
cd prepup-linux
pip install .
prepup
prepup path/to/your/dataset.csv
- Load Dataset
- Inspect Data
- Explore Data
- Visualize Data
- Impute Missing Values
- Standardize Features
- Export Data
- AutoML (Train & Evaluate Models)
-
Launch Prepup
prepup
-
Load Your Dataset: Choose option 1 and enter your dataset path
-
Inspect Data: Use option 2 to explore features, data types, and missing values
-
Preprocess: Impute missing values | Standardize features
-
Analyze: Visualize data distributions | Perform correlation analysis | Run AutoML for model selection
- Supports both Classification and Regression tasks
- Evaluates multiple machine learning algorithms
- Provides performance metrics
- Saves results to CSV
- NumPy
- Pandas
- Scikit-learn
- Matplotlib
- and more (see requirements.txt)
Contributions are welcome! Please feel free to submit a Pull Request.
- Fork the repository
- Create your feature branch (
git checkout -b feature/AmazingFeature
) - Commit your changes (
git commit -m 'Add some AmazingFeature'
) - Push to the branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
Distributed under the MIT License. See LICENSE
for more information.
This package is deprecated and will no longer receive updates. Please migrate to ride-cli for the latest features and support.