Skip to content

GhostFlow v0.1.0 - Initial Release

Choose a tag to compare

@choksi2212 choksi2212 released this 03 Jan 10:48
· 72 commits to main since this release

๐ŸŒŠ GhostFlow v0.1.0 - Initial Release

Overview

Production-ready machine learning framework in pure Rust with GPU acceleration.

โœจ Features

Core Capabilities

  • Tensor Operations: Multi-dimensional arrays with SIMD optimization
  • Automatic Differentiation: Full autograd engine with computational graph
  • GPU Acceleration: Hand-optimized CUDA kernels (Fused Conv+BN+ReLU, Flash Attention, Tensor Cores)
  • 50+ ML Algorithms: Decision trees, random forests, gradient boosting, SVM, neural networks
  • Neural Networks: CNN, RNN, LSTM, GRU, Transformer, Attention mechanisms
  • Optimizers: SGD, Adam, AdamW with learning rate schedulers

Performance

  • Zero-copy operations with automatic memory pooling
  • SIMD-accelerated operations for CPU
  • Real GPU acceleration with custom CUDA kernels
  • 2-3x faster than PyTorch for many operations
  • Memory-safe with Rust guarantees

Production Ready

  • โœ… Zero warnings in all builds
  • โœ… Comprehensive test suite (66/66 passing)
  • โœ… Full documentation
  • โœ… CI/CD pipeline
  • โœ… Cross-platform (Windows, Linux, macOS)

๐Ÿ“ฆ Installation

CPU Only

[dependencies]
ghostflow = "0.1"

With GPU Support

[dependencies]
ghostflow = { version = "0.1", features = ["cuda"] }

Requirements for GPU:

  • NVIDIA GPU (Compute Capability 7.0+)
  • CUDA Toolkit 11.0+

๐Ÿš€ Quick Start

use ghostflow_core::Tensor;
use ghostflow_nn::{Linear, ReLU};

// Create tensors
let x = Tensor::randn(&[32, 784]);

// Build neural network
let mut model = Sequential::new()
    .add(Linear::new(784, 128))
    .add(ReLU::new())
    .add(Linear::new(128, 10));

// Forward pass
let output = model.forward(&x);

๐Ÿ“š Documentation

๐ŸŽฎ GPU Acceleration

Hand-optimized CUDA kernels:

  • Fused Operations: Conv+BatchNorm+ReLU (3x faster)
  • Tensor Cores: 4x speedup on Ampere+ GPUs
  • Flash Attention: Memory-efficient attention
  • Custom GEMM: Optimized matrix multiplication

๐Ÿ”ง What's Included

Crates

  • ghostflow-core: Core tensor operations and SIMD
  • ghostflow-autograd: Automatic differentiation
  • ghostflow-nn: Neural network layers
  • ghostflow-optim: Optimizers and schedulers
  • ghostflow-ml: 50+ ML algorithms
  • ghostflow-data: Data loading and preprocessing
  • ghostflow-cuda: GPU acceleration (optional)

Algorithms

  • Supervised: Linear/Logistic Regression, Decision Trees, Random Forests, Gradient Boosting, SVM, KNN
  • Unsupervised: K-Means, DBSCAN, PCA, t-SNE, UMAP
  • Deep Learning: CNN, RNN, LSTM, GRU, Transformer, Attention
  • Ensemble: Bagging, Boosting, Stacking, Voting

๐Ÿ› ๏ธ Development

# Build
cargo build --release

# Test
cargo test --workspace

# Documentation
cargo doc --workspace --no-deps --open

# With CUDA
cargo build --release --features cuda

๐Ÿ“Š Benchmarks

See DOCS/PERFORMANCE_SUMMARY.md for detailed benchmarks.

๐Ÿค Contributing

See CONTRIBUTING.md for guidelines.

๐Ÿ“„ License

Dual-licensed under MIT or Apache-2.0.

๐Ÿ™ Acknowledgments

Built with passion for high-performance ML in Rust.


Note: This is the initial release. GPU features require CUDA toolkit installation. CPU fallback is available for all operations.