This is a simple PyTorch-like library for personal use, generated from MiniTorch.
The project is implemented for the following purposes:
- Familiar with the design and optimizations of machine learning libraries.
- Practice C++ skills.
- Learn parallelism and CUDA.
- Replace numpy NDArray with my own implementation.
- Replace numba CPU parallelism with my OpenMP implementation (also considering other approaches).
- Replace numba GPU parallelism with my CUDA implementation.