This is the project for 2022 Sprinf ECE-GY 9143 High Performance Machine Learnig, maintained by Jingxuan Wang and Yuchen Kou. We compare the performance of ResNet based on CIFAR-10 dataset under different precion situations.
- Content in this repository
- Environment
- Usage
- Code Structure
- Results and Observation
- Challenges we met
(how to execute the code)
Precision | Training Time | Model Size | Bandwidth |
---|---|---|---|
fp16 | 单元格 | ---- | ---- |
TF32 | 单元格 | ---- | ---- |
mp | 单元格 | ---- | ---- |
fp32 | 单元格 | ---- | ---- |
- When we started this project, the lecture did not involve relevant knowledge, and the principle was not clear. We spent a lot of time looking at the NVIDIA manual, related papers and blogs to learn about the possible performance and differences between different precisions.
- We planned to use multiple machines in parallel to calculate under different precisions situations, and we were not sure if we could accomplish this task.
- We are not sure if the experimental results will be consistent with our prediction。