Skip to content

Conversation

coketaste
Copy link
Contributor

@coketaste coketaste commented Sep 22, 2025

Pull Request

📋 Description

Brief description of changes and motivation.

🎯 Type of Change

  • 🐛 Bug fix (non-breaking change fixing an issue)
  • ✨ New feature (non-breaking change adding functionality)
  • 💥 Breaking change (fix or feature causing existing functionality to not work)
  • 📚 Documentation update
  • 🚀 Performance improvement
  • 🔨 Code refactoring
  • 🧪 Test improvements
  • 🐳 Docker/Infrastructure changes

📚 Module Impact

Which modules are affected by this change?

  • Module 1: Foundations of GPU Computing
  • Module 2: Multi-Dimensional Data Processing
  • Module 3: GPU Architecture and Execution Models
  • Module 4: Advanced GPU Programming Techniques
  • Module 5: Performance Engineering and Optimization
  • Module 6+: Future modules
  • Build system/Infrastructure
  • Documentation only

🧪 Testing

Compilation Testing:

  • ✅ Compiled successfully on CUDA
  • ✅ Compiled successfully on HIP
  • ✅ Cross-platform compatibility verified
  • 🚫 Not applicable (documentation/infrastructure only)

Runtime Testing:

  • ✅ Tested example execution
  • ✅ Verified correct output
  • ✅ Performance benchmarking completed
  • 🚫 Not applicable

Environment Testing:

  • ✅ Native installation
  • ✅ Docker containers (CUDA)
  • ✅ Docker containers (ROCm)
  • ✅ Multiple GPU configurations

📖 Documentation

  • ✅ Updated relevant README files
  • ✅ Updated module content.md files
  • ✅ Added/updated code comments
  • ✅ Updated build system documentation
  • 🚫 No documentation changes needed

🔍 Code Quality

  • ✅ My code follows the project style guidelines
  • ✅ I have performed a self-review of my code
  • ✅ I have commented complex algorithms and optimizations
  • ✅ I have added proper error handling
  • ✅ My changes generate no new compiler warnings

⚡ Performance

  • ✅ No performance regression verified
  • ✅ Performance improvements documented
  • ✅ Benchmarking results included
  • 🚫 Not applicable

🔄 Backward Compatibility

  • ✅ Changes are backward compatible
  • ⚠️ Changes include breaking changes (documented below)
  • 🚫 Not applicable

💥 Breaking Changes

🔗 Related Issues

Fixes #
Relates to #

📸 Screenshots/Output

🧾 Additional Notes

✅ Final Checklist

  • I have tested this change thoroughly
  • I have updated all relevant documentation
  • I have considered cross-platform compatibility
  • I have considered the educational impact of this change
  • I understand this change may be tested on multiple GPU platforms

…erfile to avoid PEP 668.

Fix the nvcc compile error: stop forcing hardcoded -arch flags (sm_70, sm_75, etc.) and detect the actual GPU arch at build time.
Clean up source code broken by CUDA 13 deprecations: replace uses of memoryClockRate and memoryBusWidth from cudaDeviceProp with cudaDeviceGetAttribute.
Remove Python pip from Dockerfile: completed (your build log confirms success)
Repo-wide sweep for hardcoded sm_XX: completed
@coketaste coketaste merged commit 4ce87e0 into main Sep 22, 2025
6 checks passed
@coketaste coketaste deleted the coketaste/docker-update branch September 22, 2025 18:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant