



## Thermal management for S-NUCA Many-Cores via Synchronous Thread Rotations

Yixian Shen, Sobhan Niknam, Anuj Pathania, Andy D Pimentel

## S-NUCA many-cores suffer from thermal issues

■ LLC cache → Logically shared but physically distributed

Inherent Heterogeneity(Cache latency is non-uniform)



DVFS-based DTM introduces the significant performance degradation



## Thread rotations penalty < DVFS-based penalty

DVFS based DTM for blacksholes (Master+Slave threads)





□ Thread rotations for blacksholes (Master+Slave threads)





## **Hot-potato scheduling**

☐ A theoretical one-shot peak temperature calculation



☐ A thermal and architecture-aware thread rotations scheduler



- ☐ Access cache latencyRing1 < Ring2 < Ring3</li>
- ☐ Thermal dissipation condition

  Ring1 < Ring2 < Ring3