# **Compute Platform for AI**

An Open-Source Summary

Thomas Debelle



September 25, 2025

#### **Contents**

| 1                                                              | Lecture 1: Towards heterogeneous many-core processors |                            |   |  |  |  |
|----------------------------------------------------------------|-------------------------------------------------------|----------------------------|---|--|--|--|
|                                                                | 1.1                                                   | Scaling                    | 1 |  |  |  |
|                                                                |                                                       | 1.1.1 Denard broke down    | 1 |  |  |  |
|                                                                |                                                       | 1.1.2 Dark and Dim Silicon | 2 |  |  |  |
|                                                                | 1.2 Area for energy in single-core                    |                            |   |  |  |  |
|                                                                |                                                       |                            |   |  |  |  |
| 1.4 Area for energy through domain specific accelerators (DSA) |                                                       |                            |   |  |  |  |
| 2                                                              | License                                               |                            |   |  |  |  |
|                                                                | 2.1                                                   | Modification               | 3 |  |  |  |

# 1 Lecture 1: Towards heterogeneous many-core processors

#### Paper to read further

Paper1: A New Golden Age for Computer Architecture J. Hennessy AND D. Patterson

Paper2: Apple M1: Ditching x86 A. Frumusanu

## 1.1 Scaling

In IC and chip design, there is two fondamental laws:

- **Denard's law:** as transistors get smaller, the power density remains constant, which leads to lower and lower supply voltage to avoid to break down the oxide due to a strong  $\vec{E}$ .
- Moore's law: every generation can fit twice as many transistors on a certain area.

#### 1.1.1 Denard broke down

Denard's law is based on the fact that the width, length, oxide thickness and voltage is reduced by a factor  $\alpha$  each time. This factor also influences:

- Density:  $\alpha^2$
- Capacitance:  $1/\alpha$
- Delay:  $1/\alpha$

Thus, Power =  $CV_{DD}^2 f \propto 1/\alpha \cdot cst \cdot 1/\alpha = 1/\alpha^2$ . Finally the power density is a constant.

On paper, this is valid but we can't infinitely scale down V or we will have lower speed the closer we come to  $V_{th}$  which is not feasible. Moreover, the wire cannot scale down as desired or we will have a bad resistance in the wire. This will make the wires a bit more "capacitive" and so the  $\alpha$  factors will no longer cross out as Denard predicted. The power is constant and the power density is  $\alpha^2$  which is quite problematic.

Thomas Debelle 1

Compute Platform for AI

#### 1.1.2 Dark and Dim Silicon

The end of the Denard's scaling lead to a plateau in the power consumption of chips. We have to buy this energy efficiency. We know that the power density scales with  $\alpha^2$  at maximum clock frequency. So we will introduce:

- **Dim silicon:** silicon running at below max  $f_{\phi}/\alpha^2$
- **Dark silicon:**  $1/\alpha^2$  blocks that totally shut down when not used

In practice, if we scale with a factor S=2 we can put  $2^2$  more silicon and the speed should increase by a factor 2. In dark silicon, we will still speedup the clock frequency but all the newly added cores will be shutdown. This is quite unsustainable as more cores (due to Adhalm's law) won't leverage from parallelism. The better idea is to not clock faster and use this extra factor 2 to produce dim silicon and then the rest with dark silicon.



Figure 1: The two aforementioned techniques

Recently, we are using more and more dark silicon as accelerators that get turned on for specific task – accelerators.

#### 1.2 Area for energy in single-core

To make a processor faster, we can use either:

- Instruction-level parallelism: VLIW, OOO super-scalar
- Data-level parallelism: SIMD, GPU

Won't re-explain what those are, check the computer architecture lecture: link

Those are prime examples of dim silicon with lower clock speed for same throughput thanks to parallelism.

#### 1.3 Area for energy through multi-core

### 1.4 Area for energy through domain specific accelerators (DSA)

Thomas Debelle 2

Compute Platform for AI

#### 2 License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0).

You are free to:

- **Share** copy and redistribute the material in any medium or format.
- Adapt remix, transform, and build upon the material.

Under the following terms:

- **Attribution** You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- **NonCommercial** You may not use the material for commercial purposes.
- **ShareAlike** If you remix, transform, or build upon the material, you must distribute your contributions under the same license as the original.
- **No additional restrictions** You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.

For the full legal text of the license, please visit: https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode

#### 2.1 Modification

| To contribute to this worl | k or any other one fro | om this project ple | ease find more info | rmation at the Gi | thub repository. |
|----------------------------|------------------------|---------------------|---------------------|-------------------|------------------|
|                            |                        |                     |                     |                   |                  |
|                            |                        |                     |                     |                   |                  |

© 2025 Authors of the Summary, Professors of the Course and possible book's authors. Some Rights Reserved.

Thomas Debelle 3