# MS108 Project Report

ZYHowell

Pitfall, Stories and Great Ideas(from others)

January 2, 2020

#### Overview

- Tomasulo:
- iCache(DM and 2-way), dCache(write back);
- Branch Mask.
- GShare. (no more space to improve on FPGA)

## Final Design



#### Branch Mask

- The reason for large ALUrs and regfile is and only is logic for branch.
- Faster



#### Sometimes bigger and dumber is better?

Pitfall, according to CAAQA ed.6, chapter three. Pentium 4.

- Simply adding ROB line or RS have almost no improvement;
- It costs a lot to enlarge the shadow page but is obviously useless.
  However,
- Enlarging the Cache seems useless, due to the limited testbench. (admitting that meeting the testbench is the most important for a design)
  - Itanium 2(well, it is also a failure) and i7 vs. Pentium 4.

## And sometimes smarter is better than bigger and dumber?

Pitfall, according to CAAQA ed.6, chapter three. Really?

- The story of calculating head and tail in shadow register;
- The story of write-back dCache. However:
- The result of branch masks;
- The result of special judge when entering RS.

CAAQA mentions to consider the situation.

### My path

- Learn and design; (6th Sep 1st assignment)
- One without ROB first; (after python interpreter 23rd Nov, debug until 5th Dec)
- Add ROB, branch policy is stall; (23rd Nov 9th Dec)
- Add branch mask, then make ROB a pseudo one to simplify. Shadow regfile also;(till 15th Dec)
- Add Dcache and Gshare. (1st Jan 2nd Jan)

#### More stories

- Shadow register again: only shadow tag, no shadow data:  $99\% \rightarrow 50\%$  LUT; remark: considering the most complex logic, instead of the largest reg. (FF is no more than 20%)
- Misprediction judgement; (more LUT but not that more, 1ns less delay)
- Not assign a=b, simply use b; (not elegant, but 0.7ns less delay)
- Only control Enable signal strictly and others loosely.

# How to judge a design?

- testbench?
  - -actually not enough this time(still wrong after passing them);
- CPI?
  - -bad design can have good CPI but low frequency;
- Frequency and CPI?
  - -based on the tech of chip creation.
  - -1950s and early 1960s did not need Tomasulo.

So what is this project for? A 1960 one or a 2020 one? Is Tomasulo a 2020 one? Remind Pentium 4.

# **Thanks**