Branch Prediction

In this lab, you will need to modify the existing branch predictor. See the Your Own Repository Guide for our recommended way to organize and collaborate on labs.

Pre-Lab Questions

What is the purpose of a branch predictor? Why does a single-cycle core not need branch prediction?
Define and compare and contrast the following:
- Static branch prediction vs. Dynamic branch prediction
- One-level branch prediction vs. Two-level predictor
- Local branch prediction vs. Global branch prediction
Define BHT, BTB and RAS. What are they used for?
Look at frontend.sv. What are the 4 types of instructions that the branch predictor handles, and how are they handled?
How is a branch resolution handled?
What kind of dynamic branch predictor does CVA6 use?
Provide a GitHub permalink to where in ariane_pkg the branch predictor structs are defined.
When can more than 1 instruction be fetched per cycle?

Part 1 - CVA6 Predictor

Add functionality to frontend.sv that records the branch predictor hit-rate.

To do this, you can create an always_ff @(posedge clk) block that counts how many times a branch has been resolved, and how many of those resolutions were mispredicts. (Branch resolve net; branch resolve type.) The hit-rate can be calculated with 1-num_mispredicts/num_valid_resolutions. You can then record the hit-rate to a file on every clock cycle; this way, you are certain to get the final hit-rate when the simulation terminates.

Part 1 Questions

Highlight your changes to "frontend.sv" that records the hit-rate.
What are the final hit-rate percentages of each of the bp benchmarks?
Compare the performance of the bp benchmarks after choosing 3 new values for NR_ENTRIES. Display the 4 hit-rates in a table and explain how and why each program changes its hit-rate as BHT size changes. Note: When changing NR_ENTRIES, be sure to change NR_ENTRIES in the bht instantiation in "frontend.sv", not the bht declaration in "bht.sv". Also your NR_ENTRIES values should be on the order of 16 to have interesting results.

Example of How to Write to a File in Verilog/SystemVerilog

integer f;
integer counter;
initial begin
    f = $fopen("bp.txt","w");
end
always @(posedge clk) begin
    $fwrite(f, "%f\n", $itor(counter));
end

Part 2 - Global Predictor

In this part, you will modify the bht.sv and turn it into a global two-level adaptive branch predictor. First, read through the Global Branch Predictor Specifications section.

A few notes on your implementation:

Choose your own values for n and m
You can use whatever algorithm you want to calculate the BHT index
- You can use Gshare or Gselect
- You may also choose to use XORShift, a simple random-number-generation algorithm, to create a hash that will index into your BHT
Be sure that your BHT size is decided by the parameter NR_ENTRIES
Be sure to remove all unused lines of code leftover from the initial implementation
Be sure to comment your design clearly

Part 2 Questions

Share your modified "bht.sv" that implements the global two-level adaptive branch predictor.
What specifications did you decide on for your predictor? What is your BHT index generation algorithm? How wide is your GHR? Which address bits do you use for your address?
Briefly explain your reasoning behind the BHT index generation algorithm you chose.
Compare the performance of the bp benchmarks after choosing 3 new values for NR_ENTRIES. Display the 4 hit-rates in a table and explain how and why each program changes its hit-rate as BHT size changes.

Global Branch Predictor Specifications

This section describes the specifications required to build a global two-level adaptive branch predictor.

For global branch predictors, a global history record (GHR) must be kept. A GHR keeps a record of the past n branches using a FIFO method. To maintain the GHR, when a branch has been resolved, the branch result must be shifted into the GHR while simultaneously dropping the nth result.

Similar to a one-level branch predictor, a two-level branch predictor contains a branch history table (BHT) where its entries are (often) two-bit saturation counters. However, one-level and two-level predictors differ in how the BHT index is calculated.

For a global two-level adaptive branch predictor, the BHT index is calculated using the current GHR value and using m bits from the resolved branch's program counter. The GHR and PC can either be concatenated together (called Gselect) or xor'ed together (called Gshare). Other BHT index calculation algorithms exist but have little effect on the predictor performance.

2-Bit Saturation Counter

Global Predictor

Extra Credit

Branches come in many patterns in real code, and it is not uncommon to find that different styles of branch prediction have value in different situations. Extend your predictor to contain a "Tournament" of CVA6's predictor and your global predictor, with a sensible mechanism for updating trust based on successful or failed predictions.

Code Submission

Submit to the Gradescope Autograder your modified "bht.sv" that implements a global two-level adaptive branch predictor. The autograder will verify the hit-rate for several different programs, but your implementation will be verified manually.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

branch-prediction.md

branch-prediction.md

Branch Prediction

Pre-Lab Questions

Part 1 - CVA6 Predictor

Part 1 Questions

Example of How to Write to a File in Verilog/SystemVerilog

Part 2 - Global Predictor

Part 2 Questions

Global Branch Predictor Specifications

2-Bit Saturation Counter

Global Predictor

Extra Credit

Code Submission

Files

branch-prediction.md

Latest commit

History

branch-prediction.md

File metadata and controls

Branch Prediction

Pre-Lab Questions

Part 1 - CVA6 Predictor

Part 1 Questions

Example of How to Write to a File in Verilog/SystemVerilog

Part 2 - Global Predictor

Part 2 Questions

Global Branch Predictor Specifications

2-Bit Saturation Counter

Global Predictor

Extra Credit

Code Submission