Superscalar Processor

A simulator for a superscalar out-of-order processor in python.

How to Run

To simply run an benchmark program, try: python processor.py ../benchmark_kernels/fib.asm

For more information try: python processor.py --help

with a Register Alias Table (RAT) and Reorder Buffer (ROB)

Features

Instruction Set Architecture

Supports Arithmetic, Load and Store, Unconditional Jumps (e.g. jump and link, system call), Conditional Jumps (e.g. branch if equal)
Support multi-cycle instructions
Only supports integer operation (no float point i.e. FPU)

Pipeline

7 stage pipeline - fetch, decode, issue, dispatch, execute, writeback, commit
Execution Units - ALU (Arithmetic Logic Unit), MU (Multiplication Unit), DU (Division Unit), LSU (Load Store Unit)
Decouples fetch-decode and writeback-commit with instruction queue and reorder buffer.
Execution units of multi-cycle instructions are fully pipelined (bar the DU, which cannot be pipelined)
Priority writeback - prioritises the retirement of slow multi-cycle instructions when more options than the pipeline width are available

Branch Prediction

Speculative execution with infinite levels of speculative depth
Implements a Two-level local dynamic/adaptive predictor for conditional branch prediction
- Implements a N-bit local pattern history (N = 2 default)
- Implements a 2^N entry branch history register table with S-bit saturating counters (S = 2 default)
Implements a Branch Target Address Cache and Instruction Cache (BTAIC) to cache branch speculations
Implements a Return Address Stack (RAS) to return from multiple nested function calls
- Uses a checkpointing mechanism to recover from failed branch speculation
Recovers from mispredicted branches by flushing at commit

Optimisations

N-way superscalar where N is configurable
Implements Tomasulos Algorithm - for out-of-order execution of instructions that writeback to registers
Implements a Load Store Queue (with store-to-load forwarding) - for out-of-order execution of instructions that writeback to memory
- Recovers from a wrongly speculatively loaded value of memory addresses by flushing at commit
Implements Register Renaming with a Register Alias Table (RAT) and Reorder Buffer (ROB)

Vectorization

configurable vector length - default 16 (AVX2 ISE) .
Vector ISA - Vector Arithemetic Operations, Vector Load and Store Operations, Vector Mask Operations, Vector Blend Operations

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
benchmark_kernels		benchmark_kernels
src		src
test_kernels		test_kernels
.gitignore		.gitignore
README.md		README.md
instruction-set.txt		instruction-set.txt
pipeline.png		pipeline.png
presentation.pdf		presentation.pdf
processor.png		processor.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

benchmark_kernels

benchmark_kernels

src

src

test_kernels

test_kernels

.gitignore

.gitignore

README.md

README.md

instruction-set.txt

instruction-set.txt

pipeline.png

pipeline.png

presentation.pdf

presentation.pdf

processor.png

processor.png

Repository files navigation

Superscalar Processor

How to Run

Features

Instruction Set Architecture

Pipeline

Branch Prediction

Optimisations

Vectorization

About

Releases

Packages

Languages

Charana123/Superscalar-CPU-Simulator

Folders and files

Latest commit

History

Repository files navigation

Superscalar Processor

How to Run

Features

Instruction Set Architecture

Pipeline

Branch Prediction

Optimisations

Vectorization

About

Resources

Stars

Watchers

Forks

Languages