GPGPU processor architecture
Latest commit 80097b7 Aug 15, 2016 @jbush001 hardware: a few more minor testbench/build fixes
- Turn on *all* warnings by default and only disable ones that are problematic:
  * Wunused: This triggers for global structure definitions in
    Also complains if bits of a signal are unused, which is hard to avoid with
    generic interfaces.
  * Wdeclfilename: Fails for global definitions in
  * Wpinconnectempty: Library modules like SRAM often have unused outputs. I use empty
    pins purposely to call this out (and avoid a missing pin warning). Don't see
    any way to avoid this warning, doesn't seem super useful.
  * Wundriven: interfaces in port list seem to erroneously trip this assertion. May also
    be verilator issue?
- Make cosimulation timer variables have same types and group them together

Nyuzi Processor


Nyuzi is an experimental multicore GPGPU processor. It supports vector floating point, hardware multithreading, and cache coherence. The SystemVerilog-based hardware implementation is synthesizable and runs on FPGA. This project also includes an LLVM-based C++ toolchain, tests, and other tools. It is useful for microarchitecture experimentation, performance modeling, and parallel software development.

Mailing list: nyuzi-processor-dev
License: Apache 2.0

Getting Started

The following instructions explain how to set up the Nyuzi development environment. This includes an emulator and cycle-accurate hardware simulator, which allow hardware and software development without an FPGA.

Install Prerequisites

Linux (Ubuntu)

This requires Ubuntu 16.04 (Xenial Xerus) or later to get the proper package versions. It should work on other distributions, but you will probably need to change some package names.

sudo apt-get -y install autoconf cmake make gcc g++ bison flex python \
    perl emacs openjdk-8-jdk swig zlib1g-dev python-dev libxml2-dev \
    libedit-dev libncurses5-dev libsdl2-dev gtkwave imagemagick

Emacs is used to expand verilog-mode AUTO macros. The makefile runs this step in batch mode.
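To confirm that the prerequisites are on your PATH before building, something like the following sketch may help. The check_tools function is a hypothetical helper, not part of the Nyuzi repository, and the command names are assumed to match the packages listed above (library packages such as zlib1g-dev have no command to check).

```shell
# check_tools is a hypothetical helper, not part of the Nyuzi repository.
# It reports which of the named commands cannot be found on PATH.
check_tools() {
    missing=""
    for tool in "$@"; do
        # command -v succeeds only if the shell can resolve the name
        if ! command -v "$tool" >/dev/null 2>&1; then
            missing="$missing $tool"
        fi
    done
    if [ -n "$missing" ]; then
        echo "missing:$missing"
        return 1
    fi
    echo "all tools found"
}

# Command names assumed to correspond to the packages installed above.
check_tools autoconf cmake make gcc g++ bison flex python perl emacs swig || true
```

If any names are reported as missing, install the corresponding package and run the check again.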


MacOS

These instructions assume OS X Mavericks or later.

Open the App Store application, search for Xcode, and install it.

Install MacPorts.

Install the command line compiler tools:

xcode-select --install

Install the remaining packages using MacPorts:

sudo port install cmake bison swig swig-python imagemagick libsdl2 curl emacs

You may optionally install GTKWave for analyzing waveform files.


Windows

I have not tested this on Windows. Many of the libraries are cross-platform, so a port should be possible, but the easiest route is probably to run Linux in a virtual machine such as VirtualBox.

Build (Linux & MacOS)

The following script will download and install the Nyuzi toolchain and the Verilator Verilog simulator. (Some Linux package managers offer Verilator, but their versions are old.) It will ask for your root password a few times during installation.


Build everything else:


Run tests:

make test

If you are on a Linux distribution where python defaults to Python 3, you may run into build problems with the compiler. In tools/NyuziToolchain/tools/CMakeLists.txt, comment out the following line:


Occasionally a change will require a new version of the compiler. To rebuild:

git submodule update
cd tools/NyuziToolchain/build
sudo chown -R `whoami` .
sudo make install
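One way to tell whether the toolchain submodule has fallen behind after pulling new changes is to look at the first character of its `git submodule status` line. The sketch below assumes you run it from the top of the Nyuzi tree. The describe_submodule function is a hypothetical helper, not part of the repository.

```shell
# describe_submodule is a hypothetical helper, not part of the Nyuzi repository.
# It interprets the first character of one line of `git submodule status` output.
describe_submodule() {
    case "$1" in
        -*) echo "not initialized" ;;
        +*) echo "out of date" ;;      # checked-out commit differs from the recorded one
        U*) echo "merge conflict" ;;
        *)  echo "up to date" ;;       # line starts with a space
    esac
}

# Query the toolchain submodule; prints nothing outside a git checkout.
status_line=$(git submodule status tools/NyuziToolchain 2>/dev/null || true)
if [ -n "$status_line" ]; then
    describe_submodule "$status_line"
fi
```

An "out of date" result suggests the rebuild steps above are worth running.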

What next?

Sample applications are available in software/apps. You can run these in the emulator by typing 'make run'. Some need third-party data files; details are in the READMEs in those directories.

For example, this will render a 3D model:

cd software/apps/sceneview
make run

Running on FPGA

See instructions in hardware/fpga/de2-115/