# TSEK06 Transistor-Level Design Report

Group 5

Editor: Johannes Klasson

Version P1B

### Status

| Reviewed | Johannes Klasson    | 2016-03-3 |
|----------|---------------------|-----------|
| Approved | Martin Nielsen-Lönn | -         |

# PROJECT IDENTITY

 $\begin{tabular}{ll} VT, 2016, Group 5 \\ Linköpings Tekniska Högskola, ISY \end{tabular}$ 

### ${\bf Group\ members}$

|                  | 1                |             |                         |
|------------------|------------------|-------------|-------------------------|
| Name             | Responsibility   | Phone       | E-mail                  |
| Johan Isaksson   | Project Leader   | 070-2688785 | johis024@student.liu.se |
| Johannes Klasson | Document Manager | 073-8209003 | johkl226@student.liu.se |
| Jonas Tarassu    | VLSI Designer    | 070-5738583 | jonta760@student.liu.se |
| Alexander Yngve  | VLSI Designer    | 076-2749762 | aleyn573@student.liu.se |

 $\mathbf{Customer} \colon \mathrm{ISY}$ 

Contact at customer: Martin Nielsen-Lönn Course resposible: Atila Alvandpour Consultant: Martin Nielsen-Lönn CONTENTS March 9, 2016

# Contents

| 1            | Introduction                                                                                                                                       | 1                     |
|--------------|----------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------|
| 2            | Block Level Description 2.1 SPI-in/PRBS                                                                                                            |                       |
| 3            | SPI out         3.0.1 Shift register          3.0.2 Control logic          3.1 Protocol                                                            | 6<br>6<br>7<br>7      |
| 4            | Simulation Results           4.1 SPI In            4.2 Kogge-Stone Adder            4.3 Comparator            4.4 SPI Out            4.5 Top Level | 7<br>8<br>8<br>8<br>8 |
| 5            | Risks and Delays                                                                                                                                   | 8                     |
| A            | Block diagram of the Kogge-Stone Adder                                                                                                             | 10                    |
| В            | Truth Tables for the Kogge-Stone Adder                                                                                                             | 11                    |
| $\mathbf{C}$ | Time Plan                                                                                                                                          | 13                    |

CONTENTS March 9, 2016

# Document history

| Version | Date       | Changes     | Performed by   |
|---------|------------|-------------|----------------|
| P1A     | 2016-02-19 | First draft | Johan Isaksson |

### 1 Introduction

This document describes the state of the 16-bit Kogge-Stone adder project in the course TSEK06 after finishing the high level design phase. The meaning of high level is that the every basic logic gate is implemented in VerilogA. The main reson for doing this is to be able to simulate all logic to make sure that everything works as intended. Block level diagrams can be found in section 2, simulation results in section 4 and a small risk analysis in section 5. In appendix C and ?? a time plan of the next phase and a time report of this phase can be found.

# 2 Block Level Description

This section contains block level descriptions of all parts of the system seen in figure 1.



**Figure 1** – Block diagram of the system.

# 2.1 SPI-in/PRBS

The SPI-in module consists of a lot of registers and some control logic moving data between these registers. Since the PRBS module share some registers with the SPI-in module, and the PRBS module is relatively small, we included it in the SPI-in module.

The first block in the SPI-in module is the SPI-receive block (see figure 2). It consists of 16 resettable D flip-flops (DFFSR), that is connected one after another. They are clocked on the SPI clock, and on each positive clock pulse, we get a new bit to shift in. After 16 pulses we have 16 bits stored, and a load signal is triggered so that each bit is moved to the correct PRBS-register. The PRBS-registers (see figure 3) are register that consists of four DFFSR (one for each addition) and that can be run in two different modes. During the first mode, the normal mode, the registers are triggered with a load signal, and the data to the first DFFSR comes from the SPI-receive block. During the second mode, the PRBS mode, the registers and triggered with the system clock, and the data to the first DFFSR is the value of the third and forth DFFSR passed through a XOR. The mode is chosen by the SPI-enable signal. Of course, the correct sum does not need the PRBS mode, so it is only a normal register that only can run in normal mode.

The last block is the control block (see figure 4), which is the most complex block of the SPI-in module. The heart of the control block is a 6-bit counter. The first four bits signalizes if we have read a 16 bit word or not. The last two bits signalizes which of our four words we are currently reading. By combining these signals like in figure we are able to produce the load signals that trigger the PRBS-registers.



 ${\bf Figure} \ {\bf 2} - {\bf Block} \ {\bf diagram} \ {\bf of} \ {\bf the} \ {\bf SPI-receive} \ {\bf block}.$ 



Figure 3 – Block diagram of the SPI-PRBS block.



 ${\bf Figure}~{\bf 4}-{\rm Block~diagram~of~the~SPI-controller~block}.$ 

# 2.2 16-bit Kogge-Stone Adder

The Kogge-Stone adder consists of four simple blocks connected in a complex way, as can be seen in A. These four blocks can be seen in figure 5-8. The red block constitute the initial stage which takes two binary numbers A and B as input. The corresponding truth table is found in table 2

in appendix B. The output signals P and G generated from this block are later used by other blocks in the adder. The G, also called the Generate signal, trickles down through the hierarchy of yellow, and yellow carry blocks to finally end up in the sum block. The truth table for this block can be found in table 5. Truth tables for the yellow and yellow carry blocks are found in table 3 and 4.



Figure 5 – Schematic view of the red block.



 ${\bf Figure}~{\bf 6}-{\bf Schematic~view~of~the~yellow~block}.$ 



 ${\bf Figure}~{\bf 7}-{\bf Schematic~view~of~the~yellow~carry~block}.$ 

2.3 Comparator March 9, 2016



Figure 8 – Schematic view of the sum block.

# 2.3 Comparator

The comparator consists of 17 2-input XNOR gates where one bit of each number is fed into each gate. The output from the XNOR gates are fed into a couple of AND gates which generates the final output. The comparator is 17 bits wide since it compares two 16 bit numbers plus their carry bits. The logic table of the XNOR gates is shown in table 1.

Table 1 – Logic table of XNOR block.

| $A_i$ | $B_i$ | $Y = \overline{(A_i \oplus B_i)}$ |
|-------|-------|-----------------------------------|
| 0     | 0     | 1                                 |
| 0     | 1     | 0                                 |
| 1     | 0     | 0                                 |
| 1     | 1     | 1                                 |

2.4 SPI out March 9, 2016

### 2.4 SPI out

This chapter will discuss the changes to the SPI output module.

#### 2.4.1 Shift register

The output consist of a 68 bit shift register where each cell in the register contains one D flip-flop and one multiplexer. As we have transistor schematics for both the flip flop and the multiplexer nothing had to be changed to the individual cells, just change the configuration files to use schematics instead of the verilog code.



 ${\bf Figure} \,\, {\bf 9} - {\bf Shift} \,\, {\bf register} \,\, {\bf cell}$ 

6

#### 2.4.2 Control logic

As all verilog code were replaced by transistor schematics, the problem with this design got exposed. Each of the four enable signal has a large fan out as they are connected to 17 cells in the shift register.

The solution was to size the the multiplexer generating the control signals. The last internal block that includes this multiplexer can be seen in Fig. 10.



Figure 10 – Functional block containing an AND and a MUX.

By testing we found that a MUX that is approximately six times larger should be sufficient. In Fig. 11 the schematic of the sized multiplexer, implemented using NAND and an inverter, is shown.



Figure 11 – Sized multiplexer.

#### 2.4.3 Protocol

Due to a misunderstanding in the group, regarding if the SPI clock should be kept high or low during the high time of the SPI enable signal, the data on the spi output is now available for read already on the first falling edge of the SPI clock. This is possible as the group decided to keep the clock low during high time of SPI enable so that data can be written to the output at first rising edge.

### 3 Simulation Results

This section describes the high level simulation results. All files referenced to in this section can be found in the attached zip-file.

3.1 SPI In March 9, 2016

#### 3.1 SPI In

The first thing to test in the system is where it all begins, at the input. The basics of it can be seen in test\_spi\_receive. As can be seen, as soon as the SPI\_enable signal goes low and the SPI\_clk starts we start to receive one bit on every positive edge. The data is then shifted trough all of the 16 registers. As the 16th bit is shifted in, the first load signal is triggered. Every 16 bits after that another load signal is triggered, and this can be seen in test\_spi\_load. One can also see that once the last load signal is triggered, SPI\_enable goes high again.

The last thing in the SPI in goodule to test is how the data trough out of the BBPS positions.

The last thing in the SPLin module to test is how the data travels out of the PRBS registers. This can be seen in test\_spi\_prbs. One important thing to note is that as soon as SPI\_enable goes high, the registers are triggered on the system clock. As one can see in the figure, the first bit is ready for a long time, and as soon as the last bit is ready, we start to add at full speed. And after the four bits are done the system continues to add the pseudo random numbers.

# 3.2 Kogge-Stone Adder

The simulation of the adder can be seen in test\_koggeadder and the input sequence is the same that were fed into the SPI. The most relevant part of the simulation is precisely after the topmost signal goes high. When this happens the data is already fed into the register in front of the adder and begins to shift into the adder on positive clock edge. The out, or to makes things more clear, the BISTout signal clearly shows that the two first addition yields the correct result but the thirds makes the same signal go low. This is a construction of the input from our side to test if the comparator can detect errors. After this the BISTout are low for a while before it goes high again which means that the fourth addition was successful.

### 3.3 Comparator

The simulation of the comparator is seen in test\_corr. The same reasoning as in the section above applies here. The simulation is quite striped down due to readability but one can clearly see that out, which is BISTout, is high if and only if the corr-signals and sum signals match.

#### 3.4 SPI Out

The critical parts of this module are the events after a transition on the spi enable signal. In the image test\_spi\_out a simulation of the module is shown. When the spi enable signal goes high, the control logic successfully creates the four enable pulses for the different sections. The four clock cycles long pulse (4p) prevents the occurrence of more than one pulse on any of the enable signals. The counter enable signal (2b\_en) goes low to prevent the counter from restarting before the 4p-signal ends.

When spi enable goes low, the four enable signals are the same as the spi clock.

#### 3.5 Top Level

One can get a overall picture of the behaviour of the system by looking at at the simulations in the order spi\_receive, spi\_prbs, koggeadder, corr and last the spi\_out. This is possible because all this simulations are part of a bigger one.

# 4 Risks and Delays

During the high level design phase there hasn't been any condiderable risks or delays. The only thing that is an issue is that all members in the group has quite different schedules which sometimes can hinder cooperation. However the group has solved this by using good tools for project tracking (Trello) and communication (Slack). This has helped considerably with the delegation of tasks and keeping track of what needs to be done.

During the next phases of the project, cooperation will be more important since the tasks will be harder. We will need to plan ahead and schedule occasions where we all can meet and work together.

# A Block diagram of the Kogge-Stone Adder



# B Truth Tables for the Kogge-Stone Adder

 ${\bf Table} \ {\bf 2} - {\rm Logic} \ {\rm table} \ {\rm of} \ {\rm red} \ {\rm block}.$ 

| $A_i$ | $B_i$ | $P = A_i \oplus B_i$ | $G = A_i \wedge B_i$ |
|-------|-------|----------------------|----------------------|
| 0     | 0     | 0                    | 0                    |
| 0     | 1     | 1                    | 0                    |
| 1     | 0     | 1                    | 0                    |
| 1     | 1     | 0                    | 1                    |

Table 3 – Logic table of yellow block.

| $G_i$ | $G_{i,prev}$ | $P_i$ | $P_{i,prev}$ | $P = P_i \wedge P_{i,prev}$ | $G = (P_i \land G_{i,prev}) \lor G_i$ |
|-------|--------------|-------|--------------|-----------------------------|---------------------------------------|
| 0     | 0            | 0     | 0            | 0                           | 0                                     |
| 0     | 0            | 0     | 1            | 0                           | 0                                     |
| 0     | 0            | 1     | 0            | 0                           | 0                                     |
| 0     | 0            | 1     | 1            | 1                           | 0                                     |
| 0     | 1            | 0     | 0            | 0                           | 0                                     |
| 0     | 1            | 0     | 1            | 0                           | 0                                     |
| 0     | 1            | 1     | 0            | 0                           | 1                                     |
| 0     | 1            | 1     | 1            | 1                           | 1                                     |
| 1     | 0            | 0     | 0            | 0                           | 1                                     |
| 1     | 0            | 0     | 1            | 0                           | 1                                     |
| 1     | 0            | 1     | 0            | 0                           | 1                                     |
| 1     | 0            | 1     | 1            | 1                           | 1                                     |
| 1     | 1            | 0     | 0            | 0                           | 1                                     |
| 1     | 1            | 0     | 1            | 0                           | 1                                     |
| 1     | 1            | 1     | 0            | 0                           | 1                                     |
| 1     | 1            | 1     | 1            | 1                           | 1                                     |

 ${\bf Table} \ {\bf 4} - {\bf Logic} \ {\bf table} \ {\bf of} \ {\bf yellow} \ {\bf with} \ {\bf carry} \ {\bf block}.$ 

| $P_i$ | $G_i$ | $G_{i,prev}$ | $   G = (P_i \wedge G_{i,prev}) \vee G_i $ |
|-------|-------|--------------|--------------------------------------------|
| 0     | 0     | 0            | 0                                          |
| 0     | 0     | 1            | 0                                          |
| 0     | 1     | 0            | 1                                          |
| 0     | 1     | 1            | 1                                          |
| 1     | 0     | 0            | 0                                          |
| 1     | 0     | 1            | 1                                          |
| 1     | 1     | 0            | 1                                          |
| _1    | 1     | 1            | 1                                          |

 ${\bf Table}~{\bf 5}-{\bf Logic~table~of~sum~block}.$ 

| $P_i$ | $C_{i-1}$ | $S_i = P_i \oplus C_{i-1}$ |
|-------|-----------|----------------------------|
| 0     | 0         | 0                          |
| 0     | 1         | 1                          |
| 1     | 0         | 1                          |
| 1     | 1         | 0                          |

# C Time Plan

|    |                                                           |       | Planning       |    |      |    |    |          |         |         |      |          |     |     |     |      |          |      |         |   |
|----|-----------------------------------------------------------|-------|----------------|----|------|----|----|----------|---------|---------|------|----------|-----|-----|-----|------|----------|------|---------|---|
| Pı | roject: 16 bit Kogge-Stone ad                             |       |                |    |      |    |    |          |         |         |      |          |     |     |     |      |          |      |         |   |
|    | oject group: 5                                            |       | Date: 160204   |    |      |    |    |          |         |         | Б    | levi     | iew | /ed |     |      | П        | П    |         |   |
|    | ustomer: Martin Nielsen-Lön                               |       | Version: P1B   |    |      |    |    |          |         |         | ľ    |          |     |     |     |      |          |      |         |   |
|    |                                                           |       |                |    |      |    |    |          |         |         | +    | JC       | ona | nne | s ĸ | lass | son      | -    | -       | - |
| C  | ourse: TSEK06                                             |       | Author: JI     |    |      |    |    |          |         |         | L    |          |     |     |     |      |          |      | Щ       | Щ |
|    | ACTIVITIES                                                | TIME  | WHO            | L  |      | _  |    | TIN      | _       |         | _    | <u> </u> | _   | _   | _   | _    | <u> </u> |      |         |   |
| 10 | Description                                               | hours | Initials       | 4  | 5    | 6  | 7  | 8        | 9       | 10 1    | 1 12 | 13       | 14  | 15  | 16  | 17   | 18       | 19 2 | 20 2    | 1 |
| 1  | Define structure of the SPI unit                          | 10    | JI, JK         |    | 9,5  |    |    |          | $\perp$ |         |      | L        |     |     |     |      |          | Ш    | $\perp$ |   |
| 2  | Implement counters in Verilog-A                           | 10    | JI, JK         |    | 4    |    |    |          | $\perp$ |         |      | L        |     |     |     |      |          | Ш    | $\perp$ | ╛ |
| 3  | Implement control logic in Verilog-A                      | 10    | JI, JK         |    |      | 10 |    |          | $\perp$ |         |      | L        |     |     |     |      |          | Ш    | $\perp$ | _ |
| 4  | Implement 1:4 decoder in Verilog-A                        | 10    | JI, JK         |    |      | 5, |    |          |         |         |      |          |     |     |     |      |          | Ш    | $\perp$ | 4 |
| 5  | Integrate to high level design of SPI                     | 10    | JI, JK         |    |      | 17 |    |          |         |         |      |          |     |     |     |      |          |      | ᆚ       | _ |
| 6  | Simulation and test of high level design (SPI)            | 5     | JI, JK         |    |      |    | 13 |          |         |         |      |          |     |     |     |      |          |      | $\perp$ |   |
| 7  | Implement transistor level design of the SPI unit         | 30    | JI, JK         |    |      |    |    | 15 1     | 5       |         |      |          |     |     |     |      |          |      | $\perp$ |   |
| 8  | Simulation and test of transistor design (SPI)            | 20    | JI, JK         |    |      |    |    |          | 2       | 20      |      | L        |     |     |     |      |          |      | $\perp$ |   |
| 9  | ,                                                         | 30    |                | L  |      |    |    |          | $\perp$ | $\perp$ | Ļ    | 25       | 25  | _   | L   |      |          | Ц    | $\perp$ |   |
| 0  |                                                           | 20    |                | L  |      |    |    |          | $\perp$ | $\perp$ | 1    | L        | L   | 20  |     |      |          | Ц    | $\perp$ | Ц |
| 1  | Define structure of the generator                         | 10    | AY             |    | 0    | L  |    |          |         |         | L    |          |     |     |     |      |          | Ц    | $\perp$ | Ц |
| 12 |                                                           | 10    | AY             |    |      | 0  |    |          |         |         | L    |          |     |     |     |      |          | Ц    | $\perp$ | Ц |
| 13 | Integrate to high level design of the generator           | 10    | AY             |    |      | 0  |    |          |         |         | L    |          |     |     |     |      |          | Ц    | $\perp$ | Ц |
| 4  | Simulation and test of the high level design (generator)  | 5     | AY             |    |      |    | 0  |          | ╛       |         | ┸    |          | L   |     |     |      |          | Ц    | 1       | 4 |
| 5  | Implement transistor level design of the generator unit   | 15    | JI, JK         |    |      |    |    | 10       | 5       | $\perp$ |      | L        | L   |     |     |      |          | Ш    | $\perp$ | _ |
| 6  | Simulation and test of the transistor design (generator)  | 10    | JI, JK         |    |      |    |    |          | _       | 10      |      |          |     |     |     |      |          |      | $\perp$ |   |
| 7  | Implement layout level design of generator unit           | 15    |                |    |      |    |    |          |         |         |      | 10       | 5   |     |     |      |          |      | Ш       |   |
| 8  | Simulation and test of layout (generator)                 | 10    |                |    |      |    |    |          |         |         |      |          |     | 10  |     |      |          |      |         |   |
| 9  | Define structure of the adder                             | 10    | JT, AY         |    | 3    |    |    |          |         |         |      |          |     |     |     |      |          |      |         |   |
| 20 | Implement Generate calculation logic in Verilog-A         | 10    | JT, AY         |    | 1    |    |    |          |         |         |      |          |     |     |     |      |          |      |         |   |
| 21 | Implement Propagate calculation logic in Verilog-A        | 10    | JT, AY         |    | 1    |    |    |          |         |         |      |          |     |     |     |      |          |      |         |   |
| 22 | Implement Sum calculation logic in Verilog-A              | 10    | JT, AY         |    | 1    |    |    |          |         |         |      |          |     |     |     |      |          |      |         |   |
| 23 | Integrate to high level design of adder                   | 20    | JT, AY         |    |      |    | 17 |          |         |         |      |          |     |     |     |      |          |      |         |   |
| 24 | Simulation and test of high level design (adder)          | 20    | JT, AY         |    |      |    | 3, |          |         |         |      |          |     |     |     |      |          |      |         |   |
| 25 | Implement transistor level design of the adder unit       | 40    | JT, AY         |    |      |    |    | 20 2     | 0       |         |      |          |     |     |     |      |          |      | $\perp$ |   |
| 26 | Simulation and test of the transistor design (adder)      | 20    | JT, AY         |    |      |    |    | 1        | 0       | 10      |      |          |     |     |     |      |          |      | $\perp$ |   |
| 27 | Implement layout level design of adder unit               | 40    |                |    |      |    |    |          |         |         |      | 10       | 13  | 15  | 15  |      |          |      | $\perp$ |   |
| 28 | Simulation and test of layout (adder)                     | 20    |                |    |      |    |    |          |         |         |      |          |     |     |     | 20   |          |      | $\perp$ |   |
| 29 | Define structure of the comparator                        | 5     | JT             |    | 0    |    |    |          |         |         |      |          |     |     |     |      |          |      | Ш       |   |
| 30 | Implement bit comparator in Verilog-A                     | 5     | JI             |    |      | 0  |    |          |         |         |      |          |     |     |     |      |          |      | $\perp$ | ] |
| 31 | Integrate to high level design of the comparator          | 10    | JI             |    |      | 2  |    |          |         |         |      |          |     |     |     |      |          |      |         |   |
| 32 | Simulation and test of the high level design (comparator) | 5     | JI             |    |      |    | 2  |          |         |         |      |          |     |     |     |      |          |      |         |   |
| 33 | Implement transistor level design of the comparator unit  | 20    | JT, AY         |    |      |    |    | 5        | 5       | Ι       | Ι    | Γ        | Г   |     |     |      |          |      | Ι       | ] |
| 34 | Simulation and test of the transistor design (comparator) | 10    | JT, AY         | Γ  |      |    |    |          | ŀ       | 10      | Ι    | Γ        |     |     |     |      |          |      | Ι       |   |
| 35 | Implement layout level design of comparator unit          | 20    |                |    |      |    |    |          |         | Ι       | Ι    | 15       | 15  |     |     |      |          |      | Ι       |   |
| 36 | Simulation and test of layout (comparator)                | 10    |                | L  |      |    |    |          | J       | Ι       | Ι    |          |     | 10  |     |      |          |      | Ι       | ] |
| 37 | Off-chip hardware interface                               | 30    |                | L  |      |    |    |          | J       | Ι       | Ι    | Γ        | L   |     |     | Ĺ    | 15       | 15   | Ι       | ] |
| 88 | Documentation and presentation                            | 60    |                | 28 |      |    | 35 | 2        | 0 2     | 20      | Ι    |          | L   |     |     | Ĺ    |          | 10 2 | 20      | ] |
| 39 | Meetings                                                  | 60    |                |    | 11   | 0  | 0  | 4        | 4       | 4       | I    | 4        |     | 4   | 4   | 4    | 4        | 4    | 4       | 4 |
| 0  | Buffer time                                               | 80    |                |    |      |    |    | 15 1     | 0       | 10      | I    | 5        | 5   | 5   | 5   | 5    | 5        | 5 1  | 10      | 0 |
| 11 | High level integration                                    | 15    | JI, JH, JT, AY |    |      |    | 14 |          |         |         | I    |          | L   |     |     | Ĺ    |          |      |         | ] |
| 12 | Transistor level integration                              | 10    |                |    |      |    |    |          | ľ       | 10      | I    |          |     |     |     |      |          |      |         | ] |
| 13 | Layout level integration                                  | 15    |                |    |      |    |    |          | I       | ፗ       | Ι    | Γ        | Γ   |     | 30  |      |          | J    | Ţ       | ] |
| 14 | Implementation of test bench for SPI                      | 5     | JH             |    |      | 0  |    |          | 1       | T       |      |          |     |     |     |      |          |      | T       | 1 |
| 15 | Implementation of test bench for generator                | 5     | JI             |    | 0    |    |    |          | T       | T       |      |          |     |     |     |      |          |      | T       | 1 |
| 16 | Implementation of testbench for adder                     | 10    | AY             |    | 0    |    |    | $\Box$   | 1       | T       | T    | T        |     |     |     |      |          | П    | T       | 1 |
| 17 | Implementation of test bench for comparator               | 5     | JT             |    | 0    |    |    |          | ]       |         |      | Ι        |     |     |     |      |          |      | T       | 1 |
| 18 | Implementation of test bench for the complete system      | 20    |                | Γ  |      |    | 15 | $\sqcap$ | T       | T       | T    | T        | Γ   |     | Π   | Г    | П        | П    | T       | 1 |
| Ť  | Sum, number of hours                                      | 755   |                |    | 30,5 | 34 | 99 | 69 8     | 9       | 14      | ) (  | 69       | 67  | 64  | 54  | 29   | 24       | 34 3 | 34      | 4 |