# EECE6080 - HW 3

Max Thrun & Xiaohui Qi

October 4, 2013

The objective of this part was to write a VHDL model of a parameterized n-bit FUN generator which accepts a generic parameter n and two n-bit vectors. The VHDL model we came up with is shown below:

```
library IEEE;
1
   use ieee.std_logic_1164.all;
2
   entity fun is
        generic (
5
            n : integer := 8
        );
        port (
                        std_logic_vector(n-1 downto 0);
                        std_logic_vector(n-1 downto 0);
            b
                 : in
10
                 : out std_logic_vector(n-1 downto 0)
11
12
   end entity;
13
14
   architecture rtl of fun is
15
   begin
16
        process (a,b) begin
17
            if (a<b) then
18
                 f \ll a;
19
            else
20
                 f \ll b;
21
            end if;
        end
            process;
23
   end
       rtl;
24
```

Listing 1: Generic n-bit FUN generator

In order to test the functionality of our module we decided on an exhaustive test that checked every possible input. Given the small size of the design this was a reasonable approach as the execution time was negligible and it proved 100% correct functionality in the design. By using assert statements to check the output of the FUN module for each input condition the test is completely automated. The testbench used to drive the simulation is shown below. Note that the same testbench was used for both 3 and 8 bit-widths with the only change being the constant n.

```
library ieee;
1
   use ieee.std_logic_1164.all;
   use ieee.numeric_std.all;
   use ieee.std_logic_unsigned.all;
   entity fun_3_tb is
   end fun_3_tb;
   architecture tb_rtl of fun_3_tb is
9
10
       constant n : integer := 3;
11
12
       signal a : std_logic_vector(n-1 downto 0);
13
       signal b : std_logic_vector(n-1 downto 0);
14
       signal f : std_logic_vector(n-1 downto 0);
15
16
       component fun
17
            generic (
                n : integer := 8
19
            );
20
```

```
port (
21
                           std_logic_vector(n-1 downto 0);
22
                 a: in
                           std_logic_vector(n-1 downto 0);
23
                  f : out std_logic_vector(n-1 downto 0)
24
             );
25
        end component;
26
27
   begin
28
29
30
        uut : fun
31
        generic map(
             n \Rightarrow n
32
33
        port map(
34
             a \Rightarrow a,
35
             b \Rightarrow b,
             f \Rightarrow f
37
        );
38
39
        process begin
40
             for aa in 0 to (2**n)-1 loop
41
                  for bb in 0 to (2**n)-1 loop
43
                      a <= std_logic_vector(to_unsigned(aa, a'length));</pre>
44
                      b <= std_logic_vector(to_unsigned(bb, b'length));</pre>
45
                      wait for 10 ns;
46
                      if (a < b) then
47
                           assert f = a report "F not equal A" severity error;
48
                       else
                           assert f = b report "F not equal B" severity error;
                      end if;
51
52
                  end loop;
53
             end loop;
54
             report "Test Complete" severity note;
55
             wait;
57
        end process;
58
   end tb_rtl;
59
```

 $\textbf{Listing 2:} \ \mathrm{FUN} \ \mathrm{Testbench}$ 

While our testbench is exhaustive and reported no failures we still spot checked the waveforms to ensure it was working as expected. The two screenshots below show that each configuration is performing as expected with the correct functionality.



Figure 1: 3-Bit FUN Generator Simulation Waveform



Figure 2: 8-Bit FUN Generator Simulation Waveform

The objective of this part was to redesign the FUN module to utilize 1-bit slices. In order to do this we first came up with a slice design which we think is fairly optimized. It is composed of two main parts: a magnitude comparator, and a 2:1 multiplexer. The idea is that you can chain the magnitude comparators together and loop back the output of the last comparator to the select lines of each mux. A schematic representation of this design is shown below.



Figure 3: Gate Level Slice

An example circuit showing 4 slices changed together is shown below to illustrate the connections.



Figure 4: 4 Slice Chain

With the slice designed we translated it into a behavioral VHDL module shown below.

```
library ieee;
   use ieee.std_logic_1164.all;
2
   entity slice is
4
        port (
5
            a : in std_logic;
6
            b : in std_logic;
            s : in std_logic;
            f : out std_logic;
            c_i : in std_logic;
10
            c_o : out std_logic
11
        );
12
   end slice;
13
14
   architecture rtl of slice is begin
15
16
        process(a, b, s, c_i) begin
17
18
            — comparator
19
            if (a = '1') and (b = '0') then
20
                 c_{-}o <= \ '0';
21
            elsif (a = '0') and (b = '1') then
22
                 c_{-}o <= '1';
23
24
                 c_o <= c_i;
25
            end if:
26
27
28
            — тих
            if (s = '1') then
29
                 f \ll a;
30
            else
31
                 f \leq b:
32
            end if;
33
34
        end process;
35
36
   end rtl;
37
```

Listing 3: Single Slice

We then transformed the FUN module to use generate in order to automatically chain n slices together. The FUN module is fairly simple with just a single generate block and a vector to hold the carries between slices.

```
library ieee;
   use ieee.std_logic_1164.all;
2
   entity fun is
       generic (
5
           n : integer := 8
6
       );
       port (
           a : in std_logic_vector((n-1) downto 0);
           b : in std_logic_vector((n-1) downto 0);
10
            f : out std_logic_vector((n-1) downto 0)
11
       );
12
   end fun;
13
```

```
14
   architecture rtl of fun is
15
16
        component slice
17
             port (
18
                         : in std_logic;
19
                       b : in std_logic;
20
                       s : in std_logic;
21
                       f : out std_logic;
22
23
                       c_i : in std_logic;
24
                       c_o : out std_logic
                  );
25
        end component;
26
27
        -- vector to hold carry values between slices
28
        signal carry : std_logic_vector(n downto 0) := (others => '0');
29
30
31
        fun_gen : for i in 0 to n-1 generate
32
             slice_i: slice port map(
33
                  a \Rightarrow a(i),
34
                  b \Rightarrow b(i),
                  f \Rightarrow f(i),
36
                                          -- i+1 because carry(0) is initial
                  c_o \Rightarrow carry(i+1),
37
                  c_i \Rightarrow carry(i),
                                          -- carry(i) is c<sub>-</sub>o of previous slice
38
                                          -- mux uses output of last slice
                  s \Rightarrow carry(n)
39
40
             );
        end generate;
41
42
   end rtl;
```

Listing 4: FUN module using 'generate'

In order to ensure that the generate block was mapping the slices correctly we synthesized a 3-bit version of our design in Quartus and viewed the resulting RTL block diagram. From the diagram below we can see that our slices are properly linked and the carry out of the final slice is connected to the mux input of each slice. We can also see that the carry in to the first slice is a 0 which is the expected configuration.



Figure 5: RTL block diagram of generated slices

Once we verified that our generate block was correct we ran it through the same simulations from Parts 1&2. Both simulations completed successfully and spot checks of the resulting waveforms also confirmed correct functionality. Given that we ran the exact same exhaustive testbench as Parts 1&2 we can say with certainty that our sliced design is functionally equivalent.



Figure 6: 3-Bit FUN Generator



Figure 7: 8-Bit FUN Generator

In this section we were required to each design a layout based on the slice design from Part 2. Since we already had a gate level representation of our slice it was fairly easy to convert it to a full CMOS representation using FETs. A diagram illustrating this representation is shown below with all transistors labeled. The size of the every transistor for both designs is tabulated below:

| Parameter | $\mathbf{Size}$ |
|-----------|-----------------|
| length    | 0.6u            |
| n width   | 1.2u            |
| p width   | 1.2u            |

Table 1: Transistor Dimensions



Figure 8: Transistor Diagram

# Design A (Qi)



Figure 9: Design A: Layout

```
stepsize 10
analyzer A B cin S F cout
vector vec1 A B cin S
setvector vec1 1100
s setvector vec1 0011
s setvector vec1 1111
s setvector vec1 0111
s setvector vec1 1000
s setvector vec1 1000
s setvector vec1 1000
```

Listing 5: Design A: IRSIM Command File



Figure 10: Design A: IRSIM Output

```
1 ********
2
3 .include ../../../models/model_t36s.sp
4 .include bitslice.spice
5
6 VDD vdd gnd 5v
7 Va A gnd PWL(0 5v 4.9n 5v 5n 0v 9.9n 0v 10n 5v 14.9n 5v 15n 0v 19.9n 0v 20n 5v 24.9n 5v)
8 Vb B gnd PWL(0 5v 4.9n 5v 5n 0v 9.9n 0v 10n 5v 19.9n 5v 20n 0v 24.9n 0v)
9 Vcin cin gnd PWL(0 0v 4.9n 0v 5n 5v 19.9n 5v 20n 0v 24.9n 0v)
10 Vs S gnd PWl(0 0v 4.9n 0v 5n 5v 19.9n 5v 20n 0v 24.9n 0v)
11 .option post
12 .tran 0.1n 25n
13 .end
```

Listing 6: Design A: Spice Simulation File



Figure 11: Design A: A to C\_OUT Delay



Figure 12: Design A: A to F Delay

# Design B (Thrun)



Figure 13: Design B: Layout

```
stepsize 50
1
   logfile irsim_f.log
  h VDD
5
   I GND
6
   I A B C_IN S
   vector in A B C_IN S
  ana A B C_IN S C_OUT F
  w A B C_IN S C_OUT F
12
   setvector in 0000
13
14
   path F
15
   setvector in 0001
16
17
   path F
18
   setvector in 0010
19
   S
20
```

Listing 7: Design B: IRSIM Command File (truncated)



Figure 14: Design B: IRSIM Output

```
* MODELS
2
3
   . \ include \ \ldots / \ldots / \ models / model\_t36s.sp
5
   .include slice.spice
   * CIRCUIT
10
11
   VDD vdd gnd 5V
12
13
14
   * SIMULATION
15
16
17
   * delay B to F
18
            gnd PWL(0n 0V)
   ∗va A
```

```
gnd PWL(0n 0V 4.9n 0V 5n 5V 9.9n 5V 10n 0V)
   *vb
        В
        C_{-}IN
               gnd PWL(0n 5V 4.9n 5V 5n 0V 9.9n 0V 10n 5V)
   * V C
21
        S
               gnd PWL(0n 0V)
   * V S
23
   * delay B to C_OUT
24
              gnd PWL(0n 0V)
   va
25
   vb
       В
              gnd PWL(0n 0V 4.9n 0V 5n 5V 9.9n 5V 10n 0V)
26
              gnd PWL(0n 0V)
       C_IN
27
   ٧C
              gnd PWL(0n 0V)
       S
28
   ٧s
29
30
   .option post
   .tran 0.01n 15n
31
   . end
32
```

Listing 8: Design B: Spice Simulation File



Figure 15: Design B: B to C\_OUT Delay



Figure 16: Design B: B to F Delay

## **Design Summary**

A comparison of delays obtained from hspice for both designs is tabulated below. From the results we can see that design B is significantly faster. Given this result, combined with the fact that the layout of design B lends itself to easier patterning, led us to choose design B as the design to move forward with.

| Parameter           | Design A             | Design B  |
|---------------------|----------------------|-----------|
| A/B to C_Out (Fall) | 820.44 ps            | 393.48 ps |
| A/B to C_Out (Rise) | $1.1378 \mathrm{ns}$ | 593.89 ps |
| A/B to F (Fall)     | $1.7239 \mathrm{ns}$ | 293.53 ps |
| A/B to F (Rise)     | $2.0320 \mathrm{ns}$ | 309.88 ps |

Table 2: Delay Comparison

The objective of this part was to rewrite our slice architecture from Part 2 to use component instances of VHDL models of the static gates we used to implement the slice ensuring that we include the worst case delay. In order to achieve this the first thing we needed to do was simulate each gate and find the worst case delay times. The spice simulation file and the resulting output for each of the three different gates are shown below.



Figure 17: 2 Input NAND Gate

```
.include ../../models/model_t36s.sp
   .param n=4
   .param wp=0.3u*n
   .param wn=0.3u*n
   .param hd=2u
   mPMos1 c a vdd vdd pfet w=wp I=0.6u ad=hd*wp pd=2*hd+2*wp as=hd*wp ps=2*hd+2*wp
   mPMos2 b a vdd vdd pfet w=wp I=0.6u ad=hd*wp pd=2*hd+2*wp as=hd*wp ps=2*hd+2*wp
10
  mNMos1 c a d d nfet w=wn l=0.6u ad=hd*wn pd=2*hd+2*wn as=hd*wn ps=2*hd+2*wn
^{11}
  mNMos2 d b gnd gnd nfet w=wn l=0.6u ad=hd*wn pd=2*hd+2*wn as=hd*wn ps=2*hd+2*wn
12
  VDD vdd gnd 5v
14
   Va a gnd PWL(0 0v 4.9n 0v 5n 5v 9.9n 5v 10n 0v 14.9n 0v )
15
   Vb b gnd PWL(0 0v 4.9n 0v 5n 5v 9.9n 5v 10n 5v 14.9n 5v )
16
17
   .option post
18
   .tran 0.1n 15n
   . end
20
```

Listing 9: 2 Input NAND Gate Spice Simulation File



Figure 18: 3 Input NAND Gate

```
.include ../../ models/model_t36s.sp
   .param n=4
   .param wp=0.3u*n
   .param wn=0.3u*n
   .param hd=2u
   mPMos1 d a vdd vdd pfet w=wp l=0.6u ad=hd*wp pd=2*hd+2*wp as=hd*wp ps=2*hd+2*wp
   mPMos2 d b vdd vdd pfet w=wp I=0.6u ad=hd*wp pd=2*hd+2*wp as=hd*wp ps=2*hd+2*wp
   mPMos3 d c vdd vdd pfet w=wp I=0.6u ad=hd*wp pd=2*hd+2*wp as=hd*wp ps=2*hd+2*wp
10
11
   mNMos1 \ d \ a \ e \ e \ nfet \ w\!\!=\!\!wn \ l = 0.6u \ ad = hd * wn \ pd = 2*hd + 2*wn \ as = hd * wn \ ps = 2*hd + 2*wn
   mNMos2 \ e \ b \ f \ f \ nfet \ w\!\!=\!\!wn \ I=0.6u \ ad=hd*wn \ pd=2*hd+2*wn \ as=hd*wn \ ps=2*hd+2*wn
   mNMos3 f c gnd gnd nfet w=wn I=0.6u ad=hd*wn pd=2*hd+2*wn as=hd*wn ps=2*hd+2*wn
14
15
   VDD vdd gnd 5v
16
   Va a gnd PWL(0 0v 4.9n 0v 5n 5v 9.9n 5v 10n 0v 14.9n 0v )
17
   Vb b gnd PWL(0 0v 4.9n 0v 5n 5v 9.9n 5v 10n 5v 14.9n 5v )
   Vc c gnd PWL(0 0v 4.9n 0v 5n 5v 9.9n 5v 10n 5v 14.9n 5v )
19
20
   .option post
21
   .tran 0.1n 15n
22
   end
23
```

 $\textbf{Listing 10:} \ \ 3 \ \ \text{Input NAND Gate Spice Simulation File}$ 



Figure 19: NOT Gate

```
.include ../../ models/model_t36s.sp
   .param n=4
  .param wp=0.3u*n
  .param wn=0.3u*n
   .param hd=2u
  mPMos1 b a vdd vdd pfet w=wp l=0.6u ad=hd*wp pd=2*hd+2*wp as=hd*wp ps=2*hd+2*wp
  mNMos2 b a gnd gnd nfet w=wn l=0.6u ad=hd*wn pd=2*hd+2*wn as=hd*wn ps=2*hd+2*wn
10
  VDD vdd gnd 5v
11
  Va a gnd PWL(0 0v 4.9n 0v 5n 5v 9.9n 5v 10n 0v 14.9n 0v 15n 5v 19.9n 5v)
12
13
   .option post
14
  .tran 0.1n 20n
  . end
16
```

Listing 11: NOT Gate Spice Simulation File

The worst case delays for each of these gates is tabulated below.

| Gate   | Worst Case Delay |
|--------|------------------|
| NAND-2 | 260.02 ps        |
| NAND-3 | 528 ps           |
| NOT    | 256.55 ps        |

Table 3: Worst Case Gate Delay

Knowing the worst case delay for each gate we can construct each as a separate module in VHDL.

```
library ieee;
   use ieee.std_logic_1164.all;
2
   entity nandgate2 is
4
       port (
5
       a,b:in std_logic;
6
       c : out std_logic
   );
   end entity;
10
   architecture arc of nandgate2 is
11
   begin
12
       process(a,b) begin
13
            c \le not(a and b) after 260.02 ps;
14
       end process;
15
   end architecture;
```

Listing 12: 2 Input NAND Gate Spice Simulation File

```
library ieee;
   use ieee.std_logic_1164.all;
2
   entity nandgate3 is
       port (
       a,b,c:in std_logic;
       d: out std_logic
   );
   end entity;
9
10
   architecture arc of nandgate3 is
11
12
       process(a,b,c) begin
13
            d <= not(a and b and c) after 528 ps;</pre>
14
       end process;
15
   end architecture;
```

Listing 13: 3 Input NAND Gate Spice Simulation File

```
library ieee;
   use ieee.std_logic_1164.all;
2
   entity notgate is
       port (
5
                 a : in std_logic;
6
                 b : out std_logic
   end entity;
   architecture arc of notgate is
12
       process (a) begin
13
            b <= not a after 256.55 ps;
14
       end process;
15
   end architecture;
```

Listing 14: NOT Gate Spice Simulation File

With each gate implemented we can rewrite our slice to match the original gate level design shown in part 2. The new slice module is shown below. Note that the FUN module remains the same as Part 2.

```
library ieee;
1
   use ieee.std_logic_1164.all;
2
   entity slice is
       port (
            a : in std_logic;
           b : in std_logic;
            s : in std_logic;
            f : out std_logic;
9
            c_i : in std_logic;
10
            c_o : out std_logic
11
       );
12
   end slice;
13
14
   architecture rtl of slice is
15
       -- comparator
16
       signal g1_o : std_logic;
17
       signal g2_o : std_logic;
19
       signal g3_o : std_logic;
       signal g4_o : std_logic;
20
21
       — тих
22
       signal g6_o : std_logic;
23
       signal g7_o : std_logic;
24
       signal g8_o : std_logic;
25
26
   begin
27
28
       — comparator
29
       g1 : entity work.notgate
                                    port map(a,
                                                   g1_o);
30
       g2 : entity work.nandgate2 port map(b,
31
                                                   g1_o, g2_o);
       g3 : entity work.nandgate2 port map(c_i, g1_o, g3_o);
32
       g4 : entity work.nandgate2 port map(c_i, b, g4_o);
33
       g5 : entity work.nandgate3 port map(g2_o, g3_o, g4_o, c_o);
34
35
       -- mux
36
       g6 : entity work.notgate
                                    port map(s, g6_o);
37
       g7 : entity work.nandgate2 port map(s, a, g7_o);
       g8 : entity work.nandgate2 port map(b, g6_o, g8_o);
39
       g9 : entity work.nandgate2 port map(g7_o, g8_o, f);
40
41
   end architecture:
```

Listing 15: Slice With Static Gates

In our design the worst case delay is when a change in either A0 or B0 results in a change in F0. The change in A0 or B0 has to propagate through all the slices and then back through all the muxes until it finally reaches F0. Knowing this we can target our testbench to only simulate this condition. The testbench simulating a 3-bit configuration is shown below. Note that the testbench for the 8-bit configuration is exactly the same except the n constant is changed to 8 and the input vectors are also 8 bit.

```
library ieee;
2
   use ieee.std_logic_1164.all;
   entity fun_3_tb is
4
   end fun_3_tb;
5
   architecture rtl of fun_3_tb is
        constant n : integer := 3;
10
        signal a : std_logic_vector(n-1 downto 0);
11
        signal b : std_logic_vector(n-1 downto 0);
12
        signal f : std_logic_vector(n-1 downto 0);
13
14
        component fun
15
             generic (
16
                  n:integer:=8
17
             );
18
             port (
19
                           std_logic_vector(n-1 downto 0);
20
                  a : in
                            std_logic_vector(n-1 downto 0);
21
                  b: in
                  f : out std_logic_vector(n-1 downto 0)
22
             );
23
        end component;
24
25
26
   begin
27
        uut : fun
28
        generic map(
29
             n \Rightarrow n
30
31
        port map(
32
             a \Rightarrow a
33
             b \Rightarrow b,
             f \Rightarrow f
35
        );
36
37
        process begin
38
             a \le 001";
39
             b \le 001";
40
             wait for 100 ns;
41
             a<="000":
42
             b<="001";
43
             wait for 100 ns;
44
             a<="001";
45
             b <= "001";
46
             wait for 100 ns;
47
             wait;
48
        end process;
49
50
   end rtl;
51
```

Listing 16: Static Gate Slice Testbench

Looking at the output waveform we can measure the delta between the input state change and the output state change. Measurements were taken for both a falling edge and rising edge change for both the 3 and 8 bit configurations. The four output waveforms are shown below.



Figure 20: 3-Bit Fall Delay



Figure 21: 3-Bit Rise Delay



Figure 22: 8-Bit Fall Delay



Figure 23: 8-Bit Rise Delay

All the delays are tabulated below.

| Configuration | Fall Delay           | Rise Delay          |
|---------------|----------------------|---------------------|
| 3-Bit         | $3.3946 \mathrm{ns}$ | $0.519 \mathrm{ns}$ |
| 8-Bit         | $7.3321 \mathrm{ns}$ | $0.519 \mathrm{ns}$ |

 ${\bf Table~4:~Worst~Case~Delay~Through~Chain}$ 

The objective of this section is to generate the layouts of a 3-bit and 8-bit FUN generator and determine their worst case delays. This was easily achieved by using the array command in magic. Since our slice tiles together perfect it was almost trivial to chain them. The only paint we had to add was to connect the last slices C\_OUT to the last slices S. The layouts for both configurations are shown below.



Figure 24: 3-Bit Layout



Figure 25: 3-Bit Layout Expanded



Figure 26: 8-Bit Layout



Figure 27: 8-Bit Layout Expanded

In order to test the functionality of our multi-bit FUN generators we used two different strategies. For the 3-bit version we simply did an exhaustive simulation and manually looked at the waveform. To generate all the input vectors a simple Python script was written. For the 8-bit version a similar strategy was used except instead of a completely exhaustive search we skip over chunks.

```
for a in range(0, 2**3):
    for b in range(0, 2**3):
        print "setvector A %s" % (bin(a).replace("0b","").zfill(3))
        print "setvector B %s" % (bin(b).replace("0b","").zfill(3))
        print "s"
        print "path F"
```

Listing 17: Python Vector Generator

The output IRSIM waveforms for each configuration is shown below.



Figure 28: 3-Bit IRSIM Waveform



Figure 29: 8-Bit IRSIM Waveform

To get the worst case delay out of IRSIM we utilized the PATH command which tells you delay through the critical path. Interestingly it reported that our worst case delay was **0.003ns** for each configuration. This is hard to believe but it might be possible due to the fact that IRSIM says it's 'Ignoring lumped-resistance'. In regards to trying to increase the accuracy of the VHDL model it could be possible to account for additional capacitances introduced by the wiring of all the gates. Instead of introducing the delay per gate you could introduce the total delay per slice. A table comparing these results to the VHDL simulation is shown below.

| Configuration  | VHDL                 | IRSIM                                  |
|----------------|----------------------|----------------------------------------|
| 3-Bit<br>8-Bit | 3.3946ns<br>7.3321ns | $0.0030 \text{ns} \\ 0.0030 \text{ns}$ |

Table 5: Worst Case Delay Comparison Between VHDL, IRSIM, and Spice

The objective of this part was to simulate the layout level designs using Spice. Since we only wanted to focus on the worst case delay we chose an input transition on slice 0 that causes and output transition on slice 0. The simulation file to achieve this is shown below. Note that the simulation file for the 8-bit version is exactly the same except 5 more inputs are added, all pulled to ground.

```
MODELS
   *
2
3
   .include .../../models/model_t36s.sp
   .include ../part_5/fun_3.spice
     CIRCUIT
9
10
11
   VDD vdd gnd 5V
12
13
14
   * SIMULATION
15
16
17
   va0
         A0
                 gnd PWL(0n 5V 4.9n 5V 5n 0V 19.9n 0V 20n 5v)
18
                 gnd PWL(0n 0V)
   va1
         A1
19
         A2
                 gnd PWL(0n 0V)
   va2
20
21
   vb0
         B0
                 gnd PWL(0n 5V)
22
                 gnd PWL(0n 0V)
23
   vb1
         B1
                 gnd PWL(0n 0V)
         B2
24
   vb2
25
   .option post
26
   .tran 0.01n 25n
27
   . end
```

 $\textbf{Listing 18:} \ \, 3\text{-Bit FUN Generator Spice Simulation}$ 

The output from both simulations is shown below



Figure 30: 3-Bit Spice Waveform



Figure 31: 8-Bit Spice Waveform

From the results we can see that our designs do seem to work correctly. Looking at a final comparison of all the delays between VHDL, IRSIM, and Spice we can clearly see that the VHDL simulation and Spice simulation are in agreement. The IRSIM simulation results are unfortunately not in line with the other two simulations. This leads us to believe that our method of obtaining the worst case delay with IRSIM is incorrect.

| Configuration | VHDL                 | IRSIM                | Spice                |
|---------------|----------------------|----------------------|----------------------|
| 3-Bit         | $3.3946 \mathrm{ns}$ | $0.0030 \mathrm{ns}$ | 4.6083 ns            |
| 8-Bit         | $7.3321\mathrm{ns}$  | $0.0030 \mathrm{ns}$ | $10.496 \mathrm{ns}$ |

Table 6: Worst Case Delay Comparison Between VHDL, IRSIM, and Spice

There are a few things we can take with us into future design processes. The first of which is spending some time upfront to figure out why IRSIM does not seem to report the right delay. At the moment it seems that the easiest method to obtain an accurate simulation of our design is to export the Spice netlist from magic of either a individual gate or a functional block such as a slice. This spice simulation can be simple as we just want to know the worst case delay. Once we obtain the delay we can add it to our VHDL model. Using the VHDL model provides much more flexibility in terms of functional testing as we can totally automate the testbench as we did in this lab. Hopefully, with this insight we will be able to accomplish future design tasks in a more efficient manner.

## Task Breakdown

| Task   | Person                              |
|--------|-------------------------------------|
| Part 1 | Qi                                  |
| Part 2 | Thrun                               |
| Part 3 | Both                                |
| Part 4 | Qi                                  |
| Part 5 | Thrun                               |
| Part 6 | Qi (with Thrun input on conclusion) |

Table 7: Task Assignment