# **SoC Design Laboratory Lab3**

EE 112061545 Chih-ling Wu SoC Lab3 Github Link

#### 1. Introduction:

In this lab, we implemented a FIR system in Verilog, using AXI-Lite interfaces for coefficients and Stream interfaces for data. We also collaborated with a BRAM module to handle data efficiently. The core focus of this lab is to master the control of handshake signals within these protocols and to create a well-designed data flow.

# 2. System Block diagrams:



Fig1. FIR module block diagram



Fig2. Data path block diagram breakdown

# (1) Dataflows:

• Shift Register:

Input data is stored and shifted as new data arrives.

• MAC:

Performing the multiplication and addition operations for the FIR filter.

• Data RAM:

Storing input data.

• Tap RAM:

Storing filter coefficients (tap parameters).

# (2) Control signals:

Handshake Signals:

Facilitating communication with the AXI-Lite interface and the stream interface.

• Flags:

Indicating the configuration and operation status of the filter (ap\_start, ap\_idle, ap\_done).

• State Machine:

To control the operation of the FIR filter with states (IDLE, LOAD, MAC, DONE).

# FSM for fir computing



Fig3. Design for finite state machine for fir computing

# 3. Operations:

(1) How to Receive Data-In and Tap Parameters and Place into SRAM?

Input data is received through the "Stream Interface" (ss\_tvalid, ss\_tdata, ss\_tlast). Data is loaded into the "Data RAM" for processing and stored in the "Shift Register."

Filter coefficients (tap parameters) are configured through the AXI-Lite interface using write transactions. The coefficients are loaded into the "Tap RAM" for use in the FIR filter.

(2) How to Access Shift Register and Tap RAM to Do Computation?

During the "MAC" state, the FIR filter retrieves data from the "Shift Register" to perform the computation. The data is used as input to the "MAC Unit."

The FIR filter accesses the "Tap RAM" to retrieve the filter coefficients (tap parameters) corresponding to the current computation. These coefficients are also used by the "MAC Unit" for the multiplication operation.

# (3) How to generate ap signals?

ap\_start: This signal is generated when the coefficient read is done. It is set to 1 for one clock cycle when the address for reading coefficients is detected.

ap\_done: This signal is generated based on conditions, including the output count matching the data length and the current state not being IDLE.

ap\_idle: It is set to 1 when the FIR filter is idle or not actively processing data. It transitions to 0 when a new operation starts (ap\_start is 1) and returns to 1 when the operation is complete.

- (4) How to generate handshake signals in two interfaces?
- \* AXI-Lite Interface Handshake Signals:
- awready and arready (write and Read Ready): These signals indicate readiness to accept addresses for write and read operations, respectively. They are updated based on the presence of valid address signals.
- wready and rready (write and Read Ready): These signals reflect readiness to accept data for writing or to send data for reading. They are directly derived from the presence of valid data signals.

#### **※**Stream Interface Handshake Signals:

• ss\_tready (ready to Receive Data): This signal represents readiness to receive data through the Stream interface. Its status depends on the current state and the

presence of valid data.

• sm\_tvalid (valid for Sending Back Output Data): This signal indicates when the module is ready to send back output data through the Stream interface. It becomes valid when specific conditions are met, allowing the module to transmit data.

#### 4. Simulation waveform & results:

(1) Coefficient program, and read back:



Fig4. Coefficient read / program waveform

```
-Start simulation----
   -Start the data input(AXI-Stream)----
   Start the coefficient input(AXI-lite)----
Check Coefficient ...
                     0, rdata =
0K: exp =
                                           0
                   -10, rdata =
                                         -10
0K: exp =
                    -9, rdata =
                    23, rdata =
                                          23
                    56, rdata =
                                          56
                                          63
                    63, rdata =
                                          56
                    56, rdata =
0K: exp =
                    23, rdata =
                                          23
0K: exp =
                    -9, rdata =
                                          -9
                   -10, rdata =
                                         -10
0K: exp =
0K: exp =
                     0, rdata =
```

Fig5. Coefficient read / program simulation result

#### (2) Data-in stream-in:



Fig6. Data stream in waveform

#### (3) Data-out stream-out:



Fig7. Data stream out waveform

```
Golden answer:
                                                        -6588, Your answer:
                                                                                     -6588
                          569] Golden answer:
 [PASS]
        [Pattern
                                                       -6405, Your answer:
                                                                                     -6405
                                                       -6222, Your answer:
 PASS
        [Pattern
                          570] Golden answer:
 [PASS]
                          571] Golden answer: 572] Golden answer:
        [Pattern
                                                        -6039, Your answer:
                                                                                     -6039
[PASS]
        [Pattern
                                                       -5856, Your answer:
                                                                                     -5856
 [PASS]
        [Pattern
                          573] Golden answer:
                                                       -5673, Your answer:
                                                                                     -5673
                          574]
575]
 PASS 1
                                                       -5490, Your answer:
                                                                                     -5490
        [Pattern
                               Golden answer:
                                                       -5307, Your answer:
 [PASS]
        [Pattern
                               Golden answer:
                          576] Golden answer:
                                                       -5124, Your answer:
 [PASS]
        [Pattern
                                                                                     -5124
                                                       -4941, Your answer:
                          577]
578]
 PASS 7
        [Pattern
                               Golden answer:
                                                                                     -4941
 PASS]
        [Pattern
                               Golden answer:
                                                       -4758, Your answer:
                                                                                     -4758
                          579Ī
[PASS]
                                                       -4575, Your answer:
                                                                                     -4575
        [Pattern
                               Golden answer:
                                                       -4392, Your answer:
 [PASS]
        [Pattern
                          580]
                               Golden answer:
                          581]
582]
 [PASS]
        [Pattern
                               Golden answer:
                                                       -4209, Your answer:
                                                                                     -4209
                                                       -4026, Your answer:
 [PASS]
        [Pattern
                                                                                     -4026
                               Golden answer:
 [PASS]
                          583]
                               Golden answer:
                                                       -3843, Your answer:
        [Pattern
                                                                                     -3843
                          584]
                                                                                     -3660
 PASS 1
        [Pattern
                               Golden answer:
                                                       -3660, Your answer:
 [PASS]
        [Pattern
                          585]
                               Golden answer:
                                                       -3477, Your answer:
                                                                                     -3477
 PASS]
                          586]
                               Golden answer:
                                                       -3294, Your answer:
                                                                                     -3294
        [Pattern
                                                       -3111, Your answer:
                                                                                     -3111
 PASS
        [Pattern
                          587]
                               Golden answer:
                                                       -2928, Your answer: -2745, Your answer:
 [PASS]
        [Pattern
                          588]
                               Golden answer:
                                                                                     -2928
                          589] Golden answer:
        [Pattern
 [PASS]
                                                                                     -2745
 [PASS]
        [Pattern
                          590] Golden answer:
                                                       -2562, Your answer:
                                                                                     -2562
                          591] Golden answer:
592] Golden answer:
                                                       -2379, Your answer:
-2196, Your answer:
 PASSĪ
        [Pattern
                                                                                     -2379
        [Pattern
 [PASS]
                               Golden answer:
                                                                                     -2196
        [Pattern
                          593] Golden answer:
[PASS]
                                                       -2013, Your answer:
                                                                                     -2013
[PASS] [Pattern
[PASS] [Pattern
                          594] Golden answer:
595] Golden answer:
                                                       -1830, Your answer:
                                                                                     -1830
                                                       -1647, Your answer:
                                                                                     -1647
0K: exp =
                      0, rdata =
[PASS] [Pattern
                          596] Golden answer:
                                                       -1464, Your answer:
                                                                                     -1464
 -----End the data input(AXI-Stream)-----
[PASS] [Pattern
                          597] Golden answer:
                                                       -1281, Your answer:
                                                                                     -1281
                                                       -1098, Your answer:
-915, Your answer:
[PASS] [Pattern
                          598] Golden answer:
                                                                                     -1098
                          599] Golden answer:
[PASS] [Pattern
                                                                                      -915
0K: exp =
                       2, rdata =
0K: exp =
                      4, rdata =
 ------Congratulations! Pass-----
Simulation complete via $finish(1) at time 78675 NS + 0
./tb/fir tb.v:200
                             $finish;
ncsim> exit
[chihlinggg@node01 fir]$
```

Fig8. Data-out match golden results

#### (4) RAM access control:



Fig9. Tap RAM access control



Fig10. Data RAM access control

#### (5) Finite state machine transition:



Fig11. FSM transition

#### 5. Resource Usage:

```
Start RTL Component Statistics
Detailed RTL Component Info :
+---Adders :
     2 Input
               32 Bit
                            Adders := 2
    2 Input
               12 Bit
                            Adders := 2
    2 Input
                            Adders := 1
               10 Bit
    2 Input
                4 Bit
                            Adders := 1
+---Registers :
                 32 Bit
                           Registers := 27
                 10 Bit
                           Registers := 1
                  4 Bit
                           Registers := 1
                  2 Bit
                           Registers := 1
                           Registers := 6
                  1 Bit
+---Multipliers
                32x32 Multipliers := 1
 ---Muxes :
    2 Input
               32 Bit
                             Muxes := 9
     2 Input
               12 Bit
                             Muxes := 3
    2 Input
                             Muxes := 1
                6 Bit
    3 Input
                4 Bit
                             Muxes := 1
    2 Input
                4 Bit
                             Muxes := 2
                             Muxes := 1
    3 Input
                2 Bit
     4 Input
                2 Bit
                             Muxes := 2
    2 Input
                1 Bit
                             Muxes := 6
     3 Input
                1 Bit
                             Muxes := 1
    13 Input
                1 Bit
                             Muxes := 11
Finished RTL Component Statistics
```

Fig12. Resource: one 32bit multiplier and one 32bit adder

| 1. Slice Logic        |      |       |            |           |       |
|-----------------------|------|-------|------------|-----------|-------|
| Site Type             | Used | Fixed | Prohibited | Available | Util% |
| Slice LUTs*           | 487  | 0     | 0          | 53200     | 0.92  |
| LUT as Logic          | 487  | 0     | 0          | 53200     | 0.92  |
| LUT as Memory         | 0    | 0     | 0          | 17400     | 0.00  |
| Slice Registers       | 887  | 0     | 0          | 106400    | 0.83  |
| Register as Flip Flop | 887  | 0     | 0          | 106400    | 0.83  |
| Register as Latch     | 0    | 0     | 0          | 106400    | 0.00  |
| F7 Muxes              | 0    | 0     | 0          | 26600     | 0.00  |
| F8 Muxes              | 0    | 0     | 0          | 13300     | 0.00  |

Fig13. Resource: LUT and FFs

# Memory

| +              | +        | ++<br>  Fived | Prohibited |               | ++               |
|----------------|----------|---------------|------------|---------------|------------------|
| +              | +        | rixeu  <br>++ |            | AVALLADIE<br> | ++               |
| Block RAM Tile | 0        | 0             | 0          | 140           | 0.00             |
| RAMB36/FIF0*   | 0<br>  0 | 0             | 0          | 140<br>  280  | 0.00  <br>  0.00 |
| +              | +        | ++            |            | +             | ++               |

Fig14. No use of BRAM in module

# 6. Timing Reports:

```
| Timing Details
From Clock: axis_clk
To Clock: axis_clk
                                       0 Failing Endpoints, Worst Slack
0 Failing Endpoints, Worst Slack
0 Failing Endpoints, Worst Slack
                                                                                                                               0.320ns, Total Violation
0.146ns, Total Violation
5.500ns, Total Violation
Setup :
                                                                                                                                                                                                     0.000ns
Hold :
                                                                                                                                                                                                     0.000ns
                                                                                                                                                                                                     0.000ns
                                                                                                                                                                                                        Clock period: 12ns
Max Delay Paths
Slack (MET) :
                                                     0.320ns (required time - arrival time)
                                                     cur_data_reg[16]/c|
(rising edge-triggered cell FDCE clocked by axis_clk {rise@0.000ns fall@6.000ns period=12.000ns})

prev_sum_reg[31]/D
(rising edge-triggered cell FDCE clocked by axis_clk {rise@0.000ns fall@6.000ns period=12.000ns})
    Source:
   Destination:
    Path Group:
                                                      axis_clk
       ath Group: axis_clk
ath Type: Setup (Max at Slow Process Corner)
equirement: 12.000ns (axis_clk rise@12.000ns - axis_clk rise@0.000ns)
ata Path Delay: 11.543ns (logic 8.633ns (74.787%) route 2.910ns (25.213%))
ogic Levels: 10 (CARRY4=5 DSP48E1=2 LUT2=2 LUT3=1)
lock Path Skew: -0.145ns (DCD - SCD + CPR)
Destination Clock Delay (DCD): 2.128ns = ( 14.128 - 12.000 )
Source Clock Delay (SCD): 2.456ns
Clock Pessimism Removal (CPR): 0.184ns
    Path Type:
    Requirement:
   Data Path Delay:
Logic Levels:
Clock Path Skew:
                                                     0.035ns ((TS3^2 + TIJ^2)^1/2 + DJ) / 2 + PE
(TSJ): 0.071ns
(TIJ): 0.000ns
    Clock Uncertainty:
       Total System Jitter
Total Input Jitter
        Discrete Jitter
Phase Error
                                                            (DJ):
(PE):
                                                                               0.000ns
                                                                               0.000ns
```

```
Incr(ns) Path(ns)
Location
                          Delay type
                                                                                       Netlist Resource(s)
                          (clock axis_clk rise edge)
                                                               0.000
                                                                            0.000 r
                                                               0.000
                                                                            0.000 r axis_clk (IN)
                                                                           0.000 axis_clk
r axis_clk_IBUF_inst/I
0.972 r axis_clk_IBUF_inst/O
1.771 axis_clk_IBUF
                          net (fo=0)
                                                               0.000
                          IBUF (Prop_ibuf_I_0)
net (fo=1, unplaced)
                                                               0.972
                                                               0.800
                                                                            r axis_clk_IBUF_BUFG_inst/I
1.872 r axis_clk_IBUF_BUFG_inst/O
                          BUFG (Prop_bufg_I_0)
                                                               0.101
                          net (fo=887, unplaced)
                                                                            2.456
                                                                                      axis_clk_IBUF_BUFG
                                                               0.584
                                                                                 r cur_data_reg[16]/C
```

```
FDCE (Prop_fdce_C_Q) 0.478 2.934 r cur_data_reg[16]/Q
                                        3.734 cur_data[16]
r temp__0/A[16]
                              0.800
net (fo=1, unplaced)
DSP48E1 (Prop_dsp48e1_A[16]_PCOUT[47])
                                         7.945 r temp__0/PCOUT[47]
                              4.211
                                        8.000 temp_0_n_106
r temp__1/PCIN[47]
net (fo=1, unplaced)
                              0.055
DSP48E1 (Prop_dsp48e1_PCIN[47]_P[0])
                                         9.518 r temp__1/P[0]
                              1.518
                                        10.318 temp_1_n_105
r sm_tdata_reg[19]_i_9/I0
net (fo=2, unplaced)
                              0.800
                                        10.442 r sm_tdata_reg[19]_i_9/0
10.442 sm_tdata_reg[19]_i_9_n_0
LUT2 (Prop_lut2_I0_0)
                              0.124
net (fo=1, unplaced)
                              0.000
                                              r sm_tdata_reg_reg[19]_i_6/S[1]
CARRY4 (Prop_carry4_S[1]_CO[3])
                              0.533
                                        10.975 r sm_tdata_reg_reg[19]_i_6/C0[3]
                                        10.984 sm_tdata_reg_reg[19]_i_6_n_0
r sm_tdata_reg_reg[23]_i_6/CI
net (fo=1, unplaced)
                              0.009
CARRY4 (Prop_carry4_CI_CO[3])
                              0.117
                                        11.101 r sm_tdata_reg_reg[23]_i_6/C0[3]
                                        11.101 sm_tdata_reg_reg[23]_i_6_n_0
r sm_tdata_reg_reg[27]_i_6/CI
net (fo=1, unplaced)
                              0.000
CARRY4 (Prop_carry4_CI_0[3])
                              0.331
                                        11.432 r sm_tdata_reg_reg[27]_i_6/0[3]
                                        12.050 temp_3[27]
r sm_tdata_reg[27]_i_2/I1
net (fo=1, unplaced)
                              0.618
LUT2 (Prop lut2 I1 0)
                              0.307
                                        12.357 r sm_tdata_reg[27]_i_2/0
                                        12.357 sm_tdata_reg[27]_i_2_n_0
net (fo=1, unplaced)
                              0.000
                                               r sm_tdata_reg_reg[27]_i_1/S[3]
CARRY4 (Prop_carry4_S[3]_CO[3])
                              0.376
                                        12.733 r sm_tdata_reg_reg[27]_i_1/CO[3]
                                        12.733 sm_tdata_reg_reg[27]_i_1_n_0
r sm_tdata_reg_reg[31]_i_1/CI
net (fo=1, unplaced)
                              0.000
CARRY4 (Prop_carry4_CI_0[3])
                              0.331
                                        13.064 r sm_tdata_reg_reg[31]_i_1/0[3]
net (fo=2, unplaced)
                              0.629
                                        13.693 cur_sum[31]
                                              r prev_sum[31]_i_1/I0
                                        14.000 r prev_sum[31]_i_1/0
14.000 prev_sum[31]_i_1_n_0
LUT3 (Prop_lut3_I0_0)
                              0.307
                                       14.000
net (fo=1, unplaced)
                              0.000
FDCE
                                             r prev_sum_reg[31]/D
 (clock axis_clk rise edge)
                                12.000
                                           12.000 r
                                 0.000
                                           12.000 r axis_clk (IN)
                                         12.000 axis_clk
 net (fo=0)
                                 0.000
                                                  r axis_clk_IBUF_inst/I
 IBUF (Prop_ibuf_I_0)
                                          12.838 r axis_clk_IBUF_inst/0
                                 0.838
                                0.760 13.598 axis_clk_IBUF
r axis_clk_IBUF_BUFG_inst/I
 net (fo=1, unplaced)
                                          13.689 r axis_clk_IBUF_BUFG_inst/0
 BUFG (Prop_bufg_I_0)
                                 0.091
 net (fo=887, unplaced)
                                 0.439
                                           14.128
                                                      axis_clk_IBUF_BUFG
 FDCE
                                                  r prev_sum_reg[31]/C
 clock pessimism
                                0.184
                                           14.311
 clock uncertainty
                                -0.035
                                           14.276
 FDCE (Setup_fdce_C_D)
                                0.044
                                          14.320
                                                      prev_sum_reg[31]
required time
                                           14.320
 arrival time
                                          -14.000
```

Fig15. Critical path

0.320

slack