# Introduction to Synthesis Flow with Synopsys Design Compiler

Lecturer: Heng-Yu, Liu



## **Outline**

### **✓** Section 1 Design Compiler Introduction

- Section 2 Basic Synthesis Flow
  - Develop HDL files
  - Specify libraries
  - Read design
  - Develop design environment
  - Set design constraints
  - Select compile strategy
  - Optimize the design
  - Analyze and resolve design problems
  - Save the design database
- Section 3 Generate & For Loop



## Review: IC design flow





## **Outline**

#### **✓** Section 1 Design Compiler Introduction

- ✓ Section 2 Basic Synthesis Flow
  - Develop HDL files
  - Specify libraries
  - Read design
  - Develop design environment
  - Set design constraints
  - Select compile strategy
  - Optimize the design
  - Analyze and resolve design problems
  - Save the design database
- ✓ Section 3 Generate & For Loop



## **Design Compiler Introduction**

### Design compiler

- It synthesizes your HDL designs (Verilog) into optimized technology-dependent(D35,U18...), gate-level designs.
- It can optimize both combinational and sequential designs for speed, area, and power.

```
RTL(.v) Netlist(.v)
```

No always block or assign



## Logic Synthesis

#### ✓ Logic synthesis

- A process by which behavioral model of a circuit is turned into an implementation in terms of logic gates
- Synthesis = Translation + Mapping + Optimization





## Data Flow in Design Compiler



## **Gate-level Simulation**





## Design Compiler Introduction (cont.)

#### Design compiler provides two user interfaces

command-line interface

 dctcl mode: Applying a script based on tool command language (tcl) which packages all the commands needed to implement specific Design Compiler functionality.

%dc\_shell-t

#### **Used** in this course

- graphical user interfaces (GUI)
  - Design Vision (can execute in both modes) %dv
  - %dc\_shell> gui\_start
  - %dc\_shell> gui\_stop



## Some Design Compiler comment

- Get topic command help
  - %dc\_shell> help
- Get a particular command help
  - %dc\_shell> command\_name -help
- Get topic command help
  - %dc\_shell> man command\_name
    - ex: man set
- ✓ More information about design compiler in user guide:
  - https://picture.iczhiku.com/resource/eetop/wYKfeQTQHSRITVVC.pdf

## Synthesize by Design Compiler

- ✓ Prepare RTL code(.v) and script file(.tcl)
- **✓** Invoke design compiler
  - Command :%dc\_shell-t -f script\_filename.tcl | tee log\_filename.log
  - Example : %dc\_shell-t -f syn.tcl | tee syn.log → ./01\_run\_dc
  - -f :format
  - tee: store the result into file and print on the screen
- Check log file(.log) to see if there are error(ex: Latch) messages
- Check report file(.report) to see if timing or area are met
- ✓ Run gate-level simulation



## System Object (Gate level)

```
design
module TOP(in1,in2,clk,out);
                                     clock
    input in1, in2, clk;
    output [1:0] out;
                                    port
                                                               net
    wire inv0, inv1, bus0, bus1;
    ADDER U_ADD1(.AIN(in1), .BIN(in2), .Q0(bus0), .Q1(bu1));
INV U_INV1(.A(bus0), .Z(inv0));
INV U INV2(.A(bus1), .Z(inv1));
                                                     niq
REGFILE U_REG(.D0(inv0),.D1(inv1),.CLK(clk), Q(OUT));
                               reference/design
endmodule
                                   Design Objects
    Design
             : A circuit that performs one or more logical functions
    Cell
             : An instantiation of a design within another design
    Reference: The original design that a cell "point to"
             : The input or output of a design
    Port
             : The input or output of a cell
    Pin
                                                            No always block or assign
             : The wire that connects ports or pins
    Net
             : Waveform applied to a port or pin identified as a clock source
    Clock
```

11

#### SDF File

#### ✓ Standard Delay Format file (.sdf file)

```
(DELAYFILE
(SDFVERSION "OVI 2.1" design
(DESIGN "CP")
(DATE "Wed Feb 26 18:12:26 2020")
(VENDOR "slow")
(PROGRAM "Synopsys Design Compiler cmos")
(VERSION "K-2015.06-SP1")
(DIVIDER /)
(VOLTAGE 1.62:1.62:1.62)
(PROCESS "slow")
(TEMPERATURE 125.00:125.00:125.00)
(TIMESCALE 1ns)
(CELL
  (CELLTYPE "ADDHXL")
                                       rising edge transition
                                                                        falling edge transition
  (INSTANCE U9469)
              min: typ: max
  (DELAY
    (ABSOLUTE
    (IOPATH A CO (0.218262:0.218911:0.218911) (0.221926:0.224530:0.224530
    (IOPATH B CO (0.218498:0.219461:0.219461) (0.207622:0.211288:0.211288))
    (COND B == 1'b1 (IOPATH A S (0.298010:0.300566:0.300566) (0.206581:0.207653:0.207653)))
    (COND B == 1'b0 (IOPATH A S (0.355384:0.356453:0.356453) (0.336261:0.338810:0.338810)))
    (IOPATH (posedge A) S (0.355384:0.356453:0.356453) (0.336556:0.337945:0.337945))
    (IOPATH (negedge A) S (0.355157:0.357118:0.357118) (0.336261:0.338810:0.338810))
    (COND A == 1'b1 (IOPATH B S (0.235169:0.238284:0.238284) (0.141208:0.142291:0.142291)))
    (COND A == 1'b0 (IOPATH B S (0.207619:0.208926:0.208926) (0.230646:0.234490:0.234490)))
    (IOPATH (posedge B) S (0.238927:0.240244:0.240244) (0.235283:0.236908:0.236908))
    (IOPATH (negedge B) S (0.235169:0.238284:0.238284) (0.230646:0.234490:0.234490))
```

## **Outline**

### **✓** Section 1 Design Compiler Introduction

#### Section 2 Basic Synthesis Flow

- Develop HDL files
- Specify libraries
- Read design
- Develop design environment
- Set design constraints
- Select compile strategy
- Optimize the design
- Analyze and resolve design problems
- Save the design database

More details in page10 pdf

✓ Section 3 Generate & For Loop















## **Specify Libraries**

#### Specify libraries

- 1)Synthetic Library
  - Specifies additional DesignWare libraries for optimization purposes.
  - Efficient implementations for adders, comparators, multipliers
- **(2)**Link Library
  - Specifies a list of libraries that Design Compiler can use to resolve design references.
- Target Library
  - During synthesis, compiler selects gates from target library.
  - It also calculates the timing of the circuit, using the vendor-supplied (UMC, TSMC ...) timing data of the lib.

### Synthesize in worst case. (xx.sldb)

- set synthetic\_library {dw\_foundation.sldb}
- set link\_library {\* dw\_foundation.sldb standard.sldb slow.db}
- set target\_library {slow.db}



#### .lib file

- Cell name
- Drive strength
- ✓ Area
- ✓ Pin
  - Internal delay
  - Leakage power
  - Internal power

```
input_trasition_time

A1

A2

Out_capacitance
```

```
cell (NANDX1) {
  pin(A1) {
    direction : input;
                                        Same information as .db file
    capacitance : 0.00683597;
  pin(A2) {
    direction : input;
    capacitance : 0.00798456;
  pin(ZN) {
    direction : output;
    capacitance : 0.0;
    internal power() {
     timing() {
                                                             out_capacitance
       cell_rise(table10){
         values ("0.020844,0.02431,0.030696,0.039694,0.048205,0.072168,0.10188",\
                  "0.024677,0.027942,0.035042,0.045467,0.054973,0.082349,0.11539",\
                  "0.032068,0.035394,0.042758,0.055361,0.065991,0.090936,0.13847",\
                  "0.046811,0.049968,0.057164,0.064754,0.086481,0.11676,0.15744",\
                  "0.073919,0.078805,0.080873,0.091007,0.11655,0.1579,0.21448",\
                  "0.13162,0.13363,0.1383,0.14793,0.1685,0.22032,0.30054",\
                 *"0.24661,0.24835,0.25294,0.26221,0.282,0.32417,0.42783");
 input trasition time
```

```
lu_table_template(table10){
  variable_1 : total_output_net_capacitance;
  variable_2 : input_transition_time;
  index_1 ("0.001400,0.003000,0.006200,0.012500,0.025100,0.050400,0.101000");
  index_2 ("0.0208,0.0336,0.06,0.1112,0.2136,0.4192,0.8304");
}
```





## Read Design

#### Read design

 The Design Compiler optimization process works only on the designs loaded in memory.

Method 1: analyze & elaborate

Method 2: read\_file · read\_verilog · read\_sverilog

(Please refer to the appendix for the detail usage of them)

#### Setting the Current Design

 After the read file command, set the current design on the module you want to focus on.

Method 2 Method 1

#------ analyze elaborate method -----## this method automatically link , so we don't have to add link command
analyze -f verilog \$DESIGN\.v
analyze -f verilog GENVAR\_PRAC.v
elaborate \$DESIGN
current\_design \$DESIGN



Note: read\_sverilog is compatible to read the Verilog format

## Read Design

|                     | read_file                                                                        | analyze and elaborate                                                                                                                                  |  |  |  |
|---------------------|----------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|
| Input<br>format     | All formats                                                                      | Synthesize VHDL or Verilog  Allow user to set parameter values on the elaborate command line  Can store analyzed results in specified design libraries |  |  |  |
| When to use         | Netlist, precompiled designs                                                     |                                                                                                                                                        |  |  |  |
| Generics            | Cannot pass parameters (must use directives in HDL)                              |                                                                                                                                                        |  |  |  |
| Design<br>libraries | Cannot store analyzed results                                                    |                                                                                                                                                        |  |  |  |
| Commands            | dc_shell>read_file -f keyword file_name e.g.: dc_shell>read_file -f verilog name | dc_shell>analyze -f keyword file_name<br>dc_shell>elaborate design_name                                                                                |  |  |  |
| Architecture        | Cannot specify architecture to be elaborated                                     | Allow user to specify architecture to be elaborated                                                                                                    |  |  |  |





## **Define Design Environment**

#### Design Environment

 Define the environment in which the design is expected to operate in by specifying operating conditions, wire load models, and system interface characteristics.



**Commands Used to Define the Design Environment** 



## Define Design Environment (Operating Conditions)

### Defining the Operating Conditions (PVT Variations)

- An operating condition is defined as a combination of Process,
   Voltage, Temperature(PVT).
- There are three kinds of manufacturing process models that are provided by the semiconductor foundry for digital designs: slow process model, typical process model, fast process model.
- For robust design, the design should be validated at the extreme corners (slow, fast process model) of the manufacturing process at last.
- Generally, for synthesis stage, we use worst-case to ensure that the timing (setup time) is met under strict constraints.





## Define Design Environment (Wire Load Models)

#### **✓** Defining Wire Load Models

 Before layout(APR), wire load models can be used to estimate capacitance, resistance and the area overhead due to interconnection.





e.g. set\_wire\_load\_model -name "umc18\_wl50"

## Define Design Environment (Wire Load Models)

✓ Design Compiler uses physical values to calculate wire delays and circuit speeds, there are three types of wire load mode.
set wire load mode top

#### Top

 Uses the wire load model specified for the top level of the design hierarchy for all nets

#### Enclosed

 Uses the wire load model of the smallest module that fully encloses the net

#### Segmented

 Each segment of the net gets its wire load model from the block that encompasses the net segment



## Define Design Environment (System Interface)

#### **✓** Modeling the System Interface

- Design Compiler supports the following ways to model the design's interaction with the external system:
  - Defining drive characteristics for input ports
  - Defining fan out loads on output ports
  - Defining loads on input and output ports





## Define Design Environment (System Interface)

### e.g. defining loads on input and output ports

- By default, it assumes zero capacitive load on input and output ports.
- Use the set\_load command to set a capacitive load value on input and output ports of the design.
- Example : Set a load of 0.05 picofarads on all output ports, set a drive of 1.5 kilo-ohms on all input ports.

set\_drive 1.5 [all\_inputs]

set\_load 0.05 [all\_outputs]





## Select Design Object



Example:

[get\_ports \*] : in1, in2 ,out, clk1

[get\_designs \*] : top

[get\_pins \*] : AIN,BIN,Q0,Q1, D0,D1,Q,CLK

[get\_pins U\_REG/\*] : D0,D1,CLK,Q

[get\_cells \*] : U\_ADD1, U\_REG, U\_INV1,U\_INV2

[get\_clocks \*] : clk1

[get nets \*] : bus0,bus1,inv0,inv1

Frequently Used:

[all inputs] : in1,in2,clk1

[all\_outputs] : out

[all\_ports] : in1,in2,clk1,out Reserved Word: clk, rst n





## Set Design Constraints

#### **✓** Constraint Priorities: (default)

|   | Priority (descending order) | Notes                   |              |  |
|---|-----------------------------|-------------------------|--------------|--|
|   | connection classes          |                         |              |  |
|   | multiple_port_net_cost      |                         |              |  |
|   | min_capacitance             | Design rule constraint  |              |  |
|   | max_transition              | Design rule constraint  |              |  |
|   | max_fanout                  | Design rule constraint  | Design rule  |  |
|   | max_capacitance             | Design rule constraint  | constraint   |  |
| 1 | cell_degradation            | Design rule constraint  |              |  |
| 1 | max_delay                   | Optimization constraint |              |  |
|   | min_delay                   | Optimization constraint | Optimization |  |
| l | power                       | Optimization constraint | constraint   |  |
|   | area                        | Optimization constraint |              |  |
|   | cell count                  |                         |              |  |

## Set Design Constraints<sub>(cont.)</sub>

## ✓ There are two categories of design constraints Design rule constraints

- Design rule constraints reflect technology-specific restrictions your design must meet in order to function as intended.
- Most technology libraries specify default design rules.
- You can apply more restrictive design rules, but you cannot apply less restrictive ones.

#### **Design optimization constraints**

- User defines speed(timing) and area optimization goals for Design Compiler.
- Speed(timing) constraints have higher priority than area.(The priority can be changed)
- Optimization constraints are secondary to design rule constraints.



## Set Design Constraints<sub>(cont.)</sub>

#### **✓** Design Constraints

- Design rule constraints
  - set\_max\_capacitance
  - set\_max\_transition
  - set\_fanout\_load
  - set\_max\_fanout
- Design optimization constraints



## **Design Rule Constraints**

#### Maximum capacitance

- dc\_shell>set\_max\_capacitance cap\_value port\_list
- It is set as a pin-level attribute that defines the maximum total capacitive load that an output pin can drive.

#### Maximum transition

- dc\_shell>set\_max\_transition trans\_value port\_list
- The maximum transition time for a net is the longest time required for its driving pin to change logic values.





## Design Rule Constraints<sub>(cont.)</sub>

#### ✓ Fan-out load

 Fan-out load is a dimensionless number, not a capacitance. It represents a numerical contribution to the total effective fan-out.

#### Maximum fan-out :

- The maximum fan-out load for a net is the maximum number of loads the net can drive.
- If a library fan-out constraint exists and a max\_fanout attribute is specified, Design Compiler tries to meet the more restrictive value.



## Design Rule Constraints<sub>(cont.)</sub>



- To check whether the maximum fanout constraint is met for driving pin Z, Design Compiler compares the specified max\_fanout attribute against the fanout load.
- In this case, the design constraint is met.

Total Fanout Load 8 ≥ 1.0 + 1.0 + 3.0 + 2.0 7



## Design Rule Constraints<sub>(cont.)</sub>

#### ✓ In some cases, the nets should be set to ideal.

- Nets that are assigned ideal timing conditions—that is, latency, transition time, and capacitance are assigned a value of zero..
- Such nets are exempt from timing updates, delay optimization, and design rule fixing.
- set\_ideal\_network net\_list
- e.g. set\_ideal\_network {clk}

## Set Design Constraints<sub>(cont.)</sub>

#### **✓** Design Constraints

- Design rule constraints
- Design optimization constraints
  - create\_clock
  - set\_clock\_latency
  - set\_clock\_uncertainty
  - set\_propagated\_clock
  - set\_input\_delay
  - set\_output\_delay
  - set\_max\_delay
  - set\_fix\_delay
  - set\_false\_path



### **Design Optimization Constraints**

#### create\_clock

Defines the period and waveform for the clock
 Syntax :

```
create_clock -name "clk name for tcl" source_objects \
    -period period_value -waveform { rise fall }
```

- source\_objects : A list of pins or ports on which to apply this clock.
- -period period\_value : the period of the clock waveform in library time units(ns).
- -waveform option: set the rising edge time and the falling edge time. If you do not specify the clock waveform, default waveform is a 50 percent duty cycle.

```
Example:
```

```
set CYCLE 10.0
create_clock -name "clk" [get_ports clk1] -period $CYCLE
```



## Design Optimization Constraints<sub>(cont.)</sub>

- Design Compiler treats clock networks as ideal (having no delay) by default.
- ✓ You can override the default behavior to obtain nonzero clock network delay and specify information about the clock network delays, whether the skews are predicted or actual.
- ✓ dc\_shell>set\_clock\_latency \* (more details in Chapter 7)
  - Define the delay from CLK to the register
- ✓ dc\_shell>set\_clock\_uncertainty \* (more details in Chapter 7)
  - Used to model various factors that can reduce the effective clock period.
- dc\_shell>set\_propagated\_clock \* (more details in Chapter 7)
  - Specify the clock latency be propagated throughout the clock network



### **Static Timing Analysis**





### **Constraining Input Paths**

Specify the arrival time at the input ports of the design.



## Constraining Input Paths (cont.)

✓ e.g.



**DUA (Design Under Analysis)** 

- Input delay =  $(T_{clk2q} + T_{C1})$
- clock cycle  $\geq (T_{clk2q} + T_{C1}) + T_{C2} + T_{Setup}$
- dc\_shell>set T<sub>clk2q</sub> 0.9
- dc\_shell>set T<sub>C1</sub> 0.6
- dc\_shell>set\_input\_delay -clock CLKA -max [expr \$T<sub>clk2q</sub>+\$T<sub>C1</sub>] [all\_inputs]

```
set CYCLE 7.0
set_input_delay [ expr $CYCLE*0.5 ] -clock clk [all_inputs]
```



### **Constraining Output Paths**

You specify the path required time at the output ports of the design.





## Constraining Output Paths (cont.)

✓ e.g.



**DUA (Design Under Analysis)** 

- Output delay =  $(T_{C2} + T_{setup})$
- clock cycle  $\geq T_{clk2q} + T_{C1} + (T_{C2} + T_{setup})$
- dc\_shell>set T<sub>C2</sub> 3.9
- dc\_shell>set T<sub>setup</sub> 1.1
- dc\_shell>set\_output\_delay -clock CLKQ -max [expr \$T<sub>C2</sub>+\$T<sub>setup</sub>] [all\_outputs]

```
set CYCLE 7.0
set_output_delay [ expr $CYCLE*0.5 ] -clock clk [all_outputs]
```



### **Combinational Design Constraint**



#### Syntax:

```
set_max_delay max_delay_value -from object -to object
!!! This only applies for Combinational Circuit Only !!!
```

#### **Example:**

```
set_max_delay $MAX_Delay -from [all_inputs] -to [all_outputs]
```

### Design Optimization Constraints<sub>(cont.)</sub>

# Design Compiler fix hold violations at register during compilation

- Set\_fix\_hold informs compile that hold time violations of the specified clocks should be fixed.
- To fix a hold violation requires slowing down data signals.
- Design Compiler considers the minimum delay cost only if the set\_fix\_hold command is used.
- Generally, fixing and optimizing setup time violation are more important than hold time violation before CTS.

```
- Syntax : set_fix_hold clock_list
e.g. :
The following command sets a fix_hold attribute on clock "clk1".
dc_shell> set_fix_hold clk1
```



### **Specify False Path**

# Remove timing constraints from particular path

- Make compiler to ignore paths that never occur in normal operation
- Timing checks of false path will be disabled. Therefore, use this command carefully



### **Basic Synthesis Flow**





### Select Compile Strategy

- ✓ You can use various strategies to compile your hierarchical design.
- **✓ The basic strategies are** 
  - Top-down compile, in which the top-level design and all its subdesigns are compiled together.
  - Bottom-up compile, in which the individual subdesigns are compiled separately, starting from the bottom of the hierarchy and proceeding up through the levels of the hierarchy until the top-level design is compiled.
  - Mixed compile, in which the top-down or bottom-up strategy, whichever is most appropriate, is applied to the individual subdesigns.



### **Basic Synthesis Flow**





### Multiple Instances of a Design Reference

#### Multiple instances of a design reference

- In a hierarchical design, subdesigns are often referenced by more than one cell instance, that is, multiple references of the design can occur.
- Ex: design C is referenced twice (U2/U3 and U2/U4).



## Use any of the following methods resolve multiple instances before running the compile command

- The uniquify method
- The compile-once-dont-touch method
- The ungroup method



### **Uniquify Method**

#### Uniquify (default)

- The command is to duplicate and rename the multiple referenced design so that each instance references a unique design.
  - Requires more memory
  - Takes longer to compile
  - "uniquify" may useless with "compile\_ultra"
- dc\_shell> uniquifydc\_shell> compile



Top-level Design With Instances

Design Compiler Memory: Loaded Designs



## **Uniquify Method**<sub>(cont.)</sub>

#### Example:

```
module ch6 ex(input clk, INF.DESIGN inf);
adder 1b add1(.clk(clk),.rst n(inf.rst n),.x1(inf.x1),.x2(inf.x2),.Sum(inf.Sum1));
adder 1b add2(.clk(clk),.rst n(inf.rst n),.x1(inf.x1),.x2(inf.x2),.Sum(inf.Sum2));
adder_lb add3(.clk(clk),.rst_n(inf.rst_n),.x1(inf.x1),.x2(inf.x2),.Sum(inf.Sum3));
endmodule :ch6 ex
module adder 1b ( clk, rst n, x1, x2, Sum );
  output [1:0] Sum;
  input clk, rst n, x1, x2;
 wire N1, N2;
 DFFTRX1 Sum reg 1 ( .D(N2), .RN(rst n), .CK(clk), .Q(Sum[1]));
 DFFTRX1 Sum req \theta ( .D(N1), .RN(rst n), .CK(clk), .Q(Sum[\theta]));
 XOR2X2 U3 ( .A(x2), .B(x1), .Y(N1) );
  AND2X2 U4 ( .A(x1), .B(x2), .Y(N2) );
endmodule
//module adder 1b(
// input clk,
// input logic rst n,
// input logic x1,
// input logic x2,
// output logic [1:0] Sum
                                             script(.tcl)
//);
//
//always_ff@(posedge clk)
// if(!rst n)
                                         Optimization
      Sum \leq 0;
                                      uniquify
      Sum \leq x1+x2;
//end
                                      #ungroup {add1 add2}
//endmodule :adder 1b
                                      compile
```

```
#set dont touch {add1 add2}
```

```
module ch6_ex ( clk, inf_rst_n, inf_x1, inf_x2, inf_Sum1, inf_Sum2, inf_Sum3
 output [1:0] inf Suml;
 output [1:0] inf Sum2;
 output [1:0] inf Sum3;
 input clk, inf rst n, inf x1, inf x2;
 adder 1b 2 add1 ( .clk(clk), .rst n(inf rst n), .x1(inf x1), .x2(inf x2),
         sum(inf Suml) );
 @dder_lb_lDadd2( .clk(clk), .rst_n(inf_rst_n), .x1(inf_x1), .x2(inf_x2),
        Sum(inf_Sum2) );
 @dder 1b 0 add3 ( .clk(clk), .rst n(inf rst n), .x1(inf x1), .x2(inf x2),
       .Sum(inf Sum3) ):
endmodule
```

```
module adder 1b 1 ( clk, rst n, x1, x2, Sum );
 output [1:0] Sum;
 input clk, rst n, x1, x2;
 wire N2, N1;
 DFFTRX1 Sum_reg_1 ( .D(N2), .RN(rst_n), .CK(clk), .Q(Sum[1]) );
 DFFTRX1 Sum reg 0 ( .D(N1), .RN(rst n), .CK(clk), .Q(Sum[0]) );
 XOR2X1 U1 ( .A(x2), .B(x1), .Y(N1) );
 AND2X1 U2 ( .A(x1), .B(x2), .Y(N2) );
endmodule
```

```
module adder 1b 0 ( clk, rst n, x1, x2, Sum );
 output [1:0] Sum;
 input clk, rst_n, x1, x2;
 wire N2, N1;
 DFFTRX1 Sum req 1 (.D(N2), .RN(rst n), .CK(clk), .Q(Sum[1]));
 DFFTRX1 Sum_reg_0 ( .D(N1), .RN(rst_n), .CK(clk), .Q(Sum[0]) );
 XOR2X1 U1 ( .A(x2), .B(x1), .Y(N1) );
 AND2X1 U2 ( .A(x1), .B(x2), .Y(N2) );
endmodule
```

```
module adder_1b_2 ( clk, rst_n, x1, x2, Sum );
 output [1:0] Sum;
 input clk, rst n, x1, x2;
 wire N2, N1;
 DFFTRX1 Sum req 1 ( .D(N2), .RN(rst n), .CK(clk), .Q(Sum[1]) );
 DFFTRX1 Sum_reg_0 ( .D(N1), .RN(rst_n), .CK(clk), .Q(Sum[0]) );
 XOR2X1 U1 ( .A(x2), .B(x1), .Y(N1) );
 AND2X1 U2 ( .A(x1), .B(x2), .Y(N2) );
endmodule
```



### Compile-Once-Don't-Touch Method

#### set\_dont\_touch

- It places the dont\_touch attribute on cells, nets, references, and designs in the current design to prevent these objects from being modified or replaced during optimization.
  - Compiles the reference design once
  - Requires less memory and less time than the uniquify method
- dc\_shell> set\_dont\_touch {U2/U3 U2/U4}





## Compile-Once-Don't-Touch Method<sub>(cont.)</sub>

Example: {add1 add2} are compiled by using the environment of one of its instances. In this case, no copies of the original sub-design are loaded into memory when running this command sequence.

```
module ch6 ex(input clk, INF.DESIGN inf);
adder_lb addl(.clk(clk),.rst_n(inf.rst_n),.xl(inf.xl),.x2(inf.x2),.Sum(inf.Suml));
adder lb add2(.clk(clk),.rst n(inf.rst n),.x1(inf.x1),.x2(inf.x2),.Sum(inf.Sum2));
adder 1b add3(.clk(clk),.rst n(inf.rst n),.x1(inf.x1),.x2(inf.x2),.Sum(inf.Sum3));
endmodule :ch6 ex
module adder 1b ( clk, rst n, x1, x2, Sum );
 output [1:0] Sum;
 input clk, rst_n, x1, x2;
 wire N1, N2;
 DFFTRX1 Sum req 1 ( .D(N2), .RN(rst n), .CK(clk), .Q(Sum[1]) );
 DFFTRX1 Sum reg \theta ( .D(N1), .RN(rst n), .CK(clk), .Q(Sum[\theta]) );
 XOR2X2 U3 (.A(x2), .B(x1), .Y(N1));
 AND2X2 U4 ( .A(x1), .B(x2), .Y(N2) );
endmodule
//module adder 1b(
// input clk,
// input logic rst n,
// input logic x1,
// input logic x2.
// output logic [1:0] Sum
                                             script(.tcl)
//always ff@(posedge clk)
//begin
                                          Optimization
// if(!rst n)
     Sum \leq 0;
     Sum \leq x1+x2:
                                      set dont touch {addl add2
//end
                                      #ungroup {addl add2}
                                      compile
//endmodule :adder 1b
```

```
module ch6 ex (clk, inf rst n, inf x1, inf x2, inf Sum1, inf Sum2, inf Sum3
  output [1:0] inf Sum1;
  output [1:0] inf Sum2;
  output [1:0] inf Sum3;
  input clk, inf rst n, inf x1, inf x2;
  adder 1b add1 \ .clk(clk), .rst n(inf rst n), .x1(inf x1), .x2(inf x2),
        .Sum(inf Sum1) );
  adder 1b add2 \sqrt{.clk(clk)}, .rst n(inf rst n), .x1(inf x1), .x2(inf x2),
        Sum(inf Sum2) );
 adder 1b 0 add3 ( .clk(clk), .rst n(inf rst n), .x1(inf x1), .x2(inf x2),
       .sum(inf Sum3) );
endmodule
```

```
module adder 1b 0 ( clk, rst n, x1, x2, Sum );
 output [1:0] Sum;
 input clk, rst_n, x1, x2;
 wire N2, N1:
 DFFTRX1 Sum reg 1 ( .D(N2), .RN(rst n), .CK(clk), .Q(Sum[1]) );
 DFFTRX1 Sum reg 0 ( .D(N1), .RN(rst n), .CK(clk), .Q(Sum[0]) );
 XOR2X1 U1 ( .A(x2), .B(x1), .Y(N1) );
 AND2X1 U2 ( .A(x1), .B(x2), .Y(N2) );
endmodule.
```

```
module adder 1b ( clk, rst_n, x1, x2, Sum );
  output [1:0] Sum;
  input clk, rst n, x1, x2;
 wire N2, N1;
 DFFTRX1 Sum reg 1 ( .D(N2), .RN(rst n), .CK(clk), .Q(Sum[1]));
 DFFTRX1 Sum reg_0 ( .D(N1), .RN(rst_n), .CK(clk), .Q(Sum[0]) );
 XOR2X2 U3 (.A(x2), .B(x1), .Y(N1));
  AND 2X2 U4 ( .A(x1), .B(x2), .Y(N2) );
endmodule
          Netlist(.v
```





### **Ungroup Method**

#### **✓** Ungroup

- use the ungroup command to ungroup one or more designs before optimization
  - Requires more memory and takes longer to compile than the compile-once-don't-touch method
  - Provides the best synthesis results
  - May increase the difficulty for ECO(Engineering change order).
- dc\_shell> current\_design B dc\_shell> ungroup {U3 U4} dc\_shell> current\_design top dc\_shell> compile





## Ungroup Method<sub>(cont.)</sub>

Example: the following command sequence uses the ungroup method to resolve the multiple instances of {add1 add2} except for add3.

```
module ch6 ex ( clk, inf rst n, inf x1, inf x2, inf Sum1, inf Sum2, inf Sum3
module ch6 ex(input clk, INF.DESIGN inf);
                                                                                  output [1:0] inf Sum1;
adder_lb addl(.clk(clk),.rst_n(inf.rst_n),.xl(inf.xl),.x2(inf.x2),.Sum(inf.Suml));
                                                                                  output [1:0] inf Sum2;
adder lb add2(.clk(clk),.rst n(inf.rst n),.xl(inf.xl),.x2(inf.x2),.Sum(inf.Sum2));
                                                                                  output [1:0] inf Sum3;
adder_lb add3(.clk(clk),.rst_n(inf.rst_n),.x1(inf.x1),.x2(inf.x2),.Sum(inf.Sum3));
                                                                                  input clk, inf rst n, inf x1, inf x2;
                                                                                          add2 N1, add2 N2;
endmodule :ch6 ex
module adder_1b ( clk, rst_n, x1, x2, Sum );
  output [1:0] Sum;
                                                                                          Sum(inf Sum3) )
  input clk, rst n, x1, x2;
  wire N1, N2;
                                                                                         inf Sum1[0]);
 DFFTRX1 Sum_reg_1 ( .D(N2), .RN(rst_n), .CK(clk), .Q(Sum[1]) );
                                                                                         inf Sum1[1]) );
 DFFTRX1 Sum_reg_0 ( .D(N1), .RN(rst_n), .CK(clk), .Q(Sum[0]) );
 XOR2X2 U3 (.A(x2), .B(x1), .Y(N1));
 AND2X2 U4 ( .A(x1), .B(x2), .Y(N2) );
                                                                                         inf Sum2[0]) );
endmodule
                                                                                         inf Sum2[1]) );
//module adder 1b(
// input clk,
// input logic rst n,
                                                script(.tcl)
// input logic x1,
                                                                                endmodu Le
// input logic x2,
// output logic [1:0] Sum
//);
                                                                                module adder_1b ( clk, rst_n, x1, x2, Sum );
                                               Optimization
//always ff@(posedge clk)
                                                                                  output [1:0] Sum;
//begin
                                                                                  input clk, rst n, x1, x2;
// if(!rst n)
                                            #uniquity
                                                                                  wire N2, N1;
     Sum \leq 0;
                                                               {2dd1 add2}
                                           ungroup {add1 add2}
     Sum \leq x1+x2;
//end
                                            compile
                                                                                  XOR2X1 U1 (.A(x2), .B(x1), .Y(N1));
//endmodule :adder 1b
                                                                                  AND2X1 U2 ( .A(x1), .B(x2), .Y(N2) );
```

```
adder 1b add3 ( .clk(clk), .rst n(inf rst n), .x1(inf x1), .x2(inf x2),
 DFFTRX1 add1 Sum reg 0 ( .D(add2 N1), .RN(inf rst n), .CK(clk), .Q(
 DFFTRX1 add1 Sum reg 1 ( .D(add2 N2), .RN(inf rst n), .CK(clk), .Q(
 DFFTRX1 add2 Sum req 0 ( .D(add2 N1), .RN(inf_rst_n), .CK(clk), .Q(
 DFFTRX1 add2 Sum reg_1 ( .D(add2_N2), .RN(inf_rst_n), .CK(clk), .Q(
 XOR2X1 U3 ( .A(inf x1), .B(inf x2), .Y(add2 N1) );
 AND2X1 U4 ( .A(\inf x2), .B(\inf x1), .Y(add2_N2));
 DFFTRX1 Sum_reg_1 ( .D(N2), .RN(rst_n), .CK(clk), .Q(Sum[1]) );
 DFFTRX1 Sum reg 0 ( .D(N1), .RN(rst n), .CK(clk), .Q(Sum[0]) );
endmodule
```



RTL(.v) ------Netlist(.v)

### Compile

- Default synthesis algorithm
  - dc\_shell> compile
- Advanced synthesis algorithm
  - dc\_shell> compile\_ultra
    - Automatically ungroups logical hierarchies
    - High-performance design
    - Maximum performance
    - Minimum area
    - Data path
- ✓ More information
  - Refer to "Design Compiler Optimization Reference Manual"

### **Basic Synthesis Flow**





### Report Analysis

#### Constraint report

- Syntax : report\_constraint [-all\_violators] [-verbose]
  - -all\_violators: Displays a summary of all of the optimization and design-rule constraints with violations in the current design.
  - -verbose : Indicates to show more detail about constraint

calculations

#### ✓ Area report

– dc\_shell-t> report\_area

| Constraint      | Cost       |
|-----------------|------------|
| max_transition  | 0.00 (MET) |
| max_capacitance | 0.00 (MET) |
| max_delay/setup | 0.00 (MET) |
| critical_range  | 0.00 (MET) |

Report : area Design : DAG Version: 2003.06

Date: Sun Oct 10 15:59:26 2004

\*\*\*\*\*\*\*\*\*\*\*\*

Library(s) Used:

slow (File: /RAID/Manager/lib.18/SynopsysDC/slow.db)

Number of ports: 40 Number of nets: 220 Number of cells: 160 Number of references: 21

Combinational area: 3775.465576 Noncombinational area: 2471.515869 Net Interconnect area: 2.482625

Total cell area: 6246.979492



Total area: 6249.463867

### Report Analysis (cont.)

#### report\_timing

\*\*\*\*\*\*\*\*\*\*\*\*

Report: timing
-path full
-delay max
-max\_paths 1
Design: DAG
Version: 2003.06

Date : Sun Oct 10 16:06:19 2004

\*\*\*\*\*\*\*\*\*\*\*

Operating Conditions: slow Library: slow Wire Load Model Mode: segmented

Startpoint: COUNT\_LOAD\_reg[1]

(rising edge-triggered flip-flop clocked by CLK)

Endpoint: ADDR\_reg[0]

(rising edge-triggered flip-flop clocked by CLK)

Path Group: CLK

Path Type: max Maximum delay analysis

| Point                        | Incr  | Path    |        |
|------------------------------|-------|---------|--------|
| clock CLK (rise edge)        | 0.00  | 0.00    |        |
| clock network delay (ideal)  | 0.00  | 0.00    |        |
| COUNT LOAD reg[1]/CK (D      | FFX1) | 0.00    | 0.00 r |
| COUNT LOAD reg[1]/QN (DFFX1) |       | 0.56    | 0.56 r |
| U134/Y (NAND3X1)             | 0.23  | 0.79 f  |        |
| U160/Y (INVX2)               | 0.90  | 1.69 r  |        |
| U137/Y (MX2X2)               | 0.52  | 2.21 f  |        |
| •••••                        |       |         |        |
| •••••                        |       |         |        |
| U226/Y (OR4X4)               | 0.57  | 8.24 f  |        |
| U206/Y (NOR2X4)              | 0.97  | 9.21 r  |        |
| U149/Y (AOI22X1)             | 0.27  | 9.49 f  |        |
| U148/Y (INVX2)               | 0.16  | 9.64 r  |        |
| ADDR_reg[0]/D (EDFFX1)       | 0.00  | 9.64 r  |        |
| data arrival time            | 9.64  |         |        |
| clock CLK (rise edge)        | 10.00 | 10.00   |        |
| clock network delay (ideal)  | 0.00  |         |        |
| ADDR reg[0]/CK (EDFFX1)      | 0.00  | 10.00 r |        |
| library setup time           | -0.29 | 9.71    |        |
| data required time           | 9.71  |         |        |
|                              |       |         |        |
| data required time           | 9.71  |         |        |
| data arrival time            | -9.64 |         |        |
| slack (MET)                  | 0.07  |         |        |
| DIMOR (IVIL)                 | 0.07  |         |        |



### Fix Glitch Suppression

#### ✓ When you meet glitch suppression at 03\_GATE\_SIM

- Always occur in "dD" umc18\_neg.v, line = 10043
- Add "-nontcglitch" in 03\_GATE ./01\_run
- ✓ To specify cells in the target library to be excluded during optimization
  - dc\_shell> set\_dont\_use [get\_lib\_cells "slow/JKFF\*"]

```
set dont use slow/JKFF*
```

### **Basic Synthesis Flow**





### Save Design

#### **✓** Save design

You use the write command to save the synthesized designs.
 Remember that Design Compiler does not automatically save designs before exiting.

#### ✓ Save a gate level Verilog file

- Syntax : write –f verilog –o *file\_name.v* –hierarchy
  - -f : format
  - -o : output file name
  - -hierarchy: Indicates to write all designs in the hierarchy

write -format verilog -output Netlist/\$DESIGN\\_SYN.v -hierarchy

### Save Design

#### Change naming rule script

- Make all net and port names conform to the naming conventions for layout tool
- Execute the script after compile your design

```
set bus_inference_style "%s\[%d\]"
set bus_naming_style "%s\[%d\]"
set hdlout_internal_busses true
change_names -hierarchy -rule verilog
define_name_rules name_rule -allowed "a-z A-Z 0-9 _" -max_length 255 -type cell
define_name_rules name_rule -allowed "a-z A-Z 0-9 _[]" -max_length 255 -type net
define_name_rules name_rule -map {{"\\*cell\\*" "cell\\\*" "cell"}}
define_name_rules name_rule _case_insensitive # if you want to run spice after APR
change_names -hierarchy -rules name_rule
```

### **Gate-level Simulation**

#### ✓ Post synthesis timing simulation

- The post synthesis design must be simulated with estimated delays from synthesis. A SDF( standard delay format ) can be generated from design compiler for this purpose. Use the following command to generate the SDF file.
- Syntax : write\_sdf -version sdf\_version file\_name
  - -version sdf\_version : Selects which SDF version to use.
     Supported SDF versions are 1.0, 2.1 or 3.0. SDF 2.1 is the default.
  - file\_name : Specifies the name of the SDF file to write.
- dc\_shell> write\_sdf -version 3.0 CHIP.sdf

#### ✓ Modify your test file (TESTBED.v)

- \$sdf\_annotate("the\_SDF\_file\_name", the\_instance\_name);
- ex : \$sdf\_annotate("CHIP.sdf", I\_DAG);



### Data Flow in Design Compiler



### **Outline**

- **✓** Section 1 Design Compiler Introduction
- **✓ Section 2 Basic Synthesis Flow** 
  - Develop HDL files
  - Specify libraries
  - Read design
  - Develop design environment
  - Set design constraints
  - Select compile strategy
  - Optimize the design
  - Analyze and resolve design problems
  - Save the design database
- **✓** Section 3 Generate & For Loop





### **Review: For Loop**

#### ✓ For loop in Verilog

- Duplicate same function
- Very useful for doing reset and iterated operation

```
reg [3:0] temp;
integer i;
always @(posedge clk) begin
  for (i = 0; i < 3; i = i + 1) begin: for_name
    temp[i] <= 1'b0;
  end
end</pre>
```

```
always @(posedge clk) begin

temp[0] <= 1'b0;

temp[1] <= 1'b0;

temp[2] <= 1'b0;

end
```

### Generate – generate vs regular for loop

#### ✓ How to use for loop with generate?

- For loop in generate : three always blocks
- Regular for loop : one always block

```
reg [3:0] temp;
genvar i;
generate
for (i = 0; i < 3; i = i + 1) begin: for_name
    always @(posedge clk) begin
        temp[i] <= 1'b0;
    end
end
end
endgenerate</pre>
```

**Generate block** 

```
reg [3:0] temp;
integer i;
always @(posedge clk) begin
  for (i = 0; i < 3; i = i + 1) begin:
    temp[i] <= 1'b0;
  end
end</pre>
```

**Regular for loop** 



```
wire [3:0] o_data;
genvar i;
generate
for (i=0 ; i < 4; i = i+1)begin : loop_1
    wire tmp_result;
    assign tmp_result = operand1[i] | operand2[i];
    if(i == 0)begin
        assign o_data[0] = tmp_result;
    end
    else begin
        assign o_data[i] = tmp_result & loop_1[i-1].tmp_result;
    end
end
end
end
endgenerate</pre>
```

<u>always block in for loop with</u> <u>genvar</u>

If-else in generate for can only use for select circuit but not logic determination



Ex: loop\_1[0].tmp\_result



#### A little complicated but scalable design example





- Generate blocks are useful when change the physical structure of module via parameters.
  - We can modify the parameter for different application

```
module top
adder add_8bit #(8) (.a(a0), .b(b0), .sum(sum_8bit))
adder add_16bit #(16) (.a(a1), .b(b1), .sum(sum_16bit))
endmodule
module adder
\#(parameter LENGTH = 16)
         [LENGTH-1:0]
input
                             a,
        [LENGTH-1:0]
input
                              b,
         [LENGTH:0]
output
                              sum
generate
          sum = a + b;
endgenerate
endmodule
```



#### Reference

- ✓ <a href="http://www.ee.bgu.ac.il/~digivlsi/slides/STA\_9\_1.pdf">http://www.ee.bgu.ac.il/~digivlsi/slides/STA\_9\_1.pdf</a> (p62, p63 圖)
- Static Timing Analysis for Nanometer Designs A Practical Approach, Bhasker, J./ Chadha, Rakesh
- ✓ Design compiler user guide

```
Author: 2004 Wan-Chun Liao (celesta@si2lab.org)
2006revised Ming-Wei Lai (mwlai@si2lab.org)
2008revised Chien-Ying Yu (cyyu@si2lab.org)
2008revised Jia-Lung Lin (jllin@si2lab.org)
2009revised Yung-Chih Chen (ycchen@oasis.ee.nctu.edu.tw)
2010revised Jen-Wei Lee (circus@si2lab.org)
2011revised Chia-Ching Chu (ccchu@oasis.ee.nctu.edu.tw)
2013revised Rong-Jie Liu (johnny510.ee98@g2.nctu.edu.tw)
2014revised Ming-Feng Shiu (mango@si2lab.org)
2015revised De-An Chen (and0350283@gmail.com)
2016revised Chien-Tung Shih (ct.shihg@gmail.com) \ Heng-Wei Hsu (hengwzx@gmail.com)
2016revised Yun-Sheng Chan (yschan.ee03g@g2.nctu.edu.tw)
```

