## FloorSet - a VLSI <u>Floor</u>planning Data<u>set</u> with Design Constraints of Real-World SoCs

Uday Mallappa Intel Labs USA Hesham Mostafa Intel Labs USA Mikhail Galkin Intel Labs USA

Mariano Phielipp Intel Labs USA

## Somdeb Majumdar Intel Labs USA

## **ABSTRACT**

Floorplanning for systems-on-a-chip (SoCs) and its sub-systems is a crucial and non-trivial step of the physical design flow. It represents a difficult combinatorial optimization problem. A typical large scale SoC with 120 partitions generates a search-space of  $\sim 10^{250}$ . As novel machine learning (ML) approaches emerge to tackle such problems, there is a growing need for a modern benchmark that comprises a large training dataset and performance metrics that better reflect real-world constraints and objectives compared to existing benchmarks. To address this need, we present FloorSet - two comprehensive datasets of synthetic fixed-outline floorplan layouts that reflect the distribution of real SoCs. Each dataset has 1M training samples and 100 test samples where each sample is a synthetic floorplan. FloorSet-Prime comprises fully-abutted rectilinear partitions and near-optimal wire-length. A simplified dataset that reflects early design phases, FloorSet-Lite comprises rectangular partitions, with < 5% white-space and near-optimal wire-length. Both datasets define hard constraints seen in modern design flows such as shape constraints, edge-affinity, grouping constraints, and pre-placement constraints. FloorSet is intended to spur fundamental research on large-scale constrained optimization problems. Crucially, FloorSet alleviates the core issue of reproducibility in modern ML driven solutions to such problems. FloorSet is available as an open-source repository for the research community<sup>1</sup>.

#### **ACM Reference Format:**

Uday Mallappa, Hesham Mostafa, Mikhail Galkin, Mariano Phielipp, and Somdeb Majumdar. 2024. FloorSet - a VLSI <u>Floor</u>planning Data<u>set</u> with Design Constraints of Real-World SoCs. In., 10 pages.

## 1 INTRODUCTION

Circuit partitioning is the first step of the back-end physical design flow. It divides a flat, and large circuit netlist into more manageable partitions. This step defines area budgets specific to each partition, inter-partition connectivity constraints, connections to external terminals, and the respective positions of these external terminals. These outcomes define the requirements and constraints of the floorplanning step. Furthermore, the floorplanning task is governed by numerous placement constraints on a subset or all of the partitions. The goal of the floorplanning step is to determine optimal physical

positions and shapes of the individual partitions comprising the SoC or a subsystem. The output from the floorplanning step serves as the starting point for the remainder of the physical design flow. Figure 1 shows the typical back-end design steps - our work directly addresses the first two steps of partition and sub-system placement.



Figure 1: Our work focuses on establishing realistic benchmarks FloorSet, for the first two steps (shaded) of the design planning phase of the back-end flow.

**Floorplan Constraints:** The key challenge of the floorplanning formulation lies in the requirement to satisfy hard design constraints. Some important constraints are:

- Outline-constraint: Modern ASIC design relies on hierarchical (top-down) floorplanning, with a "fixed-outline" constraint for the SoC and its sub-systems. A floorplan optimized for area without accounting for this constraint may fail to fit within the specified outline.
- Shape-constraints: These specify the acceptable range of width-to-height ratios of each partition's bounding box.
- Boundary-constraints: These specify that a partition must align with a specific edge or corner of the floorplanning outline. This alignment is driven by external interfacing requirements or system-level thermal considerations.
- Grouping-constraints: These specify a set of partitions that must be physically abutted - e.g., those operating on the same voltage or requiring simultaneous power-off.
- Multi-instantiation constraints: These define multiple partitions as instances of a shared main partition - e.g., eight

<sup>&</sup>lt;sup>1</sup>https://github.com/IntelLabs/FloorSet

<sup>© 2024</sup> Copyright held by the owner/author(s). Publication rights licensed to ACM.

- instances of a CPU core partition. Thus, all such instances must share the same shape.
- Pre-placement constraints: These specify pre-defined locations of partitions that are often derived from past designs.

Supervised Learning and Data Paucity: Lately, there has been remarkable advancement in deep learning, that construct large parametric approximators by combining simpler functions. Deep learning demonstrates exceptional performance - particularly in scenarios involving high-dimensional spaces and abundant data points. Therefore, learning-based approaches can serve as promising alternatives for complex combinatorial optimization problems. However, the lack of sufficient training layouts is a major impediment in utilizing supervised learning methods for floorplanning. Due to this, prior learning-based techniques are often restricted to reinforcement learning (RL) [4, 21, 34, 45] to yield optimal floorplan layouts. However, such approaches struggle to simultaneously optimize the objectives and respect the hard design constraints for large-scale combinatorial optimization problems. This data paucity stems primarily from intellectual-property (IP) restrictions of the chip-design industry. Our work addresses this gap by generating a large corpus of synthetic layouts that are reflect the optimal placement statistics and design constraints of real, commercial SoCs. This dataset, FloorSet, can serve as a comprehensive training and test set for ML based floorplanning algorithms.

**Overview:** The rest of the paper is organized as follows. Section 2 surveys the state of benchmarks and synthetic data in EDA. Section 3 describes our data generation pipeline, and the two dataset variants FloorSet-Prime and FloorSet-Lite. Section 4 summarizes the attributes of FloorSet and presents the complexity of the generated data. Finally, conclusions are discussed in Section 5.

#### 2 RELATED WORK

In recent years, an increasing number of works have applied ML to complex EDA problems [24, 43, 42]. These works have targeted geometric problems [46], graph processing and optimization techniques[17, 10], automated decision making [28, 3, 36], vision-based approaches[12, 30] and natural language models [33, 5]. Such ML-based solutions in the EDA flow mitigate the need for exhaustive design optimization iterations by judiciously pruning the solution space associated with sub-optimal design quality. The resulting advancements present significant strides in various design stages such as high-level logic design [27, 44], circuit design [39, 19], physical design [45, 35, 31, 29], verification [32, 14], and manufacturing aspects [8] of conventional chip-design methodologies, offering notable opportunities for improvement.

**EDA Benchmarks:** There are several examples of open-source benchmarks for physical design tasks. E.g., IWLS 2005 benchmarks [23] provide a repository of 85 synthesized RTL netlists that were collected from various public resources [40]. For discrete gate-sizing contest, ISPD 2012 [41] benchmarks annotate interconnect parasitics and timing constraints for 14 IWLS netlists. ISPD 2015 benchmarks [7] provide a 65nm dataset that comprises eight designs with routing constraints and design rules, for the purpose of blockage-aware detailed routing-driven placement task. For the multi-deck standard cell placement legalization problem, ICCAD 2017 [13] provides

eight benchmarks in LEF and DEF format, along with soft placement constraints. They are derived from ISPD 2015 dataset. For the detailed routing contest of ISPD, ISPD 2018 benchmarks [37] provide 10 test designs in 32nm and 45nm nodes, extracted from two real designs. It also defines design rules such as spacing tables, end-of-line spacing rules, cut spacing rules, min-area rules, and routing preference rules. For the macro-placement task, the ISPD02 IBM-MS mixed-size placement dataset [22] contains 18 designs with both hard macros and standard cells. The TILOS-AI-Institute added four more designs to reproduce the results of the RL-based MacroPlacement algorithm [16]. Though these efforts help in reproducibility, the scale of these benchmarks make it unusable to train modern ML models for physical design.

**Floorplanning Benchmarks:** For the floorplanning task, prior datasets GSRC [15] (*n10*, *n30*, *n50*, *n100*, *n200*, and *n300*), and MCNC [38] (*ami33*, *ami49*, *apte*, *xp*, and *xerox*) offer a standardized way to validate floorplan optimization algorithms. However, these benchmarks also suffer from being small scale, making them unusable for ML applications. In addition, the constraints used in these benchmarks do not capture many modern SoC floorplan constraints such as pre-placed constraints, boundary constraints, multi-instantiated partitions, and pin and net topologies.

Synthetic Data in EDA: IP issues often make it difficult for commercial design companies to distribute their historical EDA data. Synthetic data offers a practical solution to this data paucity problem. However, for it to be usable, it needs to reflect the structural characteristics of real designs. Gupta et al. [18] proposed benchmark circuits (called eyecharts) of arbitrary size, to diagnose the weaknesses of existing gate-sizing algorithms, and to enable a systematic and quantitative comparison of sizing algorithms. In later works, Han et al. [20] use artificial combinational paths to extract the sign-off timing values for ground truths. The artificial circuits in their work are created by sweeping the number of stages, fan-outs and segments, and cell types in timing paths. Kahng et al. [25] also use artificial circuits to train a regression model, to predict timing values at unobserved corners. Recently, PROBE2.0 [11] proposed an artificial circuit with a mesh-like netlist topology as a place-and-route (P&R) benchmark, for routability assessment. However, these artificial circuit timing paths do not reflect the distribution present in real circuits and, thus, do not capture the full complexity of real circuits. Kim et al. [26] address the aforementioned issues and introduce an artificial netlist generator (ANG) framework for constructing authentic P&R benchmarks suitable for ML. This framework generates gate-level netlists based on user-defined input parameters representing the topological attributes of realistic circuit. Our work shares a close affinity with the ANG framework, although we address the floorplanning task. To the best of our knowledge, we are the first to propose a large-scale dataset to enable ML techniques for floorplanning..

#### 3 METHODOLOGY

#### 3.1 Problem Formulation

The FloorSet benchmarks specifically target the first step of the floorplanning task within SoC and sub-system hierarchies, aiming to identify optimal shapes and positions for sub-systems and partitions while adhering to certain placement constraints. The inputs

to perform floorplanning task at SoC and sub-system hierarchies constitute:

- area budgets of partitions or sub-systems.
- locations of external terminals of the system.
- netlist connectivity that define the connections between various components of the system.
- placement constraints that define pre-determined positions for some components of the system.

The bookshelf format [6] is a widely adopted, open-source, and standardized representation of VLSI design data. The components of a floorplanning problem are captured in the  $\star$ .blocks,  $\star$ .nets and  $\star$ .pl file formats of the bookshelf format. As shown in Figure 2, we modify the  $\star$ .blocks to include componentwise placement constraints, area budgets, and fixed-outline dimensions. We include net weights in the  $\star$ .nets file, shown in Figure 3. The  $\star$ .pl file is unchanged and contains the (x,y) locations of terminals.

#### 3.2 Distribution of Real Layouts

In this section, we outline the topological statistics derived from real industrial floorplans extracted from heterogeneous SoCs and their sub-systems, with configurations upto 100 partitions. These real floorplan layouts capture modern human-designed heterogeneous SoC and sub-system implementations in the industry. Due to the necessity of preserving the IP rights of these SoCs and their implementations, we are unable to disclose the exact layouts and their associated statistics. However, we provide the details of statistics that represent these realistic floorplan layouts. We extract 10 parameters, listed in Table 1. These 10 distributions allow us to systematically explore the design space and produce synthetic layouts that closely mimic real-world layouts.

The rationale for extracting these statistics is listed below:

- Aparts: The aspect ratio range defines the acceptable width-to-height ratio of a component (partition or sub-system based on the hierarchy). For rectilinear partitions, we derive the aspect ratio from the bounding box of the rectilinear polygon. This parameter guarantees that the generated synthetic layouts exhibit realistic shapes.
- R<sup>parts</sup><sub>terms</sub>: The ratio of terminal count to partition count captures
  the proportion of terminals relative to partitions within a
  hierarchy. This offers a method to represent the proportional
  distribution of partitions and terminals in the floorplan netlists,
  without biasing towards the absolute scale of terminal count.
- {*D<sub>parts</sub>*, *D<sub>terms</sub>*, *W<sub>parts</sub>*}: To quantify the complexity of connectivity structure in layouts at SoC and sub-system hierarchies, we use density (complement of sparsity) parameters of the connectivity matrices (or adjacency matrices).

 $D_{parts}$  and  $D_{terms}$ : These parameters capture the interconnection complexity of the floorplan netlists and provide a scale-independent mechanism to capture inter-partition and partition-terminal net connectivity. It is common to have multiple net connections (up to few thousands) between a partition pair, and weighted nets capture the strength of such connections. To account for weights on nets, we also extract the net-weight distribution,  $W_{parts}$ , as a function of net-length.

The net-lengths are normalized to the fixed-outline's circumference, and net-weights are scaled to a range of 0-1.

- E<sub>parts</sub>: This captures the relative distribution of components (or partitions) with edge-affinity (or boundary affinity). These components typically represent the input-output (IO) constrained partitions or sub-systems.
- {N<sub>clusters</sub>, C<sub>parts</sub>}: These capture the number of voltage islands and the number of components in these island regions.
- P<sub>parts</sub>: The percentage of pre-placed partitions capture the relative distribution of partitions with pre-placement constraints.
   These are hard constraints on the positions of partitions that are either derived from prior design knowledge or known-optimal decisions.
- M<sub>parts</sub>: The percentage of multi-instantiation constraints represents the proportion of partitions enforcing shape-sharing constraints in modular design flows. E.g., in a hierarchy, if four instances of a CPU core exist, they all share the same shape and area.

Table 1: Design parameters whose values are sampled from statistical distributions derived from real SoCs.

| Parameter             | Description                                                               |
|-----------------------|---------------------------------------------------------------------------|
| Aparts                | Aspect ratio (= $\frac{W}{H}$ ) of a partition's bounding box             |
| N <sub>terms</sub>    | Number of terminals in a hierarchy relative to the partition count        |
| Dparts                | Non-zero element percentage in the inter-partition connectivity matrix    |
| Wparts                | Inter-partition weight distribution as a function of distance             |
| $D_{terms}$           | Non-zero element percentage in the terminal partition connectivity matrix |
| Eparts                | Percentage of partitions with edge and corner constraints                 |
| Cparts                | Percentage of partitions with grouping (or clustering) constraints        |
| N <sub>clusters</sub> | Number of clusters in a hierarchy                                         |
| Pparts                | Percentage of partitions with preplaced positions                         |
| Mparts                | Percentage of partitions with multi-instantiation constraint              |

## 3.3 FloorSet-Prime: Rectilinear Partitions

In this section, we present the mechanism to generate synthetic layouts, using the floorplan statistics extracted from real layouts. Figure 4 provides an overview of the FloorSetgeneration framework at a high-level. The circuit statistics extracted are utilized as an input for our data generation framework. In addition, the second input to the FloorSetdata generation pipeline is a custom input configuration file (shown in Figure 4), that contains the following settings:

- num\_layouts: to specify the number of floorplan layouts to be generated.
- foutline\_shape: to sweep the desired shape of the fixedoutline.
- num\_partitions: to sweep the target partition count.
- rectilinear\_flag: to enable or disable rectilinear partitions in the layout.
- placement\_constraints\_flag: to enable or disable annotation of hard placement constraints.
- dataset\_mode flag: to select between the available dataset pipelines (FloorSet-Prime and FloorSet-Lite).

The output of this data generation framework comprises layouts in Bookshelf and Pytorch tensor formats, representing diverse instances of floorplan problems. Expanding on the previous Figure 4, Figure 5 explains the FloorSet data generation framework using the



Figure 2: The bookshelf \*.blocks file is modified to include fixed-outline dimensions, area budgets, shape constraints (aspect ratio range) and placement constraints.



Figure 3: The bookshelf \*.nets file is modified to add net weights.



Figure 4: The distribution of parameters (Table 1) and the custom configuration file serve as inputs for the data generation pipeline. The output layouts are formatted in the standard bookshelf format and Pytorch tensor format.

Bookshelf Format Tensor Format

following five important steps that are described below: Parameter Extraction, Partition Sampling, Terminal Sampling, Connectivity Sampling and Constraints Sampling.

The pseudocode of FloorSet data generation pipeline is presented in Algorithm 1. To recap, the inputs to the framework are target distributions extracted from real floorplan circuits, and input

configuration file to sample the design parameters. The output of the Algorithm 1 is a repository of synthetic floorplan layouts in the Bookshelf format and a tensor-based representation that is compatible with data loader libraries commonly used in ML applications.

- 1. Parameter Extraction: From a repository of several industrial floorplan layouts sourced from diverse SoC and subsystem implementations, we extract statistical metrics (listed in Table 1) representative of the circuit characteristics inherent to modern SoC and subsystem layouts. These metrics establish the target distribution from which we sample to produce synthetic floorplan layouts. To capture connectivity characteristics, we extract the density (complement of sparsity) distribution of inter-partition ( $D_{parts}$ ) and partition-terminal  $(D_{terms})$  adjacency matrices. In addition, we extract the relative proportion of terminal-count ( $N_{terms}^{parts}$ ) to represent the scale of terminal count as a function of partition count in a hierarchy, and the the distribution of net-weights as a function of net-lengths ( $W_{parts}$ ). Moreover, statistics regarding partition-specific aspect ratios ( $A_{parts}$ ) and placement constraints ( $E_{parts}$ ,  $C_{parts}$ ,  $N_{clusters}$ ,  $P_{parts}$ ,  $M_{parts}$ ) are also extracted from real layouts. The extracted statistics serve as the input to Algorithm 1. Lines 1-2 initialize the "Layouts" database and index variables. Line 3 is a loop for generating  $n_{layouts}$ . We use tensor F representation internally, to store the layout information (line 4).
- **2. Partition Sampling:** Following step 1, the next step (line 5) involves sweeping (or extracting) the desired fixed-outline dimensions from the input configuration file. Using this sampled outline, the synthetic layout undergoes decomposition into individual partitions or subsystems. To introduce variability, a mesh-layout (line 7 of Algorithm 1) is created by employing a sequence of randomly sampled vertical and horizontal lines (line 6 of Algorithm 1). These randomly positioned vertical and horizontal lines segment the fixed-outline layout into rectangles, forming the initial partition grid. Subsequently, adjacent polygons are merged, to create partition shapes that match the desired target distribution of partition shapes. To ensure sufficient room for the creation of desired shapes and area budgets, we start with a large partition count  $(4 \times -6 \times$  of the desired partition



Figure 5: Overview of the five-step FloorSet data generation framework, illustrating the sequential processes involved in the methodology: 1. Collection and extraction of target layout distributions, 2. Partitioning shapes with the target area budgets, 3. Annotation of terminal locations, 4. Annotation of connectivity matrix (weighted), and 5. Annotation of placement.

count) on the initial grid (line 6 of of Algorithm 1). The randomized selection of merging candidates and randomized initial grid creation offer a systematic approach to sampling numerous potential fully-abutted divisions of the fixed-outline layout, exposing complex but realistic use-cases. The details of the merging process is elaborated in Algorithm 2. As shown in line 2 and line 9 of Algorithm 2, polygons on the layout are iteratively merged until the desired shapes and partition count are achieved. If rectilinear\_flag = 0 (line 4), the merging operation ensures that generated partitions are always rectangles. Conversely, when the flag is set to 1, there are no restrictions on the rectilinearity of the shapes. Each merging operation is committed if the merged partition operation improves the alignment between the current shape distribution and the target shape distribution (lines 7 and 8).



Figure 6: A sequence of merging operations, to generate the channel-less layout with desired shapes and area budgets.

**3. Terminal Sampling:** After the creation of the fully-abutted layout, we annotate the terminal positions on the outline (line 9 of Algorithm 1). Illustrated in Figure 7 and Algorithm 3, this terminal annotation process involves first placing a terminal at a random location along the outline (line 3 of Algorithm 3). Subsequently, other terminals are incrementally placed (line 6 of Algorithm 3), while adhering to the normalized-pitch considerations (line 8 of Algorithm 3) and meeting the total terminal-count quota.



Figure 7: A sequence of terminal-annotation operations, to place terminals on the layout from Step 1.

**4. Connectivity Sampling:** Following the placement of desired partition shapes (or sub-systems), and terminal locations, we create the connectivity between partitions and terminals (line 10 of Algorithm 4). The details of connectivity annotation are explained in Algorithm

4. While decomposing the fixed outline into individual partitions implicitly ensures area-optimal layouts, a optimal-by-construct annotation of connectivity is necessary to generate an optimal or nearoptimal connectivity matrix. To achieve this, we utilize the pair-wise distance matrix among all partition pairs, as well as partition and terminal combinations (lines 4-5 of Algorithm 4). The inverse of the distance matrix yields the "similarity matrix," which denotes the proximity of modules in 2D Manhattan space (lines 6-7 of Algorithm 4). Since the optimal layouts tend to place heavily connected partitions closer, annotating connections using the similarity matrix as the probabilities (line 8-9 of Algorithm 4) offers a rational approach to ensuring the near-optimality of generated connections. In addition, we also sample inter-partition net-weights from the target distribution that captures net-weights as a function of inter-partition distance. using the similarity matrix. Figure 8 visually explain the annotation of weighted inter-partition nets and partition-terminal nets, that meet the target sparsity for inter-partition and partition-net connections.



Figure 8: Using the layouts with partition and terminals placed, connections are added using probabilities proportional to the closeness (Manhattan distance) of components.

**5. Constraint Sampling:** The output from Step 4 embodies the circuit characteristics of real layouts in terms of near-optimal connectivity and optimal area. Line 11 of Algorithm 1 is the constraint annotation step, where the desired placement constraints are annotated on the area and wire-length optimal layouts. The resulting layout manifests as a hard-constrained floorplanning instance with optimal area and nearly optimal wire-length characteristics. Further details on constraint annotation are provided in Algorithm 5. Utilizing the connectivity information and the optimal positions of individual partitions, boundary constraints such as corner partitions and edge partitions are annotated (line 7 of Algorithm 5). In real circuits, it is commonly observed that the majority of pre-placed partitions reside towards the periphery, and therefore we annotate pre-placed constraint, with a higher affinity for partitions that are placed on the periphery of the layout (line 8 of Algorithm 5). Line 9 of Algorithm 5 annotates the clustering constraint on a subset of partitions that are strongly connected (physical-adjacency requirement). Finally, multi-instantiated constraints are enforced on a subset of partitions that share identical shape and area values (line 10 of Algorithm 5). Figure 9 illustrates the process of constraint annotation.

**Special case of** FloorSet-**Prime** (rectilinear\_flag = 0): Although we observe that partition shapes or sub-system shapes in real circuits are rectilinear, the current literature is far from handing arbitrarily rectilinear partitions under hard constraints. Therefore, we provide a mechanism to generate a simpler case of theFloorSet-Prime dataset by setting rectilinear\_flag to 0. The layouts thus generated will be



Figure 9: Using the layouts with optimal shapes and locations, placement constraints are annotated. We also use connectivity from Step 4, to determine edge affinity.

Algorithm 1 FloorSet-Prime: Generating optimal layouts with constraint annotation

**Require:** Input Configuration:  $I_{config} = \text{num\_layouts}$ , foutline\_shape, num\_partitions, rectilinear\_flag, placement\_constraints, dataset\_mode = Prime Target distribution:  $(A_{parts}, N_{terms}^{parts}, D_{parts}, W_{parts}, D_{terms}, E_{parts}, C_{parts}, N_{clusters}, P_{parts}, M_{parts})$ 

- 1:  $Layouts = \emptyset$
- 2: index = 0
- 3: **while** index < num\_layouts **do**
- 4:  $\mathbf{F} = \emptyset$  Internal geometric representation of the layout
- 5:  $W, H, n\_parts \leftarrow sample(foutline\_shape, num\_partitions)$ Fixed-outline dimensions & number of partitions
- 6:  $n_{grids} \leftarrow random(4, 6) * n_{parts}$  Initial partition count
- 7:  $\mathbf{F} \leftarrow createMESH(n\_grids, W, H)$ 
  - Random partitioning of the layout in to n\_grids
- :  $\mathbf{F} \leftarrow mergePARTITION(n\_parts, A_{parts}, rectilinear\_flag, \mathbf{F})$ Merging of partitions to get target shapes
- 9:  $\mathbf{F} \leftarrow annotateTERMINALS(N_{terms}^{parts}, n\_parts, term\_pitch\mathbf{F})$ Annotate terminals on the outline of the layout
- 10:  $\mathbf{F} \leftarrow annotateNETS(D_{parts}, D_{terms}, W_{parts}, \mathbf{n\_parts}, \mathbf{F})$ Create connections among partitions and terminals
- if placement\_constraints == 1 then
- 12:  $\mathbf{F} \leftarrow annotateCONSTRAINTS(E_{parts}, C_{parts}, N_{clusters}, P_{parts}, M_{parts}, \mathbf{F})$

Annotate placement constraints on the partitions

- 13: index + = 1
- 14:  $blocks, nets, pl \leftarrow convertBOOKSHELF(\mathbf{F})$
- 15:  $Layouts[index] = [blocks, nets, pl, \mathbf{F}]$

fully abutted rectangular shapes. The near-optimal connectivity and annotation of hard constraints remain unchanged.

## 3.4 FloorSet-Lite: Rectangular Partitions

Fully abutted, rectilinear shapes of partitions in a SoC or a subsystem typically emerge in the later stages of floorplanning. Such

#### Algorithm 2 Merging of partitions: mergePARTITION

```
Require: n_parts, A<sub>parts</sub>, rectilinear_flag, F
 1: part\_count = countPolygons(\mathbf{F})
 2: while part\_count > n\_parts do
       F_new \leftarrow removeRandomEdge(F)
 3:
       if rectilinear\_flag == 0 then
 4:
          if checkRectilinear(F_new) then
 5:
            continue
 6:
       if wassertsteinDist(F_new, T) < wassertsteinDist(F, T) then
 7:
         F \leftarrow F\_new
 8:
         part\_count = countPolygons(\mathbf{F})
 9:
10: return F
```

#### Algorithm 3 Annotation of terminals: annotateTERMINALS

```
Require: N_{terms}^{parts}, n_{parts}, F
 1: n_{terms} = int(n_{parts} * N_{terms}^{parts})
 2: term\_count = 0
 3: W, H \leftarrow extractOutline(\mathbf{F})
 4: term\_pitch = 2 * (W + H)/n\_terms
 5: tx, ty \leftarrow randomPointOnOutline(\mathbf{F})
 6: term\_count+=1
 7: TermList[term\_count] = [tx, ty]
 8: while term\_count < n\_terms do
 9:
       tx, ty \leftarrow randomPointOnOutline(\mathbf{F})
       if minDist(TermList, [tx, ty]) \ge term\_pitch then
10:
11:
          term\_count + = 1
          TermList[term\_count] = [tx, ty]
12:
13: \mathbf{F} \leftarrow addTermLocs(TermList)
14: return F
```

## Algorithm 4 Annotation of Connectivity: annotateNETS

```
Require: D_{parts}, D_{terms}, W_{parts}, \mathbf{F}

1: n\_b2b\_nets = int(n\_parts * D_{parts})

Number of inter-partition nets

2: n\_t2b\_nets = n\_nets * D_{terms}

Number of partition-terminal nets

3: PDist \leftarrow pairwiseB2BDistance(\mathbf{F})

4: TDist \leftarrow pairwiseT2BDistance(\mathbf{F})

Extract Manhattan distance

5: PSim \leftarrow 1 - Normalize(PDist)

6: TSim \leftarrow 1 - Normalize(TDist)

Probabilities measured as inverse of distance

7: b2bConnectivity \leftarrow Sample(\mathbf{F}, size = n\_b2b\_nets, p = PSim)

8: b2bweights \leftarrow Sample(W_{parts}, size = n\_b2b\_nets, p = PSim)

9: t2bConnectivity \leftarrow Sample(\mathbf{F}, size = n\_p2b\_nets, p = TSim)
```

rectilinear shapes truly capture the complexity of industrial floorplanning formulation. However, in the initial design exploration stages, designers commonly begin with rectangular partitions, prioritizing finding optimal locations and early design goals (e.g., timing budgets and area requirements). At this stage, the assumption of rectangular

10:  $\mathbf{F} \leftarrow addNets(b2bConnectivity, b2bweights, t2bConnectivity)$ 

11: return F

Sample net-weights and net-connections using probabilities

# **Algorithm 5** Annotation of Placement Constraints: annotateCON-STRAINTS

```
Require: E_{parts}, C_{parts}, N_{clusters}, P_{parts}, M_{parts} F

1: n\_boundary\_parts = int(n\_parts * E_{parts})

2: n\_clustered\_parts = int(n\_parts * C_{parts})

3: n\_preplaced\_parts = int(n\_parts * P_{parts})

4: n\_multi\_inst\_parts = int(n\_parts * M_{parts})

5: eDist \leftarrow inverseedgeDistances(\mathbf{F})

6: cDist \leftarrow connectivity(\mathbf{F})

7: \mathbf{F\_e} \leftarrow getEdgeParts(\mathbf{F})

8: bParts \leftarrow Sample(\mathbf{F\_e}, size = n\_boundary\_parts, p = cDist)

9: pParts \leftarrow Sample(\mathbf{F\_size} = n\_preplaced\_parts, p = eDist)

10: cParts \leftarrow deriveClusters(\mathbf{F\_n\_clustered\_parts}, N_{clusters})

11: mParts \leftarrow deriveMultiInst(\mathbf{F\_n\_multi\_inst\_parts})

12: \mathbf{F} \leftarrow addConstraints(bParts, cParts, pParts, mParts)

13: return \mathbf{F}
```

partitions often results in the formation of channels or white-spaces. This use-case is also an important problem for design-space exploration, although less challenging than the fully-rectilinear counterpart. Therefore, we introduce an additional dataset, FloorSet-Lite, to reflect such scenarios. The pseudocode of FloorSet-Lite pipeline is shown in Algorithm 6. While FloorSet-Prime decomposes the fixed-outline layout into fully-abutted rectilinear partitions, FloorSet-Lite allows for gaps or whites-paces on the layout, to accommodate the rectangular assumption of partitions. As a result, we employ a slight change in the data generation pipeline to generate layouts with white-space while maintaining near-optimal characteristics of area and wire-length. It is well established that heuristic search algorithms such as Simulated Annealing (SA) excel at the unconstrained floorplanning problem [1, 2], particularly when the joint objective of wire-length and area is removed. When the focus shifts solely to minimizing white-space, SA demonstrates exceptional performance. Therefore, as shown in line 6 of Algorithm 6, we use SA to generate a packing solution for rectangular partitions while respecting the provided fixed-outline aspect ratio. This solution is near-optimal by construction in terms of area, with some white-space (primarily because of the rectangular assumption). As shown in lines 7-11, the rest of the process (terminal annotation, connectivity annotation and placement-constraint annotation) is identical to that of FloorSet-Prime. Similar to the FloorSet-Prime flow, FloorSet-Lite uses the configuration file (Figure 4) to generate 1M training samples and 100 validation test cases.

## 4 EXPERIMENTS

## 4.1 Analysis of FloorSet

While the 1M synthetic training layouts in FloorSet-Prime and FloorSet-Lite are intended for training ML models, we reserve the 100 test cases in each dataset to be used as a standard benchmark. In each dataset, the following labels serve as performance metrics: (a) area, (b) weighted wire-lengths (inter-partition and partition-to-terminal), and (c) violation count for each of the five placement constraints. E.g., if tensor **F** represents the validation dataset, one can retrieve the metric list of test-case *i* using **F**[*i*]["labels"]. The metric list has three components, in a list format [area, inter-partition

**Algorithm 6** FloorSet-Lite: Generating optimal layouts with constraint annotation

```
Require: Input
                                                                  Configuration:
                                                                                                                                                                                      num_layouts,
                                                                                                                                 I_{config}
               foutline_shape, num_partitions, rectilinear_flag,
                                                                                                                                                                                                               place-
               ment_constraints, dataset_mode = Prime
             Target distribution: (A_{parts}, N_{terms}^{parts}, D_{parts}, W_{parts}, D_{terms},
               E_{parts}, C_{parts}, N_{clusters}, P_{parts}, M_{parts})
     1: Layouts = \emptyset
     2: index = 0
     3:
            while index < num layouts do
     4.
                      \mathbf{F} = \emptyset
                                                                   Internal geometric representation of the layout
     5:
                      W, H, n\_parts \leftarrow sample(foutline\_shape, num\_partitions)
                                                              Fixed-outline dimensions & number of partitions
                      \mathbf{F} \leftarrow runSA(W, H, n\_parts, A_{parts})
     6:
                                Run Parquet (Simulated Annealing) for area optimization
                      \mathbf{F} \leftarrow annotate TERMINALS(N_{terms}^{parts}, n\_parts, term\_pitch\mathbf{F})
     7:
                      \mathbf{F} \leftarrow annotateNETS(D_{parts}, D_{terms}, W_{parts}, n\_parts, \mathbf{F})
     8:
                      if placement_constraints == 1 then
     9:
                              \mathbf{F} \leftarrow annotate CONSTRAINTS(E_{parts}, C_{parts}, N_{clusters}, P_{parts}, 
  10:
                                                                                                                                                                                                   M_{parts}, \mathbf{F})
                      index + = 1
 11:
                      blocks, nets, pl \leftarrow convertBOOKSHELF(\mathbf{F})
 12.
                      Layouts[index] = [blocks, nets, pl, \mathbf{F}]
 13:
```

wire-length, terminal-to-partition wire-length]. Since the dataset is devoid of constraint violations by construction, we do not explicitly store the violation count.

To underscore the complexity of FloorSet, we run classical SA [9] with a constraint-penalty on the FloorSet validation set. Figure 10 presents scatter plots (left) for one data-point from the validation sets of FloorSet-Prime (top) and FloorSet-Lite (bottom). The relative area and wire-length are derived using the corresponding metrics from FloorSet as the baseline; the golden layout on this relative scale is indicated by the green dot at (1, 1). For FloorSet-Prime, we observe that the solutions are significantly sub-optimal in terms of wire-length and area. Crucially, all solutions exhibit constraint violations. While solutions obtained on FloorSet-Lite's problem are closer to the optimal in terms of area, minimizing wire-length and fixing constraint violations remain a big challenge. The desired behavior of any future baseline is to approach the green dots, while respecting the hard-constraints. Figure 10 also shows the layout images of the corresponding golden data. The top-right figure shows an example 25-partition FloorSet-Prime layout (corresponding to the green dot on top-left figure). The bottom-right figure shows an example 65-partition FloorSet-Lite layout (corresponding to the green dot on bottom-left figure). The synthetic layout of FloorSet-Prime (top-right) closely resembles partition shapes in real-word layouts.

## 4.2 Distributions of FloorSet Characteristics

While Section 4.1 underscores the complexity of FloorSet-Prime benchmarks and highlights the sub-optimal performance of existing methods, it remains essential to establish the resemblance between these complex synthetic layouts and real-world counterparts in terms of their circuit characteristics. Figure 11 compares the probability density function (PDF) of synthetic and real layouts using their



Figure 10: Scatter plots (left) and the corresponding constraint annotated optimal layouts (right) indicating the optimality gap and violations of hard constraints, when applying classical SA on a test cases picked from the validation set of FloorSet-Prime (top) and FloorSet-Lite (bottom). On FloorSet-Lite, SA is able to find more area-optimal placements compared to FloorSet-Prime - consistent with the former's simplified rectangular partitions. However, almost all solutions discovered by SA on both datasets have constraint violations.

shape and connectivity metrics. The left plot depicts the partition-level shape distribution by plotting the PDF with respect to the aspect ratios of individual partitions. The proximity of real PDF (red) and the FloorSet-Prime PDF (black) is a clear indication of the resemblance between the shapes of synthetic and real layouts. Similarly, the plot on the right shows the PDF of inter-partition wire-length. The normalization of wire-length allows for a direct comparison. The closeness of the red and black curves illustrates that the synthetic layouts closely reflect the inherent connectivity distribution observed in real-world layouts.



Figure 11: Comparison between distributions of partition characteristics in FloorSet vs real designs. Distributions of aspect ratio (left) and inter-partition wire-length (right) in FloorSet closely match that of real SoC designs.

## 5 CONCLUSIONS

This paper introduces FloorSet, a VLSI floorplanning dataset that reflects the complexities and constraints of modern SoC design.

Crucially, it provides large-scale training data that is necessary to develop modern ML techniques to solve floorplanning problems. FloorSet-Lite, with near-optimal packing and wire-length for rectangular partitions, represents early stages of design. FloorSet-Prime, with the added complexity of fully-abutted rectilinear blocks, represents the final stages of design. FloorSet is constructed by sampling outline and wire-length values from distributions derived from real, commercial SoC designs. We hope that FloorSet spurs novel solutions in the area of complex combinatorial optimization that significantly advance the field of EDA.

#### 6 ACKNOWLEDGEMENTS

We thank Miaomiao Ma (Intel), Pei Chun Ch'ng (Intel), Olena Zhu (Intel) and Fadi Aboud (Intel) for providing us with insights into the real-world constraints that are often overlooked in academic benchmarks. Their expertise and collaboration have been critical to the development of FloorSet.

#### REFERENCES

- S.N. Adya and I.L. Markov. 2001. Fixed-outline floorplanning through better local search. In *ICCD* 2001, 328–334.
- [2] S.N. Adya and I.L. Markov. 2003. Fixed-outline floorplanning: enabling hierarchical design. *IEEE Transactions on Very Large Scale Integration (VLSI)* Systems, 11, 6, 1120–1135.
- [3] Anthony Agnesina, Kyungwook Chang, and Sung Kyu Lim. 2020. Vlsi placement parameter optimization using deep reinforcement learning. In *ICCAD* '20, 1–9.
- [4] Mohammad Amini, Zhanguang Zhang, Surya Penmetsa, Yingxue Zhang, Jianye Hao, and Wulong Liu. 2022. Generalizable floorplanner through corner block list representation and hypergraph embedding. In (KDD '22). Association for Computing Machinery, 2692–2702. ISBN: 9781450393850.
- [5] Jason Blocklove, Siddharth Garg, Ramesh Karri, and Hammond Pearce. 2023. Chip-chat: challenges and opportunities in conversational hardware design. In 2023 ACM/IEEE 5th Workshop on Machine Learning for CAD (MLCAD), 1–6.
- [6] Bookshelf. (). http://vlsicad.eecs.umich.edu/BK/overview.htm.
- [7] Ismail S. Bustany, David Chinnery, Joseph R. Shinnerl, and Vladimir Yutsis. 2015. Ispd 2015 benchmarks with fence regions and routing blockages for detailed-routing-driven placement. In (ISPD '15). Association for Computing Machinery, 157–164. ISBN: 9781450333993.
- [8] Yu Cao. Machine learning in computational lithography. (2019). https://www.ebeam.org/docs/SPIE2019-yu-cao.pdf.
- [9] Tung-Chieh Chen and Yao-Wen Chang. 2006. Modern floorplanning based on b/sup \*/-tree and fast simulated annealing. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 25, 4, 637–650.
- [10] Chung-Kuan Cheng, Chester Holtz, Andrew B. Kahng, Bill Lin, and Uday Mallappa. 2023. Dagsizer: a directed graph convolutional network approach to discrete gate sizing of vlsi graphs. ACM TODAES, 28, 4, Article 52, (May 2023), 31 pages.
- [11] Chung-Kuan Cheng, Andrew B. Kahng, Hayoung Kim, Minsoo Kim, Daeyeal Lee, Dongwon Park, and Mingyu Woo. 2022. Probe 2.0: a systematic framework for routability assessment from technology to design in advanced nodes. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 41, 5. 1495–1508.
- [12] Vidya A. Chhabria, Andrew B. Kahng, Minsoo Kim, Uday Mallappa, Sachin S. Sapatnekar, and Bangqi Xu. 2020. Template-based pdn synthesis in floorplan and placement using classifier and cnn techniques. In ASP-DAC, 44–49.
- [13] Nima Karimpour Darav, Ismail S. Bustany, Andrew Kennings, and Ravi Mamidi. 2017. Iccad-2017 cad contest in multi-deck standard cell legalization and benchmarks. In 2017 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 867–871.
- [14] Lamya Gaber, Aziza I. Hussein, and Mohammed Moness. 2021. Fault detection based on deep learning for digital vlsi circuits. *Procedia Computer Science*, 194, 122–131. 18th International Learning & Technology Conference 2021. https://www.sciencedirect.com/science/article/pii/S1877050921021062.
- [15] Gsrc. (). http://vlsicad.eecs.umich.edu/BK/GSRCbench.
- [16] Sergio Guadarrama, Summer Yue, Toby Boyd, Joe Wenjie Jiang, Ebrahim Songhori, Terence Tam, and Azalia Mirhoseini. 2021. Circuit Training: an open-source framework for generating chip floor plans with distributed deep reinforcement learning. [Online; accessed 21-December-2021]. (2021). https://github.com/google\_research/circuit\_training.

- [17] Zizheng Guo, Mingjie Liu, Jiaqi Gu, Shuhan Zhang, David Z. Pan, and Yibo Lin. 2022. A timing engine inspired graph neural network model for pre-routing slack prediction. In *Proceedings of the 59th ACM/IEEE Design Automation Conference* (DAC '22). Association for Computing Machinery, San Francisco, California, 1207–1212. ISBN: 9781450391429.
- [18] Puneet Gupta, Andrew B. Kahng, Amarnath Kasibhatla, and Puneet Sharma. 2010. Eyecharts: constructive benchmarking of gate sizing heuristics. In *Design Automation Conference*, 597–602.
- [19] Winston Haaswijk, Edo Collins, Benoit Seguin, Mathias Soeken, Frédéric Kaplan, Sabine Süsstrunk, and Giovanni De Micheli. 2018. Deep learning for logic optimization algorithms. In 2018 IEEE International Symposium on Circuits and Systems (ISCAS). 1–4.
- [20] Seung-Soo Han, Andrew B. Kahng, Siddhartha Nath, and Ashok S. Vydyanathan. 2014. A deep learning methodology to proliferate golden signoff timing. In DATE'14, 1–6.
- [21] Zhuolun He, Yuzhe Ma, Lu Zhang, Peiyu Liao, Ngai Wong, Bei Yu, and Martin D.F. Wong. 2020. Learn to floorplan through acquisition of effective local search heuristics. In 2020 IEEE 38th International Conference on Computer Design (ICCD), 324–331.
- [22] Ispd-ibm. (). http://vlsicad.eecs.umich.edu/BK/ISPD02bench/#The\_Benchmark
- [23] Iwls 2005. (). https://iwls.org/iwls2005/benchmarks.html.
- [24] Andrew B. Kahng. 2023. Machine learning for cad/eda: the road ahead. IEEE Design & Test, 40, 1, 8–16.
- [25] Andrew B. Kahng, Uday Mallappa, Lawrence Saul, and Shangyuan Tong. 2019. "unobserved corner" prediction: reducing timing analysis effort for faster design convergence in advanced-node design. In DATE'19, 168–173.
- [26] Daeyeon Kim, Sung-Yun Lee, Kyungjun Min, and Seokhyeong Kang. 2023. Construction of realistic place-and-route benchmarks for machine learning applications. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 42, 6, 2030–2042.
- [27] Jihye Kwon and Luca P. Carloni. 2020. Transfer learning for design-space exploration with high-level synthesis. In 2020 ACM/IEEE 2nd Workshop on Machine Learning for CAD (MLCAD), 163–168.
- [28] Yao Lai, Jinxin Liu, Zhentao Tang, Bin Wang, Jianye Hao, and Ping Luo. 2023. ChiPFormer: transferable chip placement via offline decision transformer. In (Proceedings of Machine Learning Research). Vol. 202. PMLR, (23–29 Jul 2023), 18346–18364.
- [29] Yao Lai, Yao Mu, and Ping Luo. 2022. Maskplace: fast chip placement via reinforced visual representation learning. (2022). arXiv: 2211.13382 [cs.CV].
- [30] Rongjian Liang, Hua Xiang, Diwesh Pandey, Lakshmi Reddy, Shyam Ramji, Gi-Joon Nam, and Jiang Hu. 2020. Drc hotspot prediction at sub-10nm process nodes using customized convolutional network. In Proceedings of the 2020 International Symposium on Physical Design (ISPD '20). Association for Computing Machinery, New York, NY, USA, 135–142. ISBN: 9781450370912.
- [31] Haiguang Liao, Wentai Zhang, Xuliang Dong, Barnabas Poczos, Kenji Shimada, and Levent Burak Kara. 2019. A deep reinforcement learning approach for global routing. (2019). arXiv: 1906.08809 [cs.LG].
- [32] Hyeong Gu Lim, Jaeyeon Jang, Byung Kook Ju, Jae Woo Ko, and Chang Ouk Kim. 2023. A test vector selection method based on machine learning for efficient presilicon verification. Expert Systems with Applications, 224, 120056. https://www.sciencedirect.com/science/article/pii/S0957417423005584.
- [33] Mingjie Liu et al. 2024. Chipnemo: domain-adapted llms for chip design. (2024). arXiv: 2311.00176 [cs.CL].
- [34] Yiting Liu, Ziyi Ju, Zhengming Li, Mingzhi Dong, Hai Zhou, Jia Wang, Fan Yang, Xuan Zeng, and Li Shang. 2022. Graphplanner: floorplanning with graph neural network. ACM Trans. Des. Autom. Electron. Syst., 28, 2, Article 21, (Dec. 2022), 24 pages.
- [35] Yi-Chen Lu, Jeehyun Lee, Anthony Agnesina, Kambiz Samadi, and Sung Kyu Lim. 2019. Gan-cts: a generative adversarial framework for clock tree prediction and optimization. In 2019 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 1–8.
- [36] Yi-Chen Lu, Siddhartha Nath, Vishal Khandelwal, and Sung Kyu Lim. 2021. Rl-sizer: vlsi gate sizing for timing optimization using deep reinforcement learning. In DAC '21, 733–738.
- [37] Stefanus Mantik, Gracieli Posser, Wing-Kai Chow, Yixiao Ding, and Wen-Hao Liu. 2018. Ispd 2018 initial detailed routing contest and benchmarks. In Proceedings of the 2018 International Symposium on Physical Design (ISPD '18). Association for Computing Machinery, Monterey, California, USA, 140–143. ISBN: 9781450356268.
- [38] Mcnc. (). http://vlsicad.eecs.umich.edu/BK/MCNCbench.
- [39] Walter Lau Neto, Max Austin, Scott Temple, Luca Amaru, Xifan Tang, and Pierre-Emmanuel Gaillardon. 2019. Lsoracle: a logic synthesis framework driven by artificial intelligence: invited paper. In 2019 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 1–6.
- [40] Opencores. (). https://opencores.org.
- [41] Muhammet Mustafa Ozdal, Chirayu Amin, Andrey Ayupov, Steven Burns, Gustavo Wilke, and Cheng Zhuo. 2012. The ispd-2012 discrete cell sizing contest

- and benchmark suite. In (ISPD '12). Association for Computing Machinery, New York, NY, USA, 161–164. ISBN: 9781450311670. https://doi.org/10.1145/2160916.2160950.
- [42] David Z. Pan. Closing the virtuous cycle of ai for ic and ic for ai. The Council on Electronic Design Automation (CEDA), IEEE, (2021). https://ieee-ceda.org/presentation/webinar/closing-virtuous-cycle-ai-ic-and-ic-ai.
   [43] Martin Rapp, Hussam Amrouch, Yibo Lin, Bei Yu, David Z. Pan, Marilyn Wolf,
- [43] Martin Rapp, Hussam Amrouch, Yibo Lin, Bei Yu, David Z. Pan, Marilyn Wolf, and Jörg Henkel. 2022. Mlcad: a survey of research in machine learning for cad keynote paper. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 41, 10, 3162–3181.
- [44] Zi Wang and Benjamin Carrion Schafer. 2022. Learning from the past: efficient high-level synthesis design space exploration for fpgas. ACM Trans. Des. Autom. Electron. Syst., 27, 4, Article 34, (Feb. 2022), 23 pages.
- [45] Qi Xu, Hao Geng, Song Chen, Bo Yuan, Cheng Zhuo, Yi Kang, and Xiaoqing Wen. 2022. Goodfloorplan: graph convolutional network and reinforcement learning-based floorplanning. *Trans. Comp.-Aided Des. Integ. Cir. Sys.*, 41, 10, (Oct. 2022), 3492–3502.
- [46] Hang Zhao, Qijin She, Chenyang Zhu, Yin Yang, and Kai Xu. 2021. Online 3d bin packing with constrained deep reinforcement learning. In *Thirty-Fifth AAAI Conference on Artificial Intelligence*, AAAI 2021. AAAI Press, 741–749.