Aalto University School of Science Master's Programme in Computer, Communication and Information Sciences

Artem YUSHKOVSKIY

# **Automated Analysis of Weak Memory Models**

Master's Thesis Espoo, ?.?.2018

Supervisor: Assoc. Prof. Keijo Heljanko

Instructor:



Aalto University School of Science

Master's Programme in Computer, Communication and Information Sciences

ABSTRACT OF
MASTER'S THESIS

**Author:** Artem YUSHKOVSKIY

Title:

Automated Analysis of Weak Memory Models

**Date:** ?.?.2018 **Pages:** vi + 7

Professorship: Code: AS-116

**Supervisor:** Assoc. Prof. Keijo Heljanko

**Instructor:** 

In id fringilla velit. Maecenas sed ante sit amet nisi iaculis bibendum sed vel elit. Quisque eleifend lacus nec ipsum lobortis ornare. Nam lectus diam, facilisis eget porttitor ac, fringilla quis massa. Phasellus ac dolor sem, eget varius lacus. Sed sit amet ipsum eget arcu tristique aliquam. Integer aliquam velit sit amet odio tempus commodo. Quisque commodo lacus in leo sagittis vel dignissim quam vestibulum. Cras fringilla velit et diam dictum faucibus. Pellentesque at eros non mauris auctor euismod. Nullam convallis arcu vel lectus sollicitudin rutrum. Praesent consequat, nisl at pretium posuere, neque arcu dapibus lacus, ut sollicitudin elit velit ultricies libero. Vestibulum ante ipsum primis in faucibus orci luctus et ultrices posuere cubilia Curae;

Nulla semper hendrerit molestie. Pellentesque blandit velit sit amet est vestibulum faucibus. Nullam massa turpis, venenatis non mollis fringilla, mattis et diam. Fusce molestie convallis elementum. Morbi nec lacus dapibus arcu mollis gravida. Aliquam erat volutpat. Nam vitae magna nunc. Nunc ut ipsum at massa porttitor vestibulum. Praesent diam lorem, ultrices nec vestibulum id, volutpat nec lacus.

**Keywords:** Thesis template, master's thesis

**Language:** English



#### Aalto-yliopisto Perustieteiden korkeakoulu

???

| Tekijä:      | Artem YUSHKOVSKIY           |            |        |
|--------------|-----------------------------|------------|--------|
| Työn nimi:   |                             |            |        |
| ?            |                             |            |        |
| Päiväys:     | ?.?.2018                    | Sivumäärä: | vi + 7 |
| Professuuri: | ?                           | Koodi:     | AS-116 |
| Valvoja:     | Assoc. Prof. Keijo Heljanko |            |        |
| Ohjaaja:     |                             |            |        |

Cras tincidunt bibendum erat, vel tincidunt diam porttitor aliquam. Donec sit amet urna non felis placerat pharetra. Aenean ultrices facilisis nulla vitae semper. Nullam non libero quis dui fermentum aliquam id vel eros. Praesent elementum tortor quis sem congue iaculis sit amet eget nisl. Quisque erat tortor, condimentum eu volutpat et, blandit et augue. Phasellus erat turpis, pretium non feugiat id, posuere id velit. Vestibulum ut sapien felis, quis convallis dui.

In elementum est eu nulla hendrerit feugiat. In sodales diam vel lacus cursus tincidunt. Morbi nibh dui, imperdiet non vestibulum non, dignissim id risus. Sed sollicitudin neque lectus, porttitor sollicitudin elit. Nulla facilisi. Nullam in ante eu mi suscipit sollicitudin. Sed est velit, gravida facilisis varius eget, tempus sed urna. Aliquam erat volutpat. Nam semper condimentum nisi. Nullam scelerisque, metus nec sodales vulputate, purus augue venenatis urna, sit amet mattis turpis nisl ac metus. Mauris nec odio ut neque condimentum vulputate vel in turpis. Nulla facilisi. Nulla id tellus sapien, vitae blandit lorem.

| Asiasanat: | Diplomityöpohja |
|------------|-----------------|
| Kieli:     | Englanti        |

# Acknowledgements

In id fringilla velit. Maecenas sed ante sit amet nisi iaculis bibendum sed vel elit. Quisque eleifend lacus nec ipsum lobortis ornare. Nam lectus diam, facilisis eget porttitor ac, fringilla quis massa. Phasellus ac dolor sem, eget varius lacus. Sed sit amet ipsum eget arcu tristique aliquam. Integer aliquam velit sit amet odio tempus commodo. Quisque commodo lacus in leo sagittis vel dignissim quam vestibulum. Cras fringilla velit et diam dictum faucibus. Pellentesque at eros non mauris auctor euismod. Nullam convallis arcu vel lectus sollicitudin rutrum. Praesent consequat, nisl at pretium posuere, neque arcu dapibus lacus, ut sollicitudin elit velit ultricies libero. Vestibulum ante ipsum primis in faucibus orci luctus et ultrices posuere cubilia Curae;

Espoo, ?.?.2018

Artem YUSHKOVSKIY

# **Abbreviations**

LI Lorem Ipsum

ABC Quisque et mi lacus, nec porta ante.
DEF Proin pellentesque accumsan laoreet

# **Contents**

| Al | Abbreviations |                                        |   |  |
|----|---------------|----------------------------------------|---|--|
| 1  | Intr          | roduction                              | 1 |  |
|    | 1.1           | Thesis structure                       | 2 |  |
| 2  | Wea           | ak Memory Models                       | 3 |  |
|    | 2.1           | The CAT language                       | 3 |  |
|    | 2.2           | Examples of WMM                        | 3 |  |
|    | 2.3           | The CAT language                       | 3 |  |
| 3  | Imp           | lementation                            | 4 |  |
|    | 3.1           | Program Requirements                   | 4 |  |
|    | 3.2           | Program Components                     | 4 |  |
|    |               | 3.2.1 C11 to YTree parser              | 4 |  |
|    |               | 3.2.2 YTree to XGraph event converter  | 5 |  |
|    |               | 3.2.3 XGraph to ZFormula (SMT) encoder | 5 |  |
|    | 3.3           | Optimisations                          | 5 |  |
| 4  | Eval          | luation                                | 6 |  |
|    | 4.1           | Comparison with PORTHOS                | 6 |  |
|    |               | 4.1.1 Unique Features                  | 6 |  |
|    |               | 4.1.2 Performance                      | 6 |  |
|    | 4.2           | Comparison with HERD                   | 6 |  |
|    |               | 4.2.1 Unique Features                  | 6 |  |
|    |               | 4.2.2 Performance                      | 6 |  |
| 5  | Sun           | nmary                                  | 7 |  |

## Introduction

problems of concurrency necessity of Weak Memory Models

- hardware wmms
- wmms of programming languages
- specific wmms like for kernel

wmm as a formal way to define guaranties that a hw, programming language, execution environment provide for programmers.

considering wmm as a set of allowed behaviours, the latter wmms are the supersets

wmm allows and disallows optimisations: partial sync of memory buffers, out-of-order execution (reordering), <more> => more behaviours that are unallowed in SC.

question possible to answer with wmm: which behaviours (in addition to SC) are allowed? which new states are allowed? Consequently, correctness, absence of data races, deadlocks or portability issues, etc.

existing tools: herd, diy7 – exhaustive search approach for exploring the state space.

another approach: using smt solver, e.g. for answering questions like base paper: aims to investigate portability of small programs written in a C-like pseudocode and provides the proof-of-concept tool PORTHOS [link]. As input, it takes a program and two memory models in CAT language. Then it encodes programs and memory models into an smt formula and tries to solve it via z3. Current thesis aims to extend this tool functionality to process the real C code, therefore it proposes different modula program architecture and multiple optimisations.

2

### 1.1 Thesis structure

...

# Weak Memory Models

briefly what it is, as in Intro

Some examples of what wmms allow to do

Lamport's sequentially consistent memory model – global order "a global order, even if they are actually executed in different threads running in parallel in different processors. In this model, each read from some memory location is guaranteed to see a value written by the last write to this location." ([c11 by natasha]). Produces same result as sequential program.

#### 2.1 The CAT language

the CAT language [ref to Jade's paper] (hard part: to decipher the Alglave's paper).

the event representation

#### 2.2 Examples of WMM

briefly known hw memory models: X86-TSO, Alpha, POWER, – ref to Jade; language memory models: Java, C++; library-level kernel memory model, ref to github with tests

Relationship between different models

#### 2.3 The CAT language

## **Implementation**

This chapter describes the architecture of the tool mousquitaires ... language: java

#### 3.1 Program Requirements

- stability (tests) - scalability (new features of language, new models, new tasks for a program – e.g.!) - transparency - efficiency

#### 3.2 Program Components

Big view

#### 3.2.1 C11 to YTree parser

- The language-dependent syntax tree: for now it's the C subset language which I called 'Cmin'; as a base, I used the C11 grammar from ANTLR github repository, then I simplified it a lot, cutting off many unnecessary C syntax features and making it more convenient for parsing. When developing the Cmin language, I kept in mind C elements that are necessary for processing the linux kernel code, though for now not the whole grammar element described in file 'Cmin.g4' are being implemented; later I am going to add the litmus grammar as well; in future, it will be not a problem to add any new C-like language;
- The language-independent abstract syntax tree (aliased 'Ytree', where 'Y' resembles branching of the tree): all tree nodes in my code are prefixed

with 'Y', see tentative (yet almost complete) class hierarchy in picture 'YEntity.png'; - this AST contains very basic language elements according to the C execution model (statements and expressions); - converting the language-dependent syntax tree to the language-independent syntax tree is performed by Visitor pattern (e.g., for Cmin->Ytree conversion is made by 'CminToYtreeConverterVisitor') - minor changes are performed by converting to ytree representation: desugaring the target code, etc.

#### 3.2.2 YTree to XGraph event converter

- Then, the AST is being interpreted and converted to event-based representation (aliased 'Xrepr' for eXecution representation): more low-level code representation (or high-level assembly); I try to keep this representation close to the one you described in your papers: basic load & store events, branching events, fence events; this representation is being implementing these days, I've just started doing it (see current class hierarchy in the picture 'XEntity.png');
- After we acquired the event-based representation, we can perform some modifications/simplifications/optimisations on it (separately, allowing user to manage them): converting to SSA form as one of necessary steps before encoding; (more? I'm not thinking about it yet);

#### 3.2.3 XGraph to ZFormula (SMT) encoder

- Then, this modified event-representation is being encoded to SMT formula and sent to the solver.

#### 3.3 Optimisations

... in each stage

## **Evaluation**

- 4.1 Comparison with PORTHOS
- 4.1.1 Unique Features
- 4.1.2 Performance
- 4.2 Comparison with HERD
- 4.2.1 Unique Features
- 4.2.2 Performance

Summary