result location mechanism (previously: well defined copy eliding semantics) #287

thejoshwolfe · 2017-03-28T03:57:04Z

Old Proposal:

Define the rules for copy elision precisely, so that code which looks like it copies a non-copyable object is allowed in the cases where no copy actually happens.

During the semantic analysis of every expression, there is an additional field of provided context, which is the location to put the expression's result value.
A function can declare a non-copyable return type. In this case, the function gets an additional, secret parameter that is a writable pointer to where it should write its return value.
Here's how specific language constructs handle the result location:
- A var or const declaration creates a location, and passes that location to the initializer expression, if any.
- An assignment statement uses the address of the left hand side as the result location for the right hand side.
- A function call or an operator that acts like a function (e.g. +, ~) creates a temporary storage location for each of its parameters/operands and provides that temporary storage as the result location when evaluating each parameter/operand expression.
- The body of a function whose return type is copyable uses a special result location, such as a platform-specific register.
- The body of a function whose return type is non-copyable uses the secret result location pointer parameter as the result location.
- A return statement provides the function body's result location to the return expression.
- For a function call where the function's return type is non-copyable, the function call expression's result location is passed as the function's secret return value pointer parameter.
- For a function call where the function's return type is copyable, the result of the function is copied from where the function puts it (such as a platform-specific register) to the function call expression's result location.
- A block or branching control structure forwards its result location to whatever sub-expression determines its result value.
- A statement followed by a ; in a block gets a void result location.
- A defer statement provides a void result location to its expression.
- A struct, array, or enum initialization expression uses its result location in-place.
- Automatic error and maybe coercion happen in-place.

Examples:

fn foo() -> u32 { // result location for function body is a special register
    bar(); // function call gets a void result location, so bar() must not return anything (see #219).
    var // variable declaration gets a void result location.
        a : u32 // creates location for a u32
        = 1; // integer literal gets &a as result location
    const b // creates a location for a TBD type
        = baz(); // baz() gets &b as result location, and baz() determines the type of b
    a = ( // result location is &a
            b // result location is a temporary location of a TBD type provided by the + operator
        + // checks left and right types and produces the sum into &a, possibly doing automatic type coercion first
            baz() // result location is a temporary location of a TBD type provided by the + operator
    );
    var array // creates a location for a TBD type
        = []u32 { // result location is &array, and now type of array is [TBD]u32.
            1, // result location is &array[0], and array.len is at least 1
            5, // result location is &array[1], and array.len is at least 2
        };
    1 // result location is the special register for the function body
}

struct BigStruct {
    a: [2]SubStruct,
    pub fn init(offset: u32) -> BigStruct { // result location is the secret parameter; let's call it result_location
        BigStruct { // result location is still result_location
            .a = []SubStruct { // result location is &result_location.a
                SubStruct { // result location is &result_location.a[0]
                    .x = offset + 0, // result location is &result_location.a[0].x
                    // (note elaboration on the + operator is omitted here. see above.)
                },
                SubStruct { // result location is &result_location.a[1]
                    .x = offset + 1, // result location is &result_location.a[1].x
                },
            },
        }
    }
}
struct SubStruct {
    x: u32,
}
fn main() {
    var a // creates a location for a TBD type
        = BigStruct.init(10); // result location secret parameter is &a
    // equivalent to:
    var b : BigStruct = undefined;
    b.a[0].x = 10 + 0;
    b.a[1].x = 10 + 1;

    var c // creates a location for a TBD type
        = if // result location is &c
        (
            something() // result location is a temporary location created by the if
        ) {
            BigStruct.init(100) // result location is &c
        } else {
            BigStruct.init(200) // result location is &c
        };
    // equivalent to:
    var d : BigStruct = undefined;
    if (something()) {
        d.a[0].x = 100 + 0;
        d.a[1].x = 100 + 1;
    } else {
        d.a[0].x = 200 + 0;
        d.a[1].x = 200 + 1;
    }

    var e // creates a location for a TBD type
        = if (something()) {
            a // ERROR: can't copy type BigStruct
        } else {
            b // ERROR: can't copy type BigStruct
        };
}
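
For illustration only, the "equivalent to" comments above can be written out as a C sketch of the lowering, assuming the secret parameter is passed as an explicit first argument. The names `BigStruct_init`, `init_from_branch`, and `demo_result_location` are hypothetical and not part of the proposal; the proposal does not prescribe this exact calling convention.

```c
#include <assert.h>
#include <stdint.h>

typedef struct { uint32_t x; } SubStruct;
typedef struct { SubStruct a[2]; } BigStruct;

/* Hypothetical lowering of BigStruct.init(offset): the "secret" result
 * location is written out as an explicit first parameter. */
void BigStruct_init(BigStruct *result_location, uint32_t offset) {
    result_location->a[0].x = offset + 0; /* written in place, no temporary */
    result_location->a[1].x = offset + 1;
}

/* Lowering of: var c = if (cond) BigStruct.init(100) else BigStruct.init(200);
 * The branch forwards its own result location (c) to whichever call runs. */
void init_from_branch(BigStruct *c, int cond) {
    if (cond) {
        BigStruct_init(c, 100);
    } else {
        BigStruct_init(c, 200);
    }
}

/* Small usage check; returns 0 when every assertion holds. */
int demo_result_location(void) {
    BigStruct a;
    BigStruct_init(&a, 10); /* var a = BigStruct.init(10); passes &a */
    assert(a.a[0].x == 10 && a.a[1].x == 11);

    BigStruct c;
    init_from_branch(&c, 0);
    assert(c.a[0].x == 200 && c.a[1].x == 201);
    return 0;
}
```

In this sketch no `BigStruct` value is ever assigned as a whole; every store targets a field of the caller's variable, which is what the result location rules are meant to guarantee.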

Relative to what #83 originally proposed, we've got relaxed restrictions on returning non-copyable types from a function. Previously returning non-copyable types required use of a named return value. So do we still need named return values?

Here's a use case for named return values:

struct PluginRegistry {
    id_to_plugin: Hashtable(Id, &Plugin), // non-copyable
    pub fn init() -> (result: PluginRegistry) {
        result.id_to_plugin = Hashtable(Id, &Plugin).init();
        result.register(base_plugin.id, &base_plugin);
    }
    pub fn register(self: &PluginRegistry, id: Id, plugin: &Plugin) {
        self.id_to_plugin.put(id, plugin);
        plugin.on_register();
    }
}

We want to design PluginRegistry to use the constructor-like pattern where you can assign from init(), and we want to do something non-trivial with the object before we return it. In order to refer to the object, it has to be named; we wouldn't be able to call register() if we did a return PluginRegistry { ... } expression.


thejoshwolfe · 2017-03-28T18:24:39Z

Let me elaborate on a specific use case:

fn foo() -> BigStruct {
    const a = BigStruct { ... }; // fine so far
    return a; // ERROR: cannot copy type BigStruct
}

The reason for this error is that at the time when you declared a, it created its own location as a local variable (or const, w/e). If the compiler were clever enough to look ahead and notice you were returning a, it could have used the secret result location pointer parameter as the storage location for a. Then the return would not be a copy, and it would work.

I'm hesitant to suggest that this rule be well-defined, because it's a bit more demanding of the compiler, and the rules for what is allowed and what's not allowed get more complicated as well. For example:

fn foo() -> BigStruct {
    const a = BigStruct { ... };
    if (something()) return a; // ERROR
    const b = BigStruct { ... };
    if (something()) return a; // ERROR
    if (something()) return b; // ERROR: but really if this were deleted, then all the errors go away.
    return a; // ERROR
}

However, one argument in favor of this idea is #286, which wants to refer to the return value of a block by name. Among the proposals in that issue, there is a simpler proposal, which is the one in this comment.

fn main() {
    const a : BigStruct = {
        const result = BigStruct{ ... };
        result.method();
        result // here's the "copy" that could be elided if the compiler notices
               // that this block only returns that local variable.
    };
}
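
As a rough sketch of the look-ahead discussed in this comment, here are two C lowerings of the same function, assuming the result location is an explicit pointer parameter. The names `foo_naive`, `foo_elided`, and `demo_elision` are hypothetical; this shows the idea, not what any Zig compiler actually emits.

```c
#include <assert.h>
#include <string.h>

typedef struct { int data[64]; } BigStruct;

/* Naive lowering: `a` gets its own stack slot, so `return a;`
 * has to copy it into the result location. */
void foo_naive(BigStruct *result) {
    BigStruct a;
    memset(&a, 0, sizeof a);
    a.data[0] = 42;
    *result = a; /* the copy a non-copyable type would reject */
}

/* Look-ahead lowering: the compiler notices that `a` is what gets
 * returned, so `a` becomes an alias for the result location. */
void foo_elided(BigStruct *result) {
    BigStruct *a = result;
    memset(a, 0, sizeof *a);
    a->data[0] = 42;
    /* `return a;` is now a no-op: the value is already in place */
}

/* Both lowerings produce the same value; returns 0 on success. */
int demo_elision(void) {
    BigStruct x, y;
    foo_naive(&x);
    foo_elided(&y);
    assert(memcmp(&x, &y, sizeof x) == 0);
    return 0;
}
```

The second form is only valid when every returned variable can share the one result location, which is why the example with both `a` and `b` being returned is hard to specify.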

BarabasGitHub · 2018-07-19T19:27:57Z

A function can declare a non-copyable return type. In this case, the function gets an additional, secret parameter that is a writable pointer to where it should write its return value.

Why not just be explicit about it and let the user provide a pointer to the function? Then everyone can clearly see it doesn't get copied and nobody has to wonder why you can 'copy' this non-copyable struct.

thejoshwolfe · 2018-07-19T19:59:37Z

That would require the user to declare the variable on a separate line and initialize it to undefined. The function signature also wouldn't indicate that the pointer is an output-only parameter, so the function implementation could read from the pointer without getting a compile error.

That all definitely works ok, and it's what you do in C, but it seems more elegant to make the function look like it's returning the thing. However, I agree that the copy-or-not semantics are a little confusing when they're completely implicit, especially when the return type is generic. Then a single function can do and not do the secret pointer thing depending on the type parameters.

It is desirable that we only have one obvious way to return things from functions. But if under the hood there are actually multiple ways, we need to be careful that surprises don't break anything. For example, we need to be careful that this doesn't cause any aliasing footguns.

andrewrk · 2018-08-31T01:13:30Z

Something like this is still planned, but this proposal is old enough now that it needs revisiting and reworking before it's ready to be implemented.

ghost · 2018-08-31T20:17:05Z

I don't believe named return types need to be, or should be, part of this proposal: cpp has guaranteed copy elision as well and does not have named return types, so they seem to be unnecessary.

It seems, though, that cpp has cases where it's not guaranteed (even in cpp 17), so it might be worth investigating this before making a final judgement. My cpp is currently not good enough to easily judge the current state of copy elision in cpp.

andrewrk · 2018-10-02T18:46:40Z

Here is my new proposal for guaranteed copy elision:

const Foo = struct {
    x: i32,
    ptr: *i32,

    fn init(z: i32) !Foo { // same function signature syntax
        try somethingThatCanFail(); // try still works
        @result() = Foo{ // new builtin function which is a reference to the return value
            .x = 1234,
            .ptr = undefined,
        };
        if (z == 0) return error.Bad;
        if (z == 1) {
            // this still works, but doesn't have guaranteed
            // copy elision semantics.
            return Foo { .x = 0, .ptr = undefined};
        }
        // in case of error inference, @result() refers to the bare value
        @result().ptr = &foo.x;
        return @result(); // returning @result() is guaranteed not to copy any memory
    }

    const Error = error{Bad};

    fn init2(z: i32) Error!Foo {
        @result() = Foo{
            .x = 1234,
            .ptr = undefined,
        };
        // since the result type is fully specified, we need to unwrap to get the bare value
        const res = &(@result() catch unreachable);
        res.ptr = &foo.x;
        return @result();
    }
};

// works at global scope too
const foo = Foo.init(2);
test "pointer value correct" {
    assert(foo.ptr == &foo.x);
}
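
To make the motivation for the test above concrete, here is a small C sketch (an illustration under stated assumptions, not part of the proposal) of why a struct with an interior pointer must be built directly in its final location. `Foo_init` and `demo_interior_pointer` are hypothetical names, and the result location is passed as an explicit pointer.

```c
#include <assert.h>

typedef struct Foo {
    int x;
    int *ptr; /* intended to point at this struct's own x */
} Foo;

/* Writes the value straight into the caller's storage, so the
 * interior pointer refers to the final location of x. */
void Foo_init(Foo *result) {
    result->x = 1234;
    result->ptr = &result->x;
}

/* Returns 0 when every assertion holds. */
int demo_interior_pointer(void) {
    Foo foo;
    Foo_init(&foo);
    assert(foo.ptr == &foo.x); /* holds: no copy was made */

    Foo copy = foo; /* a plain byte-wise copy */
    assert(copy.ptr == &foo.x);  /* still points into the original */
    assert(copy.ptr != &copy.x); /* the invariant is broken in the copy */
    return 0;
}
```

If the return value were built in a temporary and then copied out, the caller would end up in the position of `copy` here, which is the situation guaranteed elision is meant to rule out.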

The followup proposal would be something like #591 (comment) where a field could be fixed, and not doing this @result() thing to avoid copying would give a compile error.

winksaville · 2018-10-02T19:27:59Z

What happens here:

if (z == 3) {
  var foo: Foo = undefined;
  foo.x = 456;
  foo.ptr = &foo.x;
  return foo;
}

I think this should generate a compiler error?

winksaville · 2018-10-02T19:33:56Z

(Note in the above I "fixed" the foo.ptr assignment).

Also, is there a simple syntax for something like:

fn init3(v: i32) Foo {
    return @result() = Foo{
        .x = v,
        .ptr = &@result().x,
    };
}

(Sorry for the editing :( )

andrewrk · 2018-11-21T01:00:28Z

OK I'm back with an updated proposal. I'm confident about this one. So confident, in fact, that I'm going to accept it as the null hypothesis. Everyone is of course welcome to provide alternative proposals or point out flaws in this one that mean it should not be accepted.

Copy Elision Part 1, a prerequisite, is well underway in #1682. This proposal is for Part 2, where we make it possible for functions to return large structs with no copying, guaranteed, and more importantly, to use the return value before returning it, e.g. calling a method on it.

I started typing up this complicated proposal and then changed my mind at the end, and here's where I've arrived, somewhere very close to what @thejoshwolfe originally proposed.

Zig will detect when all control flow paths end with return foo;, where foo is the same in all the return expressions, and is declared in a way that allows it to reference the return value. In this case the variable declaration will reference the return value rather than be a stack allocation. The detection doesn't have to be very advanced, just good enough that it's easy to get the detection to happen when you are trying to.
Introduce the ability to mark structs/unions as "nocopy", or perhaps even at a field level, where you can mark individual fields as "fixed" which means that they cannot be moved to a new address in memory, once initialized.
If a struct/union is "nocopy" and would get copied, it's a compile error. This makes up for lack of sophistication in the result value detection. The compile error would point to the part in the code where a copy happens, and you could then adjust the logic to avoid it. Note that LLVM optimizations do much more advanced copy elision detection; this proposal is discussing only what Zig has to do to guarantee no-copy semantics in certain situations.
This solves the question about blocks. They work the same way.
Tuples: No tuples. See the comment I'm about to post on that issue. (remove var args and add anon list initialization syntax #208)

andrewrk · 2019-06-27T03:18:25Z

First part is landed in 01ff0d4
Second and third parts split into #2761 and #2765

thejoshwolfe added the enhancement Solving this issue will likely involve adding new logic or components to the codebase. label Mar 28, 2017

thejoshwolfe mentioned this issue Mar 28, 2017

named return values and reference-assignment operators #286

Closed

andrewrk added this to the 0.1.0 milestone Mar 28, 2017

thejoshwolfe mentioned this issue Mar 31, 2017

if statement suppresses error for ignoring return value #291

Closed

andrewrk mentioned this issue May 7, 2017

move semantics #190

Closed

andrewrk modified the milestones: 0.2.0, 0.1.0 May 7, 2017

andrewrk mentioned this issue Sep 9, 2017

use case: shared_ptr and unique_ptr from C++ #453

Closed

andrewrk mentioned this issue Nov 7, 2017

design flaw: using a struct as an interface and @fieldParentPtr can lead to bad pointer dereference #591

Open

andrewrk modified the milestones: 0.2.0, 0.3.0 Jan 3, 2018

thejoshwolfe mentioned this issue Jan 11, 2018

Shortcut (type inferrence) for naming enum values #683

Closed

andrewrk added proposal This issue suggests modifications. If it also has the "accepted" label then it is planned. accepted This proposal is planned. and removed enhancement Solving this issue will likely involve adding new logic or components to the codebase. labels Feb 2, 2018

andrewrk mentioned this issue Feb 19, 2018

add support for stack traces on macosx #780

Merged

andrewrk modified the milestones: 0.3.0, 0.4.0 Feb 28, 2018

andrewrk mentioned this issue May 25, 2018

make assignment an expression that returns the payload lvalue #1022

Closed

andrewrk mentioned this issue Jun 14, 2018

add another way of passing non-copyable things as parameters #733

Closed

andrewrk mentioned this issue Aug 23, 2018

TAI64 library for std #1400

Closed

andrewrk removed the accepted This proposal is planned. label Aug 31, 2018

This was referenced Oct 11, 2018

Part 1 of well-defined copy elision #1652

Closed

syntax flaw: return type #760

Closed

andrewrk mentioned this issue Oct 25, 2018

Part 1 of well-defined copy-elision (second attempt) #1682

Closed

75 tasks

tgschultz mentioned this issue Oct 28, 2018

changing pointers #1688

Closed

andrewrk mentioned this issue Nov 7, 2018

while with a comptime false condition should work like static if #1705

Open

andrewrk added the accepted This proposal is planned. label Nov 21, 2018

This was referenced Nov 21, 2018

captured values should have the same copy/const semantics as function parameters #1766

Closed

add syntax to destructure array initialization lists #498

Closed

ability to write code that is agnostic of blocking vs async I/O #1778

Closed

andrewrk modified the milestones: 0.4.0, 0.5.0 Jan 31, 2019

This was referenced Feb 18, 2019

Documentation for the standard library #965

Closed

deprecate @bitCast #1992

Closed

package manager #943

Closed

http client in the standard library #2007

Closed

andrewrk mentioned this issue Feb 28, 2019

anonymous struct literals #685

Closed

emekoi mentioned this issue Mar 28, 2019

Allocator seg fault #1083

Closed

andrewrk mentioned this issue Apr 29, 2019

The Coroutine Rewrite Issue #2377

Closed

andrewrk mentioned this issue May 31, 2019

result location mechanism (part of no-copy semantics) #2602

Merged

19 tasks

andrewrk changed the title ~~well defined copy eliding semantics~~ result location mechanism (previously: well defined copy eliding semantics) Jun 26, 2019

This was referenced Jun 26, 2019

result locations: unwrap optional and error unions so that the payload can be non-copied #2761

Open

result location: ability to refer to the return result location before the return statement #2765

Open

andrewrk closed this as completed Jun 27, 2019

momumi mentioned this issue Jan 1, 2020

Copy elision causes side effects when modifying a struct in place. #4021

Open

fengb mentioned this issue Apr 14, 2020

add documentation for Result Location Semantics #2809

Open

14 tasks

This was referenced Mar 5, 2021

Proposal: Pinned Structs #7769

Open

Should these two snippets be equivalent ? #8188

Closed

VoilaNeighbor mentioned this issue Sep 2, 2023

Mark wrapper functions inline Snektron/vulkan-zig#107

Open
