Tracking issue for uninitialized constructors for Box, Rc, Arc #63291

SimonSapin · 2019-08-05T15:34:56Z

Assigning MaybeUninit::<Foo>::uninit() to a local variable is usually free, even when size_of::<Foo>() is large. However, passing it for example to Arc::new causes at least one copy (from the stack to the newly allocated heap memory) even though there is no meaningful data. It is theoretically possible that a Sufficiently Advanced Compiler could optimize this copy away, but this is reportedly unlikely to happen soon in LLVM.

This issue tracks constructors for containers (Box, Rc, Arc) of MaybeUninit<T> or [MaybeUninit<T>] that do not initialized the data, and unsafe conversions to the known-initialized types (without MaybeUninit). The constructors are guaranteed not to make unnecessary copies.

PR #62451 adds:

impl<T> Box<T> { pub fn new_uninit() -> Box<MaybeUninit<T>> {…} }
impl<T> Box<MaybeUninit<T>> { pub unsafe fn assume_init(self) -> Box<T> {…} }
impl<T> Box<[T]> { pub fn new_uninit_slice(len: usize) -> Box<[MaybeUninit<T>]> {…} }
impl<T> Box<[MaybeUninit<T>]> { pub unsafe fn assume_init(self) -> Box<[T]> {…} }

impl<T> Rc<T> { pub fn new_uninit() -> Rc<MaybeUninit<T>> {…} }
impl<T> Rc<MaybeUninit<T>> { pub unsafe fn assume_init(self) -> Rc<T> {…} }
impl<T> Rc<[T]> { pub fn new_uninit_slice(len: usize) -> Rc<[MaybeUninit<T>]> {…} }
impl<T> Rc<[MaybeUninit<T>]> { pub unsafe fn assume_init(self) -> Rc<[T]> {…} }

impl<T> Arc<T> { pub fn new_uninit() -> Arc<MaybeUninit<T>> {…} }
impl<T> Arc<MaybeUninit<T>> { pub unsafe fn assume_init(self) -> Arc<T> {…} }
impl<T> Arc<[T]> { pub fn new_uninit_slice(len: usize) -> Arc<[MaybeUninit<T>]> {…} }
impl<T> Arc<[MaybeUninit<T>]> { pub unsafe fn assume_init(self) -> Arc<[T]> {…} }

PR #66128 adds:

impl<T> Box<T> { pub fn new_zeroed() -> Box<MaybeUninit<T>> {…} }
impl<T> Arc<T> { pub fn new_zeroed() -> Arc<MaybeUninit<T>> {…} }
impl<T> Rc<T> { pub fn new_zeroed() -> Rc<MaybeUninit<T>> {…} }

Unresolved question:

The constructor that returns for example Box<MaybeUninit<T>> might “belong” more as an associated function of that same type, rather than Box<T>. (And similarly for other constructors.) However this would make a call like Box::<u32>::new_uninit() becomes Box::<MaybeUnint<u32>>::new_uninit() which feels unnecessarily verbose. I suspect that this turbofish will be needed in a lot of cases to appease type inference.

The text was updated successfully, but these errors were encountered:

TimDiekmann · 2019-10-05T22:04:16Z

Just from looking at the current source code:

pub fn new_uninit() -> Box<mem::MaybeUninit<T>> {
    let layout = alloc::Layout::new::<mem::MaybeUninit<T>>();
    let ptr = unsafe {
        Global.alloc(layout)
            .unwrap_or_else(|_| alloc::handle_alloc_error(layout))
    };
    Box(ptr.cast().into())
}

When mem::size_of::<T>() == 0, a zero-sized layout is passed to Global.alloc but alloc mentions:

This function is unsafe because undefined behavior can result if the caller does not ensure that layout has non-zero size.

Did I have overseen something or is this a bug?

SimonSapin · 2019-10-05T23:12:37Z

Good point! I copied this pattern from Rc and Arc, but there the header (refcounts) cause the allocation to never be zero-size.

Box::new_uninit_slice has a similar bug with size_of() == 0 or len == 0.

TimDiekmann · 2019-10-05T23:20:08Z

I have noticed this when implementing Box for custom allocators with non-zero layouts. Turns out, that this is indeed useful 🙂

This is my implementation for the sliced version.

Requesting a zero-size allocation is not allowed, return a dangling pointer instead. CC rust-lang#63291 (comment)

SimonSapin · 2019-10-06T21:51:25Z

#65174

Fix zero-size uninitialized boxes Requesting a zero-size allocation is not allowed, return a dangling pointer instead. CC rust-lang#63291 (comment)

alloc: Add new_zeroed() versions like new_uninit(). MaybeUninit has both uninit() and zeroed(), it seems reasonable to have the same surface on Box/Rc/Arc. Needs tests. cc rust-lang#63291

Kixunil · 2019-12-13T09:59:19Z

What are the requirements for stabilizing the Box API?

I don't think using Rc<MaybeUninit<T>> (and Arc) is currently that useful, as Arc happens to be shared (no mutation). However, there's this pattern where one wants to create a value while it's uniquely owned and then share it later. Doing it with Box would mean copying, so I was thinking about having RcMut and ArcMut that work exactly as Box, but reserve space for reference counters, so that when you call share() on them, they just initialize ref count and convert to Rc/Arc. This is however another topic. Did anyone had this idea before or should I start discussion somewhere?

SimonSapin · 2019-12-13T12:54:34Z

there's this pattern where one wants to create a value while it's uniquely owned and then share it later

Yes, that’s exactly the idea with having Rc::new_uninit together with Rc::get_mut_unchecked #63292. It would also work with Rc::new_uninit + Rc::get_mut + unwrap, with an unnecessary run-time check (assuming sharing only after this).

Kixunil · 2019-12-16T12:12:21Z

Great! I still think it'd be better to provide safe abstraction for it, but at least it'd be possible to do in a library.

rickvanprim · 2020-03-28T20:00:18Z

Similar to new_zeroed() it would be nice to have a new_zeroed_slice().

josephlr · 2022-10-17T19:07:03Z

Okay, so there is no way to create an array on heap with some length in stable Rust, even in unsafe? -_-"

It is possible to create a zeroed array or slice directly on the heap on Stable Rust and without using unsafe:

const N: usize = 4096*4096;

pub fn make_array() -> Box<[u8; N]> {
    vec![0; N].into_boxed_slice().try_into().unwrap()
}

pub fn make_slice(n: usize) -> Box<[u8]> {
    vec![0; n].into_boxed_slice()
}

This doesn't blow the stack even with opt-level=0.

SimonSapin · 2022-10-17T19:19:59Z

That’s an optimization (based on specialization of a private trait) in current versions of the standard library, but it only works for some types and is not a documented guarantee.

QuineDot · 2023-01-23T04:03:05Z

Regarding the self parameter, using some_box_maybe_uninit.assume_init() currently resolves to MaybeUninit::assume_init, though it at least also triggers 48919. Stabilizing the method with the self receiver will be a breaking change as the return types differ.

thomcc · 2023-01-23T05:09:25Z

and is not a documented guarantee.

This is correct, but it's also true that we'd probably need a very good reason to regress it. If some code's correctness (somehow) relies on this, they should find another method, but for performance reasons I think it's safe enough to rely on.

That said, as you point out it does not work for all types.

The unstable new_uninit feature enables various library APIs to create uninitialized containers, such as `Box::assume_init()`. This is necessary to build abstractions that directly initialize memory at the target location, instead of doing copies through the stack. Will be used by the DRM scheduler abstraction in the kernel crate, and by field-wise initialization (e.g. using `place!()` or a future replacement macro which may itself live in `kernel`) in driver crates. See [1] [2] [3] for background information. [1] Rust-for-Linux/linux#879 [2] Rust-for-Linux/linux#2 [3] rust-lang/rust#63291 Signed-off-by: Asahi Lina <lina@asahilina.net>

The unstable new_uninit feature enables various library APIs to create uninitialized containers, such as `Box::assume_init()`. This is necessary to build abstractions that directly initialize memory at the target location, instead of doing copies through the stack. Will be used by the DRM scheduler abstraction in the kernel crate, and by field-wise initialization (e.g. using `place!()` or a future replacement macro which may itself live in `kernel`) in driver crates. See [1] [2] [3] for background information. [1] Rust-for-Linux#879 [2] Rust-for-Linux#2 [3] rust-lang/rust#63291 Signed-off-by: Asahi Lina <lina@asahilina.net> Reviewed-by: Gary Guo <gary@garyguo.net> Reviewed-by: Andreas Hindborg <a.hindborg@samsung.com> Reviewed-by: Martin Rodriguez Reboredo <yakoyoku@gmail.com> Reviewed-by: Vincenzo Palazzo <vincenzopalazzodev@gmail.com> Link: https://lore.kernel.org/r/20230224-rust-new_uninit-v1-1-c951443d9e26@asahilina.net Signed-off-by: Miguel Ojeda <ojeda@kernel.org>

The unstable new_uninit feature enables various library APIs to create uninitialized containers, such as `Box::assume_init()`. This is necessary to build abstractions that directly initialize memory at the target location, instead of doing copies through the stack. Will be used by the DRM scheduler abstraction in the kernel crate, and by field-wise initialization (e.g. using `place!()` or a future replacement macro which may itself live in `kernel`) in driver crates. Link: #879 Link: #2 Link: rust-lang/rust#63291 Signed-off-by: Asahi Lina <lina@asahilina.net> Reviewed-by: Martin Rodriguez Reboredo <yakoyoku@gmail.com> Reviewed-by: Gary Guo <gary@garyguo.net> Reviewed-by: Andreas Hindborg <a.hindborg@samsung.com> Reviewed-by: Vincenzo Palazzo <vincenzopalazzodev@gmail.com> Link: https://lore.kernel.org/r/20230224-rust-new_uninit-v1-1-c951443d9e26@asahilina.net [Reworded to use `Link` tags] Signed-off-by: Miguel Ojeda <ojeda@kernel.org>

The unstable new_uninit feature enables various library APIs to create uninitialized containers, such as `Box::assume_init()`. This is necessary to build abstractions that directly initialize memory at the target location, instead of doing copies through the stack. Will be used by the DRM scheduler abstraction in the kernel crate, and by field-wise initialization (e.g. using `place!()` or a future replacement macro which may itself live in `kernel`) in driver crates. Link: #879 Link: #2 Link: rust-lang/rust#63291 Signed-off-by: Asahi Lina <lina@asahilina.net> Reviewed-by: Martin Rodriguez Reboredo <yakoyoku@gmail.com> Reviewed-by: Gary Guo <gary@garyguo.net> Reviewed-by: Andreas Hindborg <a.hindborg@samsung.com> Reviewed-by: Vincenzo Palazzo <vincenzopalazzodev@gmail.com> Link: https://lore.kernel.org/r/20230224-rust-new_uninit-v1-1-c951443d9e26@asahilina.net [ Reworded to use `Link` tags. ] Signed-off-by: Miguel Ojeda <ojeda@kernel.org>

The unstable new_uninit feature enables various library APIs to create uninitialized containers, such as `Box::assume_init()`. This is necessary to build abstractions that directly initialize memory at the target location, instead of doing copies through the stack. Will be used by the DRM scheduler abstraction in the kernel crate, and by field-wise initialization (e.g. using `place!()` or a future replacement macro which may itself live in `kernel`) in driver crates. Link: Rust-for-Linux#879 Link: Rust-for-Linux#2 Link: rust-lang/rust#63291 Signed-off-by: Asahi Lina <lina@asahilina.net> Reviewed-by: Martin Rodriguez Reboredo <yakoyoku@gmail.com> Reviewed-by: Gary Guo <gary@garyguo.net> Reviewed-by: Andreas Hindborg <a.hindborg@samsung.com> Reviewed-by: Vincenzo Palazzo <vincenzopalazzodev@gmail.com> Link: https://lore.kernel.org/r/20230224-rust-new_uninit-v1-1-c951443d9e26@asahilina.net [ Reworded to use `Link` tags. ] Signed-off-by: Miguel Ojeda <ojeda@kernel.org>

safinaskar · 2023-05-29T11:40:49Z

API looks very strange. Box::new_uninit returns type, which is different from type this method lives in! new_uninit is located in Box<T>, but returns Box<MaybeUninit<T>>. This is absolute violation of principle of least surprise. I just have read #110715 (comment) and I initially was unable to understand the code. I wondered how Box::<Type>::new_uninit() can possibly work. Then I opened docs and then I was able to understand what is going on.

So, please make signature so:

impl<T> Box<MaybeUninit<T>, Global> {
  fn new_uninit() -> Box<MaybeUninit<T>, Global> { ... }
}

The same applies to other functions, for example, try_new_uninit

The unstable new_uninit feature enables various library APIs to create uninitialized containers, such as `Box::assume_init()`. This is necessary to build abstractions that directly initialize memory at the target location, instead of doing copies through the stack. Will be used by the DRM scheduler abstraction in the kernel crate, and by field-wise initialization (e.g. using `place!()` or a future replacement macro which may itself live in `kernel`) in driver crates. Link: Rust-for-Linux#879 Link: Rust-for-Linux#2 Link: rust-lang/rust#63291 Signed-off-by: Asahi Lina <lina@asahilina.net> Reviewed-by: Martin Rodriguez Reboredo <yakoyoku@gmail.com> Reviewed-by: Gary Guo <gary@garyguo.net> Reviewed-by: Andreas Hindborg <a.hindborg@samsung.com> Reviewed-by: Vincenzo Palazzo <vincenzopalazzodev@gmail.com> Link: https://lore.kernel.org/r/20230224-rust-new_uninit-v1-1-c951443d9e26@asahilina.net [ Reworded to use `Link` tags. ] Signed-off-by: Miguel Ojeda <ojeda@kernel.org>

The unstable new_uninit feature enables various library APIs to create uninitialized containers, such as `Box::assume_init()`. This is necessary to build abstractions that directly initialize memory at the target location, instead of doing copies through the stack. Will be used by the DRM scheduler abstraction in the kernel crate, and by field-wise initialization (e.g. using `place!()` or a future replacement macro which may itself live in `kernel`) in driver crates. Link: Rust-for-Linux/linux#879 Link: Rust-for-Linux/linux#2 Link: rust-lang/rust#63291 Signed-off-by: Asahi Lina <lina@asahilina.net> Reviewed-by: Martin Rodriguez Reboredo <yakoyoku@gmail.com> Reviewed-by: Gary Guo <gary@garyguo.net> Reviewed-by: Andreas Hindborg <a.hindborg@samsung.com> Reviewed-by: Vincenzo Palazzo <vincenzopalazzodev@gmail.com> Link: https://lore.kernel.org/r/20230224-rust-new_uninit-v1-1-c951443d9e26@asahilina.net [ Reworded to use `Link` tags. ] Signed-off-by: Miguel Ojeda <ojeda@kernel.org>

bertptrs · 2023-07-17T05:55:43Z

Regarding the self parameter, using some_box_maybe_uninit.assume_init() currently resolves to MaybeUninit::assume_init, though it at least also triggers 48919. Stabilizing the method with the self receiver will be a breaking change as the return types differ.

This could be resolved by not making Box::assume_init a method but rather an associated function like most of the Rc API.

SimonSapin added T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. B-unstable Feature: Implemented in the nightly compiler and unstable. C-tracking-issue Category: A tracking issue for an RFC or an unstable feature. labels Aug 5, 2019

This was referenced Aug 5, 2019

Tracking issue for {Rc, Arc}::get_mut_unchecked #63292

Open

Add APIs for uninitialized Box, Rc, and Arc. (Plus get_mut_unchecked) #62451

Merged

Centril added the requires-nightly This issue requires a nightly compiler in some way. label Aug 5, 2019

RalfJung mentioned this issue Aug 14, 2019

Tracking issue for RFC 1892, "Deprecate uninitialized in favor of a new MaybeUninit type" #53491

Closed

3 tasks

Centril mentioned this issue Aug 14, 2019

Meta tracking issue for RFC 1892, "Deprecate uninitialized in favor of a new MaybeUninit type" #63566

Closed

7 tasks

RalfJung mentioned this issue Aug 14, 2019

Tracking issue for #![feature(maybe_uninit_slice)] #63569

Open

1 task

TimDiekmann mentioned this issue Oct 5, 2019

Ban zero-sized allocations rust-lang/wg-allocators#16

Closed

SimonSapin added a commit to SimonSapin/rust that referenced this issue Oct 6, 2019

Fix zero-size uninitialized boxes

23d3ff1

Requesting a zero-size allocation is not allowed, return a dangling pointer instead. CC rust-lang#63291 (comment)

SimonSapin mentioned this issue Oct 6, 2019

Fix zero-size uninitialized boxes #65174

Merged

emilio mentioned this issue Nov 5, 2019

alloc: Add new_zeroed() versions like new_uninit(). #66128

Merged

Mark-Simulacrum mentioned this issue Dec 10, 2019

Consider adding Box::uninitialized function #46406

Closed

TimDiekmann mentioned this issue Jan 30, 2020

Is something like realloc_zeroed and grow_in_place_zeroed useful? rust-lang/wg-allocators#14

Closed

VictorKoenders mentioned this issue Aug 17, 2022

[WIP] Deny OOM, embrace try_reserve bincode-org/bincode#448

Open

10 tasks

ojeda mentioned this issue Sep 22, 2022

Rust unstable features needed for the kernel Rust-for-Linux/linux#2

Open

71 tasks

AlexTMjugador mentioned this issue Oct 8, 2022

Publishing to crates.io? (I'll help!) ComunidadAylas/vorbis-rs#3

Closed

Thomasdezeeuw mentioned this issue Apr 17, 2023

A10 on stable Rust Thomasdezeeuw/a10#63

Closed

10 tasks

koute mentioned this issue Jun 19, 2023

Use Box::new_ununit etc. once that's stabilized paritytech/parity-scale-codec#458

Open

AlexTMjugador mentioned this issue Jul 3, 2023

Use slices rather than vectors in ZopfliHash (std only) zopfli-rs/zopfli#27

Merged

tehmatt mentioned this issue Sep 8, 2023

Support stable rust codx-dev/msgpacker#13

Open

Dylan-DPC mentioned this issue Mar 4, 2024

Tracking issues for unstable library features used by std #94971

Open

32 tasks

DCNick3 mentioned this issue Mar 6, 2024

Avoid using default in HashBuffers::reset Frommi/miniz_oxide#147

Closed

egkoppel mentioned this issue Mar 10, 2024

List of nightly features required popcorn-2/popcorn-2#74

Open

18 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tracking issue for uninitialized constructors for Box, Rc, Arc #63291

Tracking issue for uninitialized constructors for Box, Rc, Arc #63291

SimonSapin commented Aug 5, 2019 •

edited

TimDiekmann commented Oct 5, 2019 •

edited

SimonSapin commented Oct 5, 2019

TimDiekmann commented Oct 5, 2019 •

edited

SimonSapin commented Oct 6, 2019

Kixunil commented Dec 13, 2019

SimonSapin commented Dec 13, 2019

Kixunil commented Dec 16, 2019

rickvanprim commented Mar 28, 2020

josephlr commented Oct 17, 2022

SimonSapin commented Oct 17, 2022

QuineDot commented Jan 23, 2023

thomcc commented Jan 23, 2023

safinaskar commented May 29, 2023 •

edited

bertptrs commented Jul 17, 2023

Tracking issue for uninitialized constructors for Box, Rc, Arc #63291

Tracking issue for uninitialized constructors for Box, Rc, Arc #63291

Comments

SimonSapin commented Aug 5, 2019 • edited

TimDiekmann commented Oct 5, 2019 • edited

SimonSapin commented Oct 5, 2019

TimDiekmann commented Oct 5, 2019 • edited

SimonSapin commented Oct 6, 2019

Kixunil commented Dec 13, 2019

SimonSapin commented Dec 13, 2019

Kixunil commented Dec 16, 2019

rickvanprim commented Mar 28, 2020

josephlr commented Oct 17, 2022

SimonSapin commented Oct 17, 2022

QuineDot commented Jan 23, 2023

thomcc commented Jan 23, 2023

safinaskar commented May 29, 2023 • edited

bertptrs commented Jul 17, 2023

SimonSapin commented Aug 5, 2019 •

edited

TimDiekmann commented Oct 5, 2019 •

edited

TimDiekmann commented Oct 5, 2019 •

edited

safinaskar commented May 29, 2023 •

edited