toml 0.6.0 no longer allows deserializing a struct with an explicit lifetime #490

msfjarvis · 2023-01-24T05:02:36Z

I've raised #489 as a minimal repro within the existing test suite

epage · 2023-01-24T13:22:40Z

This was an intentional change: because we now use toml_edit for the parser, everything gets converted to an owned String before we deserialize, so borrows of the original source cannot happen.
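
To make the owned-versus-borrowed distinction concrete, here is a minimal sketch using only the standard library. It is not the toml or toml_edit API; `value_borrowed`, `value_owned`, and `points_into` are hypothetical helpers invented for illustration. It shows that once a value has been copied into a `String`, nothing in it refers back to the source buffer, so a `&'a str` field tied to the input has nothing to borrow from.

```rust
// Hypothetical illustration only; none of these functions exist in toml or toml_edit.

/// Returns a slice that points into `src`, so the result is tied to the
/// lifetime of the input.
fn value_borrowed<'a>(src: &'a str) -> Option<&'a str> {
    let (_, rest) = src.split_once('=')?;
    Some(rest.trim().trim_matches('"'))
}

/// Copies the value into a fresh `String`, mirroring a parser that stores
/// owned strings. The result no longer refers to `src`.
fn value_owned(src: &str) -> Option<String> {
    value_borrowed(src).map(str::to_owned)
}

/// True when `part` lies inside the memory occupied by `whole`.
fn points_into(whole: &str, part: &str) -> bool {
    let start = whole.as_ptr() as usize;
    let end = start + whole.len();
    let p = part.as_ptr() as usize;
    p >= start && p + part.len() <= end
}

fn main() {
    let src = String::from("name = \"askama\"");
    let borrowed = value_borrowed(&src).unwrap();
    let owned = value_owned(&src).unwrap();
    // The borrowed value is a view into the source; the owned one is a copy.
    assert!(points_into(&src, borrowed));
    assert!(!points_into(&src, &owned));
    println!("{borrowed} {owned}");
}
```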

msfjarvis · 2023-01-24T13:37:46Z

> This was an intentional change: because we now use toml_edit for the parser, everything gets converted to an owned String before we deserialize, so borrows of the original source cannot happen.

Got it. Is there a documentation change that can be made to make that explicit? I looked through the changelog but nothing stood out as the cause for this which is why I ended up raising this issue.

epage · 2023-01-24T13:53:42Z

The changelog was updated.

djc · 2023-01-24T14:28:07Z

So why does toml_edit convert everything to String?

epage · 2023-01-24T14:38:51Z

toml_edit parses directly into a Document.

In theory, we could add an internal ParsedDocument that gets converted into a Document or used for serde but that is a lot of extra code to maintain and a lot of extra steps and we need a strong enough case to justify it.

In theory, we could also parse into a Document<'de> where we store Cows for the strings but I've generally found putting Cow into API items adds a lot of end-user complexity and we need a strong enough case to justify it.
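
For readers unfamiliar with the idea, a rough sketch of what a `Cow`-based document could look like follows. This is an assumption-laden toy, not toml_edit's actual `Document`: the `Document<'de>` type, its flat key/value layout, and the `parse`, `set`, `get`, and `is_borrowed` methods are all hypothetical. It shows the trade-off under discussion: parsed values can borrow from the input, but the lifetime parameter appears on the public type and edited values become owned.

```rust
use std::borrow::Cow;
use std::collections::BTreeMap;

// Hypothetical type for illustration; toml_edit's real `Document` stores owned strings.
struct Document<'de> {
    entries: BTreeMap<Cow<'de, str>, Cow<'de, str>>,
}

impl<'de> Document<'de> {
    /// Parses `key = "value"` lines, borrowing keys and values from `src`.
    /// (A toy grammar, far simpler than TOML.)
    fn parse(src: &'de str) -> Self {
        let mut entries = BTreeMap::new();
        for line in src.lines() {
            if let Some((key, value)) = line.split_once('=') {
                let value = value.trim().trim_matches('"');
                entries.insert(Cow::Borrowed(key.trim()), Cow::Borrowed(value));
            }
        }
        Document { entries }
    }

    /// Edits made after parsing have no source text to borrow from, so they are owned.
    fn set(&mut self, key: &str, value: String) {
        self.entries.insert(Cow::Owned(key.to_owned()), Cow::Owned(value));
    }

    fn get(&self, key: &str) -> Option<&str> {
        // `&**v` derefs `&Cow<str>` down to `&str`.
        self.entries.get(key).map(|v| &**v)
    }

    fn is_borrowed(&self, key: &str) -> bool {
        matches!(self.entries.get(key), Some(Cow::Borrowed(_)))
    }
}

fn main() {
    let src = "name = \"askama\"\nsyntax = \"default\"";
    let mut doc = Document::parse(src);
    assert_eq!(doc.get("name"), Some("askama"));
    assert!(doc.is_borrowed("name"));

    doc.set("name", String::from("other"));
    assert_eq!(doc.get("name"), Some("other"));
    assert!(!doc.is_borrowed("name"));
    assert!(doc.is_borrowed("syntax"));
}
```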

As an alternative to Cow, we could also parse into enum { Span, String }, like we do with Repr and Decor, but that significantly complicates the API and, for toml_edit users, would require a lot of unnecessary unwraps, so this would also need a strong case to justify it.
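
A small sketch of why a span-or-string representation leads to extra unwraps, assuming a hypothetical `SpanOrString` enum (the name, variants, and `as_str` signature are invented here and are not toml_edit's real `Repr` or `Decor` types). Because a span can only be resolved against the original input, the accessor has to be fallible even for callers who only ever hold explicit strings.

```rust
use std::ops::Range;

// Hypothetical type for illustration; not part of toml_edit.
enum SpanOrString {
    /// Byte range into the original input.
    Span(Range<usize>),
    /// Value set or edited after parsing.
    Explicit(String),
}

impl SpanOrString {
    /// Resolving a span needs the original input, so every caller gets an
    /// `Option` back, even when the value is always `Explicit`.
    fn as_str<'a>(&'a self, input: Option<&'a str>) -> Option<&'a str> {
        match self {
            SpanOrString::Span(range) => input?.get(range.clone()),
            SpanOrString::Explicit(s) => Some(s.as_str()),
        }
    }
}

fn main() {
    let input = "name = \"askama\"";
    // Bytes 8..14 cover the text between the quotes.
    let parsed = SpanOrString::Span(8..14);
    let edited = SpanOrString::Explicit(String::from("other"));

    assert_eq!(parsed.as_str(Some(input)), Some("askama"));
    // Without the input, a span cannot be resolved.
    assert_eq!(parsed.as_str(None), None);
    // An explicit string never needs the input, yet the caller still unwraps.
    assert_eq!(edited.as_str(None).unwrap(), "other");
}
```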

djc · 2023-01-26T10:44:39Z

It seems quite surprising to me to just throw away the ability to zero-copy/low-allocation deserialization when that is one of the main benefits usually offered by serde integrations. Offering a Document<'de> with Cows seems pretty reasonable -- in my experience this doesn't really need to add a lot of end-user complexity.

epage · 2023-01-26T13:10:08Z

> It seems quite surprising to me to just throw away the ability to zero-copy/low-allocation deserialization when that is one of the main benefits usually offered by serde integrations.

Do you have a use case where the benefits of zero-copy / low-allocation deserialization are important? Assuming it's performance-driven, what is the actual performance before and after this change, what is your performance target, and why is that target set?

djc · 2023-01-26T13:32:07Z

I don't currently have such a use case. My usage of toml that prompted these questions was for Askama's configuration file.

AlexTMjugador · 2023-01-31T21:54:50Z

For what it's worth, I'd like to point out that serde_json's from_str just errors out at runtime if it turns out that the struct can't hold owned data because that lifetime is used in a reference like &'a str. serde_json is usually considered a model for serde deserializer and serializer design, although this potential runtime failure, which happens when serde_json unescapes strings, is also considered a footgun for the not-so-experienced.

epage · 2023-01-31T22:15:10Z

The difference between serde_json and toml is that serde_json has a chance to reuse the borrowed data. This distinction is highlighted in the documentation on Deserializer lifetimes


Yuri6037 · 2024-09-18T21:05:58Z

I just had to revert one of my projects to 0.5.11 to be able to deserialize without allocations.

epage · 2024-09-18T21:16:44Z

> I just had to revert one of my projects to 0.5.11 to be able to deserialize without allocations.

What is your use case that you are avoiding allocations?

AlexTMjugador · 2024-09-18T21:16:47Z

@Yuri6037 please note that due to what was stated on #505 and #490 (comment) no-allocation deserialization was not possible with the toml crate anyway, as deserializing e.g. a Cow field would always result in a Cow::Owned. ~~In other words, previous toml crate versions at best provided some placebo that zero-copy deserialization was happening when in fact it was not.~~

Edit: I reviewed the old deserialization code for a bit and indeed it seems it had some logic for handling borrowed and owned strings separately, so it turns out that in previous versions zero-copy string deserialization may indeed have been possible. But with the current design it's indeed no longer possible, even if the DeserializeOwned constraint is relaxed.

Yuri6037 · 2024-09-18T22:04:33Z

> > I just had to revert one of my projects to 0.5.11 to be able to deserialize without allocations.
>
> What is your use case that you are avoiding allocations?

I'm trying to improve performance in one of my apps, which already does far too many allocations.

Yuri6037 · 2024-09-18T22:07:09Z

> @Yuri6037 please note that due to what was stated on #505 and #490 (comment) no-allocation deserialization was not possible with the toml crate anyway, as deserializing e.g. a Cow field would always result in a Cow::Owned. ~~In other words, previous toml crate versions at best provided some placebo that zero-copy deserialization was happening when in fact it was not.~~
>
> Edit: I reviewed the old deserialization code for a bit and indeed it seems it had some logic for handling borrowed and owned strings separately, so it turns out that in previous versions zero-copy string deserialization may indeed have been possible. But with the current design it's indeed no longer possible, even if the DeserializeOwned constraint is relaxed.

I'm not using a Cow, I use from_str with a &str and all models use &'a based fields.

epage · 2024-09-18T22:32:49Z

> I'm trying to improve performance in one of my apps, which already does far too many allocations.

Could you expand on your use case where TOML string allocations are a performance concern in your application?

For context on my perspective, I deal with Cargo's performance, which for certain projects can mean loading hundreds of TOML files. Even in that situation, TOML is a fraction of a fraction of where the time goes, and it is only noticeable in no-op runs. The only reason I care about no-op run performance is shell completions calling into cargo. As with Cargo, TOML is mostly dealt with during application initialization, and most other applications load only a fraction as many TOML files.

> I'm not using a Cow, I use from_str with a &str and all models use &'a based fields.

Unless you are intentionally restricting your string inputs, this will reject perfectly valid TOML files because any escaped values will need an allocation.
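
A minimal sketch of why escapes force an allocation, using only the standard library. The `decode_basic` and `decode_borrowed_only` functions are hypothetical and handle only a small subset of escapes; they are not how the toml crate decodes strings. A string without escapes can be returned as a view into the input, while an escaped one has to be rebuilt in a new buffer, which a borrow-only target such as `&'a str` cannot hold.

```rust
use std::borrow::Cow;

/// Decodes the body of a double-quoted string. Hypothetical helper that
/// supports only `\n`, `\t`, `\"`, and `\\`.
fn decode_basic(body: &str) -> Result<Cow<'_, str>, String> {
    // No escapes: the decoded text is byte-for-byte the input, so borrow it.
    if !body.contains('\\') {
        return Ok(Cow::Borrowed(body));
    }
    // Escapes change the bytes, so the result must live in a new allocation.
    let mut out = String::with_capacity(body.len());
    let mut chars = body.chars();
    while let Some(c) = chars.next() {
        if c != '\\' {
            out.push(c);
            continue;
        }
        match chars.next() {
            Some('n') => out.push('\n'),
            Some('t') => out.push('\t'),
            Some('"') => out.push('"'),
            Some('\\') => out.push('\\'),
            other => return Err(format!("unsupported escape: {other:?}")),
        }
    }
    Ok(Cow::Owned(out))
}

/// Models a `&'a str` field: it can accept only the borrowed case.
fn decode_borrowed_only(body: &str) -> Result<&str, String> {
    match decode_basic(body)? {
        Cow::Borrowed(s) => Ok(s),
        Cow::Owned(_) => Err("escaped string cannot be borrowed from the input".to_string()),
    }
}

fn main() {
    // Plain string: borrowing works.
    assert_eq!(decode_borrowed_only("plain"), Ok("plain"));
    // Valid input with an escape: decoding works, borrowing does not.
    assert_eq!(&*decode_basic("a\\tb").unwrap(), "a\tb");
    assert!(decode_borrowed_only("a\\tb").is_err());
}
```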

epage mentioned this issue Jan 24, 2023

Why was support for deserializing to borrowing types dropped? #492

Closed

epage closed this as not planned Jan 24, 2023

epage added the C-bug (Category: Things not working as expected) and A-serde (Area: Serde integration) labels Jan 24, 2023

epage mentioned this issue Jan 31, 2023

feat: support deserializing types that borrow data in from_str #505

Closed

epage added the C-enhancement (Category: Raise on the bar on expectations) label and removed the C-bug (Category: Things not working as expected) label Feb 7, 2023

epage mentioned this issue Feb 8, 2023

fixed lifetime in toml::de::from_str #514

Closed

epage pinned this issue Feb 8, 2023

knoellle mentioned this issue Jun 13, 2023

Re-Framework HULKs/hulk#182

Merged

epage mentioned this issue Nov 13, 2023

Allow from_str to take Deserialize instead of DeserializeOwned #651

Closed

Skeletonxf added a commit to Skeletonxf/easy-ml that referenced this issue Mar 16, 2024

Update to latest 0.5.x version of toml due to breaking change in 0.6

21a23da

toml-rs/toml#490

epage mentioned this issue Jun 1, 2024

[Bug] de::from_str has overly strict generic restrictions. #739

Closed
