`nil` or `null` values #30

benolee · 2013-02-24T05:27:22Z

(moved discussion from #11)

It seems like nil or null values must be allowed. For example,

# is this equivalent to {"foo":null,"bar":{"baz":null}} or {} ?
[foo]
[bar.baz]

in this case, it seems like it would make sense to be able to set them with the normal key = value syntax. Here are some alternatives for thought:

key = nil
key = null
key = # empty value ala bash

The text was updated successfully, but these errors were encountered:

aaronblohowiak · 2013-02-24T05:41:42Z

you don't need nil or null, just leave out that assignment.

mojombo · 2013-02-24T05:46:07Z

Yeah, I'm not convinced of the usefulness of nil. TOML is intended for configuration, at which point @aaronblohowiak is right: just leave it out. I'm open to use cases and further convincing, though.

benolee · 2013-02-24T05:53:49Z

no further arguments here. I think it might be important to implementers to know if an empty "key group" should result in a key with no value (ie. nil or null depending on the language) or no key at all

mojombo · 2013-02-24T07:12:36Z

@benolee Ah, good question. I'm going to say it should be an empty hash table.

Iron-E · 2021-04-08T16:55:12Z

Yeah, I'm not convinced of the usefulness of nil. TOML is intended for configuration, at which point @aaronblohowiak is right: just leave it out. I'm open to use cases and further convincing, though.

It's possible I'm using TOML for the wrong thing here, but I was going to use TOML as a bridge for query languages in my CLInvoice crate since I already have toml as a dependency for parsing user configurations, and it is a dead-simple markup language which I believe users would be able to learn without much trouble.

The reason for this is that CLInvoice is designed to be able to handle any permanent storage facility, whether or not it actually has a Structured Query Language of its own. Because of this, I needed to create a unified query 'language' based on the model and what operations made sense for it. Writing an adapter for CLInvoice explicitly provides support for this query 'language'.

Querying in CLInvoice is built on the backbone of the Match type, which can accept a list of types for HasNone, HasAny, or HasAll operations. Some types give Match values which may be None. For example, querying an InvoiceDate requires specifying an Option<chrono::DateTIme<chrono::Local>> for its paid field. If I were using toml, that means that TOML would have to be able to accept a list containing None/Nil and/or a concrete date, like so:

# rest of `InvoiceDate` query left out for simplicity's sake

[paid]
condition = 'HasAny'
value = [
  2020-04-01T03:00:00Z,
  None
]

The above would be quivalent to the English statement "match InvoiceDates that are either unpaid or were paid on 2020/04/01 at 3:00 UTC."

Obviously, for Match operations such as EqualTo that only accept one value, just leaving the value out is good enough to imply a None. But in list types there isn't a good way to specify a None in a given position.

~~Right now I'm thinking of switching to YAML~~ I've switched to YAML, but YAML has some of its own issues (such as its fear of tabs and embedded types using way too much indentation). Accepting None in TOML would be very handy for the odd case such as this!

I had considered nom but writing a DSL for this project seems like it would lead to less ROI than serializing / deserializing a model + helpful errors in this case.

This is because TOML does not support `None` in lists. We need to be able to tell the difference between `Some` and `None` in list items in order to support `Match<'_, Option<_>>`. SEE toml-lang/toml#30

eksortso · 2021-04-09T02:42:49Z

@Iron-E Although there won't be a None or Nil added to TOML (as far as I can see), you do have an option to use within the TOML syntax that would fit the bill. If you would never need to represent a hashmap value (and few relational database columns in this world store whole hashmaps), you could use an empty inline table to express a NULL value in your value list. For example, the sample you provided could be changed to look like this:

[paid]
condition = 'HasAny'
value = [
  2020-04-01T03:00:00Z,
  {}  # This represents NULL in the value list.
]

It's not a literal null, but it would do the trick. You could also use any value that isn't a datetime, like false, if you'd rather have something more lightweight than a table here.

In any case, you would need to handle non-datetime values gracefully, but you would need to do that with any hypothetical NULL anyway.

danhje · 2021-05-06T14:00:10Z

Yeah, I'm not convinced of the usefulness of nil. TOML is intended for configuration, at which point @aaronblohowiak is right: just leave it out. I'm open to use cases and further convincing, though.

Here's a use case: You want to read config from a TOML file, interpreting values as defaults, but you want environment variables with the same name to be able to override the defaults. An empty field thus indicates to your code that it must look in the environment, and it indicates to the user that an environment variable must be set.

[config]
my_db_host = '127.0.0.1'
my_db_user = 'user'
my_db_pass

eksortso · 2021-05-08T13:36:58Z

@danhje Based on what you wrote, this use case has no real "defaults," a.k.a. values that are used in the absence of all other settings. Everything is set by environment variables first and foremost, followed by the settings in the TOML configuration file. Any missing setting must certainly lead to an error.

What this use case needs is just documentation. No value, not even an explicit null, would indicate that my_db_pass must be assigned by an environment variable. Worse, users may consider an explicit null to be a legitimate value for a password. An explicit null is equivalent to a missing setting, so why use an explicit null? In any case, you must explain your intention for password assignment, which is what comments are for. Or external documentation, if you don't want configuration comments.

Here's a pattern for this use case. This configuration would come with the installation for the users to fill out. All equivalent environment variable settings appear next to the configuration setting.

[config]
# Environment variable settings override the values here.

# Database host (Env: MY_DB_HOST)
my_db_host = '127.0.0.1'

# Database user account (Env: MY_DB_USER)
my_db_user = 'user'

# Database password cannot be set here.
# Required Env: MY_DB_PASS

danhje · 2021-05-08T17:08:44Z

Nobody reads documentation, and config comments are ugly. But more importantly, in my use case it’s not just about signaling to the user what variables are expected, I also want access to “empty” variables in code.

Consider how docker-compose interprets empty variables to mean that the variable should be mirrored from the host’s environment. In that case, leaving out the variable or using a comment isn’t an option. Docker composes uses yaml, and my understand is that leaving the value out really just results in an empty string, not a null, which I suppose is fine for my use case.

Here’s my use case, in a little bit more detail:

I want to create a variable / secret managing library for Python. The library is meant to be used for app development in large teams, where it’s difficult for each developer to keep track of all the environment variables that have to be set in order for the code to work. I want the users of my library to be able to centrally manage all these variables in a config file that could either be included or excluded from version control. I want the library to be able to give a friendly warning to a developer if a variable doesn’t have a default and isn’t found in the environment. So if you as a developer pulls down some commit where a fellow developer, unbeknownst to you, have introduced new variables that need to be set, you’ll find out about it right away rather than when the app fails unexpectedly, possible with a not so helpful error.

When working in interactive mode, I also want tab completion to present you with all variables from the config file, both set and unset.

I could force the users of my library to list expected variables in code rather than a config file, including default values, but this breaks the separation of config and code. In a project with hundreds of code files it also makes it harder to track down those expected variables, and it’s hard to enforce a central location for them.

If there’s a clever solution I haven’t thought of, I’d love to hear about it. But I think I’ll just use yaml instead. Which is a shame, since I was hoping to allow using pyproject.toml.

marzer · 2021-05-31T17:11:35Z

@albertotb if that second example parses OK in some implementation you're using you should file a bug report because it should not

albertotb · 2021-05-31T17:32:05Z

@albertotb if that second example parses OK in some implementation you're using you should file a bug report because it should not

It seems it was fixed in the latest Python implementation (0.10.2)

ghost · 2021-07-17T00:38:23Z

Nobody reads documentation, and config comments are ugly. But more importantly, in my use case it’s not just about signaling to the user what variables are expected, I also want access to “empty” variables in code.

Can't you just use config.get("value") which will automatically fall back to None? Or does your use case require differentiating between missing and null values?

… toml value cannot and won't accept null value. see toml-lang/toml#30 (comment)

* github action: fix event payload type of repository dispatch, because toml value cannot and won't accept null value. see toml-lang/toml#30 (comment) * fix repository dispatch problems. now ci::filter_workflow directly returns config::runtime::Workflow * separate run/boot handler * apply revision/silent/verbosity correctly

jonaslb · 2022-10-31T16:53:18Z

Question regarding this: Instead of allowing null/nil/none, can it be specified what parsers "should" do by default if they see a null/nil/none value anyway? Ie. I think the standard should recommend to simply omit them from the serialized toml document, or throw a type error, or to use a magic stringified value (id hope not), or something else (maybe putting an empty commented line with the key but no value?).

This is relevant since libraries are making different decisions on this. E.g. Fatal1ty/mashumaro#85 or samuelcolvin/rtoml#23. The latter is interesting - it claims to be fully compliant and pass all the toml tests - but apparently if stringifies null values, somehow (wrongly I assume) indicating that this is the right thing to do.

ChristianSi · 2022-11-04T09:14:37Z

@jonaslb TOML parsers will never see a null value, since those don't exist in TOML files. What you mean is a TOML serializer/writer.

About giving advice for them on how to represent types that don't map cleanly to a TOML type: I'm a bit skeptical about this since it might vary a lot on the use case. In the general case, "throw an error" is probably indeed the best course of action. But there may be applications, where, say, calling a to_dict() method on objects that have it and then serializing the result as a TOML table is entirely appropriate.

So I think the general rule is: when writing a serializer, document how it handles unexpected types.

And for TOML users: the best course of action is certainly not to pass any unexpected objects to your TOML writer in the first place. But if you want/need to do so anyway, make sure that it handles them in a way you consider appropriate.

salim-b · 2023-09-12T00:52:05Z

TOML is intended for configuration, at which point @aaronblohowiak is right: just leave it out. I'm open to use cases and further convincing, though.

Here's a use case: Layered configuration with global default config (read from, say, /etc/config/my-app.toml) and user settings that override/complement the defaults (read from, say, /home/user/.config/my-app/config.toml). In this scenario, it's currently impossible for the user to unset a default value. null would allow this.

marzer · 2023-09-12T11:34:56Z

@salim-b

The snippet you've quoted:

I'm open to use cases and further convincing, though.

Was written over ten years ago. There has been considerable deliberation on this point in the intervening years (including people giving examples exactly like yours), and sentiment has coalesced pretty firmly around "nulls are bad, actually" (see discussions in #146, #802, #803, #921, #975).

levkk · 2023-10-05T17:41:53Z

There is one good use case for nulls in TOML configuration files: sane defaults. Bear with me here.

Imagine that you have a setting like connect_timeout in your software that configures how long your application should wait before giving up on connecting to a server. Super important setting because servers go down all the time, doesn't mean your app should too. If you're distributing this app, you'd want to help your users by setting it to a value that's reasonable to use in production, e.g. 30 seconds. So you get the following definition:

#[derive(Serialize, Deserialize)]
pub struct Config {
    #[serde(default = "Config::default_config_timeout")]
    connect_timeout: u64,
}

impl Config {
    fn default_config_timeout() -> u64 {
        1000 * 30 // 30 seconds in milliseconds
    }
}

Everything is great and right in the world. If your users want to set it higher or lower, they can just:

connect_timeout = 1000

and everyone is happy.

But what if your users don't want a connect timeout? Their network is slow, they know it and they are in no rush, and why would they want to throw errors to their users when they know things will take a while? Their option is to either set it to a super large value like 1 year in milliseconds, which...well, works in practice, until you actually want to wait 1 year for something and Christmas day comes and your on-call gets a nasty page about an error they have never seen before, or for your software to support weird values like -1 which then require additional documentation and changing the obviously unsigned integer to a signed one just to store a negative number for one use case.

But what if we could set it to null instead?

connect_timeout = null

means there is no connect timeout and the app should wait forever, as desired by the user. Nulls are valid values in databases, software code and life in general: they mean there is nothing here, and that's how we like it.

marzer · 2023-10-05T18:10:54Z

or for your software to support weird values like -1 which then require additional documentation

Would it, though? Your software would need exactly the same amount of documentation regardless of what you chose for a sentinel, be it -1, null, nil, 0, or whatever else you can imagine. In all cases it's a single value that has special meaning, and would require exactly the same kind of verbiage. null isn't somehow special in this regard.

levkk · 2023-10-05T18:37:32Z

I think my main concern is using incorrect types, e.g. i64 can store an order of magnitude less values in it just so I can store a -1. Also someone could set it to -500 and the compiler wouldn't complain. We would have to validate it with logic. Meanwhile, a Duration::from_millis(config.connect_timeout) is validated by the compiler.

marzer · 2023-10-05T18:54:24Z

We would have to validate it with logic. Meanwhile, a Duration::from_millis(Meanwhile, a Duration::from_millis(config.connect_timeout) is validated by the compiler.

You will always need runtime logic, with or without nulls. TOML data is heterogeneous so you can't somehow get compile-time validation without doing type-based logic on lookups first. You need to explicitly specify the type of config.connect_timeout yourself somewhere, which means you need to check that it's a match etc.

levkk · 2023-10-06T01:00:15Z

You need to explicitly specify the type of config.connect_timeout yourself somewhere, which means you need to check that it's a match etc.

Serde will take care of that. By forcing me to change the data type I need to make sure that the value is valid, but before, any value was valid... if deaerialization is successful that is. So by forcing me to change the data type, I need to write more error-prone code.

Whether nulls belong in the TOML spec or not I think is a question of taste to be honest. It's hard for me to know what's the right decision here since your points are valid as well and having explicit nulls in a config file looks weird. That being said, null is a valid value for a data type so excluding it from the spec is not driven by correctness but probably by ergonomics and taste which are fine choices to make but nonetheless force the user to do something the TOML way instead of the optimal way.

marzer · 2023-10-06T08:28:45Z

Yeh, indeed it is a matter of taste. I'd like to clarify something though:

That being said, null is a valid value for a data type

No, it isn't. It's not in the spec, so it's not a valid value. It existing conceptually, or being in other languages, doesn't confer validity in TOML. The canonical way to express (something like) 'null' in TOML is to omit a value, so you still have that option.

instead of the optimal way.

What the 'optimal' way is happens to be a matter of taste too, FYI. IMO the most 'optimal' thing is what requires the least expression in the TOML config file itself - hard to beat "omit this KVP entirely" there.

I do recognize that the lack of null makes interop with other languages a decent bit harder in many cases, but I think it's also important to acknowledge that TOML is a config language first-and-foremost - any serialization concerns are for implementers to worry about, not users. Implementers can always jump through an extra hoop via a helper function (or similar), which is great if it keeps the language simpler for users.

salim-b · 2023-10-06T10:29:53Z

The canonical way to express (something like) 'null' in TOML is to omit a value, so you still have that option.

That is a flawed concept by itself which doesn't work for the described use case (user overrides global default; default is not absent).

What the 'optimal' way is happens to be a matter of taste too, FYI. IMO the most 'optimal' thing is what requires the least expression in the TOML config file itself - hard to beat "omit this KVP entirely" there.

"omit this KVP entirely" is not a universally applicable way to express "undefinition" in TOML, so cannot be a good general best practice recommendation. If there were a null value in TOML, you would still be free to "omit this KVP entirely" instead of explcitily setting it to null for simple use cases. So I don't really see the damage null would bring to the TOML language. My opinion.

pradyunsg · 2023-10-07T00:24:13Z

I'm sorry but I'm going to say that we're not revisiting this design choice at this point and an extended discussion about the consequences of that choice is something that I'd prefer folks have on a new discussion over on https://github.com/toml-lang/toml/discussions instead of in an issue that was closed a decade ago.

s-banach · 2024-03-01T17:48:47Z

People in the 21st century are really still arguing against the concept of zero. "It's not useful to say you have None of something." Lmao.

mojombo closed this as completed in a511658 Feb 24, 2013

This was referenced Feb 24, 2013

EBNF Monocle Party #34

Closed

Allow keys with no (i.e. empty) value #83

Closed

rcarver mentioned this issue Feb 28, 2013

In favor of NULL #146

Closed

rjocoleman mentioned this issue Feb 17, 2014

convert nil to empty string jm/toml#28

Merged

ntrepid8 added a commit to ntrepid8/pytoml that referenced this issue Oct 5, 2015

comment out values of None without crashing, toml-lang/toml#30

b4fe7d0

smith mentioned this issue Feb 21, 2017

Add a handlebars linter to check for undefined variables in hooks habitat-sh/habitat#1779

Open

tro3 mentioned this issue Mar 29, 2017

Reflection-based marshaling / unmarshaling pelletier/go-toml#149

Merged

uiri mentioned this issue Jun 8, 2017

Add support for Local Time uiri/toml#105

Merged

sangaman mentioned this issue May 18, 2018

db password of sample config causing errors ExchangeUnion/xud#77

Closed

bryanforbes mentioned this issue Dec 12, 2018

ValueError/TypeError with path dependency python-poetry/poetry#454

Closed

3 tasks

binarylogic mentioned this issue Mar 20, 2019

File context_key cannot be disabled vectordotdev/vector#151

Closed

gilbsgilbs mentioned this issue Sep 21, 2019

feat(manager): Support poetry custom repositories. renovatebot/renovate#4524

Merged

This was referenced Jun 1, 2020

Netlify validator errors on null netlify/build#1402

Closed

Cannot specify null for publishDirJSONFileName swyxio/netlify-plugin-search-index#18

Closed

lemon24 mentioned this issue Aug 15, 2020

CLI options must be passed all the time lemon24/reader#177

Closed

jszwedko mentioned this issue Nov 11, 2020

enhancement(sources): Allow line aggregation to never timeout vectordotdev/vector#4951

Closed

Roger-luo mentioned this issue Dec 5, 2020

inconsistent empty keys parse/print JuliaLang/TOML.jl#13

Closed

marzer mentioned this issue Jan 5, 2021

Proposal: Null type values for TOML #802

Closed

antalszava mentioned this issue Apr 30, 2021

Remove more analytic occurrences PennyLaneAI/pennylane#1261

Merged

samuelcolvin mentioned this issue Sep 12, 2021

Conversion of None to "null" string inconsistency samuelcolvin/rtoml#23

Open

pwwang mentioned this issue May 12, 2022

Dont dump none entity samuelcolvin/rtoml#31

Closed

umegaya added a commit to suntomi/deplo that referenced this issue Aug 26, 2022

github action: fix event payload type of repository dispatch, because…

5d5f78d

… toml value cannot and won't accept null value. see toml-lang/toml#30 (comment)

Artemis21 mentioned this issue Sep 17, 2022

Add a none type #921

Closed

jonaslb mentioned this issue Oct 30, 2022

Omit None values in TOML instead of throwing TypeError Fatal1ty/mashumaro#85

Closed

pwwang mentioned this issue Nov 12, 2022

Add argument none_value for None representation in loading and dumping samuelcolvin/rtoml#53

Merged

christoph2 mentioned this issue Mar 27, 2023

How to use christoph2/pyxcp#129

Open

mitsuhiko mentioned this issue May 21, 2023

Be Stricter About Lack of Null/None/Nil #975

Closed

Yura52 mentioned this issue May 25, 2023

How can I make a field wrapped in Optional without a default value to be required during deserialization? yukinarit/pyserde#352

Open

ax3l mentioned this issue Aug 17, 2023

TOML Backend openPMD/openPMD-api#1436

Merged

4 tasks

dkoshkin mentioned this issue Feb 15, 2024

fix: set config_path in Containerd config nutanix-cloud-native/cluster-api-runtime-extensions-nutanix#364

Merged

kislyuk mentioned this issue Mar 1, 2024

Raise legible error when trying to emit TOML with null values kislyuk/yq#183

Open

toml-lang locked as resolved and limited conversation to collaborators Apr 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`nil` or `null` values #30

`nil` or `null` values #30

benolee commented Feb 24, 2013

aaronblohowiak commented Feb 24, 2013

mojombo commented Feb 24, 2013

benolee commented Feb 24, 2013

mojombo commented Feb 24, 2013

Iron-E commented Apr 8, 2021 •

edited

Loading

eksortso commented Apr 9, 2021

danhje commented May 6, 2021 •

edited

Loading

eksortso commented May 8, 2021

danhje commented May 8, 2021

marzer commented May 31, 2021

albertotb commented May 31, 2021

ghost commented Jul 17, 2021 •

edited by ghost

Loading

jonaslb commented Oct 31, 2022 •

edited

Loading

ChristianSi commented Nov 4, 2022

salim-b commented Sep 12, 2023 •

edited

Loading

marzer commented Sep 12, 2023 •

edited

Loading

levkk commented Oct 5, 2023

marzer commented Oct 5, 2023 •

edited

Loading

levkk commented Oct 5, 2023

marzer commented Oct 5, 2023 •

edited

Loading

levkk commented Oct 6, 2023 •

edited

Loading

marzer commented Oct 6, 2023 •

edited

Loading

salim-b commented Oct 6, 2023

pradyunsg commented Oct 7, 2023

s-banach commented Mar 1, 2024

nil or null values #30

nil or null values #30

Comments

benolee commented Feb 24, 2013

aaronblohowiak commented Feb 24, 2013

mojombo commented Feb 24, 2013

benolee commented Feb 24, 2013

mojombo commented Feb 24, 2013

Iron-E commented Apr 8, 2021 • edited Loading

eksortso commented Apr 9, 2021

danhje commented May 6, 2021 • edited Loading

eksortso commented May 8, 2021

danhje commented May 8, 2021

marzer commented May 31, 2021

albertotb commented May 31, 2021

ghost commented Jul 17, 2021 • edited by ghost Loading

jonaslb commented Oct 31, 2022 • edited Loading

ChristianSi commented Nov 4, 2022

salim-b commented Sep 12, 2023 • edited Loading

marzer commented Sep 12, 2023 • edited Loading

levkk commented Oct 5, 2023

marzer commented Oct 5, 2023 • edited Loading

levkk commented Oct 5, 2023

marzer commented Oct 5, 2023 • edited Loading

levkk commented Oct 6, 2023 • edited Loading

marzer commented Oct 6, 2023 • edited Loading

salim-b commented Oct 6, 2023

pradyunsg commented Oct 7, 2023

s-banach commented Mar 1, 2024

`nil` or `null` values #30

`nil` or `null` values #30

Iron-E commented Apr 8, 2021 •

edited

Loading

danhje commented May 6, 2021 •

edited

Loading

ghost commented Jul 17, 2021 •

edited by ghost

Loading

jonaslb commented Oct 31, 2022 •

edited

Loading

salim-b commented Sep 12, 2023 •

edited

Loading

marzer commented Sep 12, 2023 •

edited

Loading

marzer commented Oct 5, 2023 •

edited

Loading

marzer commented Oct 5, 2023 •

edited

Loading

levkk commented Oct 6, 2023 •

edited

Loading

marzer commented Oct 6, 2023 •

edited

Loading