
Int128 support #8373

Closed
jkthorne opened this issue Oct 24, 2019 · 33 comments

@jkthorne
Contributor

jkthorne commented Oct 24, 2019

While using Int128 in a project, I found that it was not behaving as expected. Digging in, I found bug report #7915. I thought this was listed as a TODO on the roadmap, but I don't see it there anymore.

I took over this work from someone who had abandoned it and quickly found that 32-bit support is a massive pain. I don't think it would be too hard to add Int128 for 64-bit machines only. I am currently working on a PR that adds compiler-rt support for 32-bit machines so they can support Int128. I think adding compiler-rt is doable, but I want to make sure that, as a community, we intend to support 32-bit machines.

Here are the PRs where people have started the Int128 work.
#5564
#7093
#8313

Is Crystal going to support 32-bit machines?
I want Int128, but is this a feature that should be supported in Crystal?

@bcardiff
Member

My concern regarding 32 bits is not the i386 arch but 32-bit ARM. Although we currently do not ship much for 32 bits, if we cut the only 32-bit distro we have, it might be hard to get it back.

If we decide to demote 32 bits to a second-class citizen, we will stop shipping the compiler for 32 bits and stop actively fixing things for it. Yet we can keep the existing codebase and accept PRs that make it work, based on contributions.

Some might think that Int128 could be included only on 64-bit machines, but we don't want pure-Crystal apps and shards whose ability to compile depends on the architecture. That is why either:

a) 32 bits supports Int128, or
b) Int128 are removed, or
c) 32 bits support is dropped.

So, voting time?

  • 🎉 for keeping official 32-bit support & distributions and supporting Int128 at all costs.
  • 👀 for keeping 32 bits but dropping Int128 as a feature of the language itself.
  • 🚀 for dropping official support & distributions for 32 bits (and keeping Int128).

@vlazar
Contributor

vlazar commented Oct 24, 2019

Last option means no Crystal on Raspberry Pi with Raspbian? 😞

@Sija
Contributor

Sija commented Oct 24, 2019

@vlazar IIRC Raspbian already comes in a 64-bit flavor (for RPi >= 4), but that would be correct for older devices :(

@vlazar
Contributor

vlazar commented Oct 24, 2019

@Sija You are probably right and I have Pi3 :(

> Note: We're focusing on Pi4 here. In theory the kernel does run on Pi3, but that's not our priority.

https://www.raspberrypi.org/forums/viewtopic.php?f=29&t=250730

@straight-shoota
Member

I'm in favour of supporting Int128 on 32-bit architectures, but not at all costs. If it requires so much effort that it keeps us away from working on other things, I suppose dropping 32-bit is reasonable.

However, I'm also fine with a delayed resolution, meaning we won't implement Int128 for 32-bit right away but still keep 32-bit as a primary architecture for now (because it's costly to drop it and bring it back later). We'd still have to decide later whether we want to keep 32-bit support up and invest in implementing Int128, or drop the architecture.

We should also consider whether there are other features besides Int128 which might make 32-bit more costly to support. This would impact the decision on Int128.

@oprypin
Member

oprypin commented Oct 24, 2019

I vote 😕 for discarding Int128 on 32-bit and ensuring Crystal works without it.

Which might be the status quo.

And yes, I see that this was excluded intentionally, I just disagree.

@bcardiff
Member

> for keeping 32 bits official support & distributions and support Int128 at all costs.

@straight-shoota That means either embedding or porting compiler-rt. Both have their tradeoffs; I prefer porting, at least as an initial default implementation, since it will likely be more portable. But going in this direction raised some questions. We need to be on the same page, or accept the initial direction.

> discarding int128 on 32bit and ensuring Crystal works without it.

That is not an option, @oprypin, because it is actually the current status: Int128 is not a first-class citizen. There are no literals. Shards and apps that use the public std-lib API compile or not depending on the arch.

We are trying to have a clear message on this feature: either it works on the main supported platforms, or it is removed.

If we remove 32 bits and complete the support for Int128, cross-compiling from 64-bit to 32-bit should still be possible as long as compiler-rt is linked, I think. So it might not be that terrible, but those 32-bit builds, at least initially, would not be managed by us in this scenario.
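For context on what "as long as compiler-rt is linked" buys you: on targets without native 128-bit arithmetic, LLVM lowers operations like `Int128 * Int128` into calls to compiler-rt runtime builtins (`__multi3`, `__udivti3`, and friends). The schoolbook idea behind such a multiply can be sketched in C as follows; the two-limb `u128` struct and the `mul128` name are hypothetical helpers for illustration, not compiler-rt's actual types:

```c
#include <stdint.h>

/* 128-bit value as two 64-bit limbs (lo, hi). Hypothetical helper type,
 * not the actual compiler-rt representation. */
typedef struct { uint64_t lo; uint64_t hi; } u128;

/* Schoolbook multiplication of two 128-bit values modulo 2^128 -- the
 * same job compiler-rt's __multi3 performs when LLVM can't emit a
 * native 128-bit multiply. Sketch only. */
static u128 mul128(u128 a, u128 b)
{
    /* Split the low limbs into 32-bit halves so every partial
     * product fits in 64 bits. */
    uint64_t a_lo32 = a.lo & 0xffffffffu, a_hi32 = a.lo >> 32;
    uint64_t b_lo32 = b.lo & 0xffffffffu, b_hi32 = b.lo >> 32;

    uint64_t p0 = a_lo32 * b_lo32;
    uint64_t p1 = a_lo32 * b_hi32;
    uint64_t p2 = a_hi32 * b_lo32;
    uint64_t p3 = a_hi32 * b_hi32;

    /* Middle column of the 64x64 -> 128 product of the low limbs. */
    uint64_t mid = (p0 >> 32) + (p1 & 0xffffffffu) + (p2 & 0xffffffffu);

    u128 r;
    r.lo = (mid << 32) | (p0 & 0xffffffffu);
    r.hi = p3 + (p1 >> 32) + (p2 >> 32) + (mid >> 32)
         + a.lo * b.hi + a.hi * b.lo;   /* cross terms, mod 2^64 */
    return r;
}
```

Division (`__udivti3`) and the overflow-checked variants are considerably more involved, which is part of why completing the port is nontrivial.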

@asterite
Member

I don't understand why the decisions are mixing Int128 and 32-bit support. Can we decide whether we want Int128 or not, and separately whether we want to support 32-bit platforms? My vote is to remove Int128. I still don't know about 32-bit platforms.

@bcardiff
Member

There is a bit of tension between the two, together with the overflow semantics. Overflow of Int64 on 32-bit platforms has the same issue, but since it's on a smaller scale nobody complained much. Still, the criterion could be the same: porting mulodi4 to Crystal was fine, but porting all the other builtins needed for Int128 is not?

Dropping 32 bits allows us to easily use 128 bits in the compiler itself, which might otherwise become a bottleneck for 32 bits later; but as long as compiler-rt is linked, maybe we are fine.
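To make the comparison concrete: mulodi4 (compiler-rt's `__mulodi4`) is a 64-bit multiply that also reports overflow, which is what overflow-checked `Int64 * Int64` needs on 32-bit targets. A hedged C sketch of the idea (not the actual compiler-rt or Crystal source):

```c
#include <stdint.h>

/* Sketch of an overflow-checking 64-bit multiply, in the spirit of
 * compiler-rt's __mulodi4: returns a*b (wrapping) and sets *overflow
 * to 1 when the signed result does not fit in 64 bits. */
int64_t checked_mul64(int64_t a, int64_t b, int *overflow)
{
    /* Compute the wrapping product via unsigned math to avoid
     * signed-overflow UB. */
    uint64_t ur = (uint64_t)a * (uint64_t)b;
    int64_t r = (int64_t)ur;
    *overflow = 0;
    if (a == 0 || b == 0)
        return 0;
    /* INT64_MIN has no positive counterpart: any multiplier other
     * than 1 (or 0, handled above) overflows. */
    if (a == INT64_MIN)
        *overflow = (b != 1);
    else if (b == INT64_MIN)
        *overflow = (a != 1);
    else if (r / b != a)  /* an exact product satisfies r / b == a */
        *overflow = 1;
    return r;
}
```

Int128 needs a whole family of such builtins (multiplication, division, modulo, shifts, conversions), which is where the porting effort adds up.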

@watzon
Contributor

watzon commented Dec 23, 2020

Is support for Int128 in the parser still planned before v1.0? I've noticed there hasn't been much traction here for a while, but having support for Int128 would really help me reduce my reliance on BigInt in a project I'm working on.

@asterite
Member

No.

@jkthorne
Contributor Author

A lot of work has been done on this, and there have been a few working PRs. They are too hard to review, and breaking the work into pieces has led to people not really being satisfied enough to merge it. Because the work is so old it probably has to be mostly redone, but I don't think this will get merged without a plan and support from someone on the core team.

@straight-shoota
Member

The way I understand it, we're ultimately missing a decision on how to move on with 128-bit integers and 32-bit platforms.

The options are as far as I can see (ordered by amount of 128-bit integer support):
a) improve compiler-rt to fully support 128-bit integers on 32-bit platforms
b) limited support for 128-bit integers on 32-bit platforms, but keep platform support
c) drop official support for 32-bit platforms
d) no full 128-bit integer support on any platform

I think d) is pretty much out of the question; I just included it for completeness. There's a broad consensus that Crystal should have support for 128-bit integers.

All the other options seem viable to me. a) would certainly be the best solution, but it also requires a lot of work. With b), it's debatable whether we could accept supporting different language features on different platforms.
But IMO it wouldn't be that bad. 128-bit integers aren't a typical everyday feature, so the impact is negligible. I would also see b) as a good intermediary option which can later resolve to a) if there's demand and it gets implemented, or ultimately to c) if it gets no traction.

Not sure if this is a good idea, but what about adding 128-bit integers behind a compile-time flag? That would preserve the premise that Crystal code compiles independent of the platform by default, but you could use 128-bit integers on supported platforms if you opt in.

@asterite
Member

> I think d) is pretty much out of the question

Why? I added Int128 support on a whim: one day I was bored and wanted to see how easy or hard it was. It worked right away... and then it turned out that it's really hard to achieve properly.

If I had woken up with a different mood that day, there would be no Int128 support and nobody would be talking about it. Or at least talking about how to move it forward (maybe about how to initially implement it, but there wouldn't be anyone to do it).

Go is extremely popular and it doesn't have Int128. Surely Int128 is not really needed? Of course it's useful, but I'm not sure how much.

I wouldn't discard option d, even though everyone else discards it immediately.

Again, I regret my past self's decision, and I wish I hadn't implemented it in the first place.

@straight-shoota
Member

Fair enough. It would nevertheless be one of the least appreciated options ;)

@jkthorne
Contributor Author

I mean I would not be too upset. I think it is a really cool feature and I got involved because I found a use for it. If we found a path forward I would be interested in contributing and using it. You should wake up and do more of these.

@watzon
Contributor

watzon commented Dec 24, 2020

Personally I would love just to have support for arbitrarily sized integers like Zig is doing, but since that's probably out of the question I'd settle for just having Int128 be completely supported. If someone could give a basic breakdown of what would be required to get it completely supported I'd even be willing to put the work in myself.

I guess the big question is how to come to a consensus on which approach is the right one. Obviously option a would be great, but it wouldn't be terrible to have limited or no support on 32-bit platforms. That could always change later; in the meantime, most everyone is probably developing and shipping on 64-bit platforms anyway.

I guess the other option could be to make BigInt more of a first class citizen by supporting it in the parser somehow and making sure that you can do anything with BigInts that can be done with the Int primitives, but that seems like a bigger overall change.

@delef
Contributor

delef commented Aug 19, 2021

Why do you want to keep supporting 32-bit platforms? They are dead.

@jkthorne
Contributor Author

@delef That is a good point.
I thought there were more 32-bit platforms, between Raspberry Pis and embedded systems. I guess the real question is: which 32-bit targets do we want to cover?

@watzon
Contributor

watzon commented Aug 22, 2021

32 bit is definitely not dead. Dying, yes, but not dead.

@delef
Contributor

delef commented Aug 22, 2021

@watzon Where is it used besides old Raspberries?

@lbguilherme
Contributor

WebAssembly is 32-bit.

@watzon
Contributor

watzon commented Aug 23, 2021

@delef Embedded systems, microcomputers like the Raspberry Pi, WASM as @lbguilherme mentioned, and plenty of older hardware. Even if 32-bit platforms aren't generally being produced anymore, that doesn't mean they don't still exist and aren't in use. It's going to be a while yet before 32-bit is truly dead.

@oprypin
Member

oprypin commented Dec 12, 2021

There is an important aspect of Int128 support to discuss: what happens with large literals that don't have a suffix specified.

Should literals implicitly be allowed to become Int128 if they're big enough, like it is for Int64 currently?
For some reason I think they shouldn't and that the error should say "if you're sure you want i128, add the corresponding suffix", because that type is kind of a big commitment to work with, and programmers should know that.

Test code for literals, referenced below:

```crystal
macro t(expr)
  %x = {{`python -c "print(#{expr.id})"`}}
  puts "| #{%x.to_s.rjust(40)} | #{{{expr}}.ljust(10)} | #{typeof(%x).to_s.rjust(7)} |"
end
```

What we have now

Test code:

```crystal
t "2**31 - 1"  # Int32

t "2**31"      # Int64
t "2**32 - 1"  # Int64
t "2**32"      # Int64
t "2**63 - 1"  # Int64

t "2**63"      # UInt64
t "2**64 - 1"  # UInt64

#t "2**64"     # error
```

| literal | expr | type |
| --- | --- | --- |
| 2147483647 | 2**31 - 1 | Int32 |
| 2147483648 | 2**31 | Int64 |
| 4294967295 | 2**32 - 1 | Int64 |
| 4294967296 | 2**32 | Int64 |
| 9223372036854775807 | 2**63 - 1 | Int64 |
| 9223372036854775808 | 2**63 | UInt64 |
| 18446744073709551615 | 2**64 - 1 | UInt64 |
| | 2**64 | error |

Alternative 1 🎉

(What #11571 implements)

Test code:

```crystal
t "2**31 - 1"  # Int32

t "2**31"      # Int64
t "2**32 - 1"  # Int64
t "2**32"      # Int64
t "2**63 - 1"  # Int64

t "2**63"      # UInt64
t "2**64 - 1"  # UInt64

t "2**64"      # Int128
t "2**127 - 1" # Int128

t "2**127"     # UInt128
t "2**128 - 1" # UInt128

#t "2**128"    # error
```

| literal | expr | type |
| --- | --- | --- |
| 2147483647 | 2**31 - 1 | Int32 |
| 2147483648 | 2**31 | Int64 |
| 4294967295 | 2**32 - 1 | Int64 |
| 4294967296 | 2**32 | Int64 |
| 9223372036854775807 | 2**63 - 1 | Int64 |
| 9223372036854775808 | 2**63 | UInt64 |
| 18446744073709551615 | 2**64 - 1 | UInt64 |
| 18446744073709551616 | 2**64 | Int128 |
| 170141183460469231731687303715884105727 | 2**127 - 1 | Int128 |
| 170141183460469231731687303715884105728 | 2**127 | UInt128 |
| 340282366920938463463374607431768211455 | 2**128 - 1 | UInt128 |
| | 2**128 | error |

It is fully backwards-compatible and follows the pattern of inserting a UInt type where possible: 2**127 is the same kind of switchover as 2**63.
What I have a problem with is that, according to that pattern, 2**31 should also have been a point of switchover, to UInt32!
But that's not really what I want to see, and I'm not even going to list it as an alternative.
My main gripe is actually that 2**63 shouldn't have been allowed, and should now become a point of switchover directly to Int128. Or an explicit error, as I mentioned at the beginning.

Alternative 2 😕

(purely hypothetical, backwards-incompatible part shown in bold)

| literal | expr | type |
| --- | --- | --- |
| 2147483647 | 2**31 - 1 | Int32 |
| 2147483648 | 2**31 | Int64 |
| 4294967295 | 2**32 - 1 | Int64 |
| 4294967296 | 2**32 | Int64 |
| 9223372036854775807 | 2**63 - 1 | Int64 |
| 9223372036854775808 | 2**63 | **Int128** |
| 18446744073709551615 | 2**64 - 1 | **Int128** |
| 18446744073709551616 | 2**64 | Int128 |
| 170141183460469231731687303715884105727 | 2**127 - 1 | Int128 |
| 170141183460469231731687303715884105728 | 2**127 | error, add u suffix |
| 340282366920938463463374607431768211455 | 2**128 - 1 | error, add u suffix |
| | 2**128 | error |

That was kind of an idealistic alternative, because it commits the biggest sin in terms of backwards compatibility: the change in behavior isn't an error and is very implicit.

Alternative 3 ❤️

(purely hypothetical, backwards-incompatible part shown in bold)

| literal | expr | type |
| --- | --- | --- |
| 2147483647 | 2**31 - 1 | Int32 |
| 2147483648 | 2**31 | Int64 |
| 4294967295 | 2**32 - 1 | Int64 |
| 4294967296 | 2**32 | Int64 |
| 9223372036854775807 | 2**63 - 1 | Int64 |
| 9223372036854775808 | 2**63 | **error, add u64 or i128 suffix** |
| 18446744073709551615 | 2**64 - 1 | **error, add u64 or i128 suffix** |
| 18446744073709551616 | 2**64 | error, add i128 suffix |
| 170141183460469231731687303715884105727 | 2**127 - 1 | error, add i128 suffix |
| 170141183460469231731687303715884105728 | 2**127 | error, add u128 suffix |
| 340282366920938463463374607431768211455 | 2**128 - 1 | error, add u128 suffix |
| | 2**128 | error |

I think this is the state we should eventually be in.

Alternative 4 🚀

(purely hypothetical, compatible compromise option,
with likely transition to alternative 3 for Crystal 2.0)

| literal | expr | type |
| --- | --- | --- |
| 2147483647 | 2**31 - 1 | Int32 |
| 2147483648 | 2**31 | Int64 |
| 4294967295 | 2**32 - 1 | Int64 |
| 4294967296 | 2**32 | Int64 |
| 9223372036854775807 | 2**63 - 1 | Int64 |
| 9223372036854775808 | 2**63 | UInt64, in 1.4 produces a warning |
| 18446744073709551615 | 2**64 - 1 | UInt64, in 1.4 produces a warning |
| 18446744073709551616 | 2**64 | error, add i128 suffix |
| 170141183460469231731687303715884105727 | 2**127 - 1 | error, add i128 suffix |
| 170141183460469231731687303715884105728 | 2**127 | error, add u128 suffix |
| 340282366920938463463374607431768211455 | 2**128 - 1 | error, add u128 suffix |
| | 2**128 | error |

Vote, if you'd like.
1:tada: | 2:confused: | 3:heart: | 4:rocket:

@straight-shoota
Member

For the immediate first implementation of 128-bit integer literals, only 1 or 4 are possible. 2 and 3 would be breaking changes. We can still consider them as eventual goal (for 2.0 or later).

I'm in favour of 4, and eventually probably 3. I think it makes sense to add type suffixes to all literals that are not the default type. For smaller types this is already necessary, because untyped literals get the default type unless they don't fit in Int32.
Hence we might even consider making implicit Int64 literals invalid. But that's a different story.

@BlobCodes
Contributor

When I use a positive number literal that doesn't fit in a UInt64, should the error message be
`Error: x doesn't fit in an UInt64` or `Error: x doesn't fit in an Int64`?

Since UInt64 literals without a suffix are set to produce a warning in 1.4, using them is presumably discouraged, but UInt64 is still the biggest integer type supported without a suffix.

@oprypin
Member

oprypin commented Dec 12, 2021

@BlobCodes let's say UInt64. It can easily be changed to Int64 at the same time as the warning is added.

BlobCodes added a commit to BlobCodes/crystal that referenced this issue Dec 12, 2021
@oprypin
Member

oprypin commented Dec 17, 2021

So, #11571 is set to be merged soon, and it implements Alternative 4.

Literals that used to produce UInt64 still produce UInt64 and don't emit a warning as of that PR.
In the table what I wrote about that was "in 1.4 produces a warning". But I didn't give too much thought to that. Why should we start giving a warning in Crystal 1.4 and not 1.3 (the actual upcoming release) or even 1.5? Does anyone have alternative suggestions?
Otherwise let's just stick to the "1.4" plan. I don't see any downsides to it: it would give two "kinds" of transitional periods for this change.

@delef
Contributor

delef commented Dec 18, 2021

I think we need to add the warning in 1.3, because the path has already been chosen. What reason is there to wait for 1.4?

@HertzDevil
Contributor

Tried implementing the warning, but the lexer doesn't have access to the program, and Crystal::NumberLiteral doesn't have access to the raw string, so all I could get working is:

```crystal
0x8000_0000_0000_0000 # Warning: 9223372036854775808 doesn't fit in an Int64, try using the suffix u64 or i128
```

@straight-shoota
Member

I think we should add the raw string to literals. Might even make sense to expose that in the macro API.

@beta-ziliani
Member

Int128 support is good now, so I'm closing this. If there's something missing, we can open a specific issue.

@crysbot

crysbot commented Feb 13, 2024

This issue has been mentioned on Crystal Forum. There might be relevant details there:

https://forum.crystal-lang.org/t/is-this-a-bug/6571/5
