Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.

Already on GitHub? Sign in to your account

Introduce NonMaxU32 for array index #2131

Closed
wants to merge 2 commits into from

Conversation

CYBAI
Copy link
Contributor

@CYBAI CYBAI commented Jun 20, 2022

It changes the following:

  • Introduce NonMaxU32 type
  • Use NonMaxU32 for index property key instead of u32

While looking into the failed built-ins/Reflect/ownKeys/return-on-corresponding-order-large-index.js in test262, I found that the root cause is related to array index for property keys.

The root cause of the failure is, the test uses u32::MAX as key and we're ordering it wrong because we see u32::MAX as a array index instead of an integer index. Based on the spec, array index's range is from 0 <= n < u32::MAX so the max number is exclusive.

We can avoid transforming u32::MAX into property key in some functions like From<T> traits though, it might be forgettable and error-prone. So, I wonder it might be good to introduce the NonMaxU32 type which is similar to NonZeroU32 in std but excluding max instead.

Then, we can use it for the index property key

-    Index(u32),
+    Index(NonMaxU32),

With doing so, we can always guarantee the integer hold by Index property key is valid.

WDYT?

(On the other hand, I'm not sure where is the best place for such types. Please feel free to let me know if I should move it to other places 馃檹)

@codecov
Copy link

codecov bot commented Jun 20, 2022

Codecov Report

Merging #2131 (f18bf35) into main (d173e08) will decrease coverage by 0.05%.
The diff coverage is 25.75%.

@@            Coverage Diff             @@
##             main    #2131      +/-   ##
==========================================
- Coverage   43.48%   43.43%   -0.06%     
==========================================
  Files         220      221       +1     
  Lines       19983    20009      +26     
==========================================
+ Hits         8689     8690       +1     
- Misses      11294    11319      +25     
Impacted Files Coverage 螖
boa_engine/src/lib.rs 86.66% <酶> (酶)
...ine/src/object/internal_methods/integer_indexed.rs 0.00% <0.00%> (酶)
boa_engine/src/object/internal_methods/mod.rs 67.53% <0.00%> (-0.44%) 猬囷笍
boa_engine/src/nonmaxu32.rs 14.81% <14.81%> (酶)
...oa_engine/src/object/internal_methods/arguments.rs 30.76% <20.00%> (酶)
boa_engine/src/object/internal_methods/string.rs 36.17% <33.33%> (酶)
boa_engine/src/property/mod.rs 50.88% <61.53%> (-0.01%) 猬囷笍
boa_engine/src/object/internal_methods/array.rs 76.36% <100.00%> (-1.82%) 猬囷笍
boa_engine/src/object/property_map.rs 20.00% <100.00%> (酶)
...oa_engine/src/syntax/parser/statement/block/mod.rs 41.46% <0.00%> (-2.44%) 猬囷笍
... and 9 more

Continue to review full report at Codecov.

Legend - Click here to learn more
螖 = absolute <relative> (impact), 酶 = not affected, ? = missing data
Powered by Codecov. Last update d173e08...f18bf35. Read the comment docs.

Based on the definition in spec, [array index][] is only valid from zero
to u32::MAX - 1. It would be helpful to use the NonMaxU32 type to define
it as the property key instead so that contributors won't forget to
check the validity.

[array index]: https://tc39.es/ecma262/#array-index
@Razican Razican added this to the v0.16.0 milestone Jun 20, 2022
@Razican Razican added enhancement New feature or request Internal Category for changelog labels Jun 20, 2022
@Razican
Copy link
Member

Razican commented Jun 20, 2022

Test262 conformance changes

VM implementation

Test result main count PR count difference
Total 90,619 90,619 0
Passed 56,859 56,861 +2
Ignored 13,788 13,788 0
Failed 19,972 19,970 -2
Panics 0 0 0
Conformance 62.75% 62.75% +0.00%
Fixed tests (2):
test/built-ins/Reflect/ownKeys/return-on-corresponding-order-large-index.js [strict mode] (previously Failed)
test/built-ins/Reflect/ownKeys/return-on-corresponding-order-large-index.js (previously Failed)

Copy link
Member

@Razican Razican left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks very good! Thanks for the fix. I added some comments regarding some potential enhancements. I would like to also get inputs from the rest of the team, since some of the options might just be too much, or might not make sense.

Oh, and tests seem to be failing in MacOS and Windows, so these need to be fixed.

Comment on lines +24 to +28
// Safety: `NonMaxU32` does not contain any objects which needs to be traced,
// so this is safe.
unsafe impl Trace for NonMaxU32 {
unsafe_empty_trace!();
}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would suggest to just not implement this, and use #[unsafe_ignore_trace] in the enum (if that's allowed). It should be more performant.

/// ```
#[derive(Copy, Clone, Eq, Finalize, PartialEq, Ord, PartialOrd, Hash)]
#[repr(transparent)]
pub struct NonMaxU32(u32);
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would suggest the internal type to be not an enum type and be a structure type:

pub struct NonMaxU32 {
    inner: u32,
}

This would disallow direct editing of the internal structure (having to use getters and setters), and changing the internal type would be API compatible.

Talking about changing the internal type, it might be useful to use a NonZeroU32, which is better optimized for space. On the other hand, this would require using a +1 when storing, and a -1 when retrieving, which could cause a small performance penalty. We should test both options.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry for late reply 馃檹 Thank you for your suggestion! I will try to work on a fix.

On the other hand, this would require using a +1 when storing, and a -1 when retrieving, which could cause a small performance penalty.

This sounds a good idea to me. I experimented a bit using XOR to save with NonZeroU32 like the nonmax crate does last week but I realized we would need to rev() the iterator because the xor'ed value would be in reversed order. So, I wonder just +1 and -1 would be easier.

I will try it more and share more information here or on Discord 馃檹 Thank you!

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think I'm ok with either approach (-1/+1 or XOR).

Comment on lines +93 to +95
pub const fn get(self) -> u32 {
self.0
}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we also include setters? And operations such as add, sub, mul and so on?

Copy link
Contributor Author

@CYBAI CYBAI Jun 26, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm good with adding these methods though, I wonder it also depends on how we want to use it. Do we want to always do calculation with NonMaxU32 directly?

On the other hand, I wonder it might be good to at least add methods like checked_add or saturating_add on NonZeroU32.

WDYT?

Comment on lines +168 to +173
impl fmt::Debug for NonMaxU32 {
#[inline]
fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
self.get().fmt(f)
}
}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This might be a good implementation for Display, but I would say the Debug implementation shouldn't hide the internal structure. I'm open to suggestions, though.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I tried to follow NonZeroU32 debug though, it sounds fine to me to format it like NonMaxU32 { inner: 42 }.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Agree with Razican

Comment on lines +120 to +152
impl BitOr for NonMaxU32 {
type Output = Self;
#[inline]
fn bitor(self, rhs: Self) -> Self::Output {
// Safety: since `self` and `rhs` are both nonzero, the
// result of the bitwise-or will be nonzero.
unsafe { NonMaxU32::new_unchecked(self.get() | rhs.get()) }
}
}

impl BitOr<u32> for NonMaxU32 {
type Output = Self;

#[inline]
fn bitor(self, rhs: u32) -> Self::Output {
// Safety: since `self` is nonzero, the result of the
// bitwise-or will be nonzero regardless of the value of
// `rhs`.
unsafe { NonMaxU32::new_unchecked(self.get() | rhs) }
}
}

impl BitOr<NonMaxU32> for u32 {
type Output = NonMaxU32;

#[inline]
fn bitor(self, rhs: NonMaxU32) -> Self::Output {
// Safety: since `rhs` is nonzero, the result of the
// bitwise-or will be nonzero regardless of the value of
// `self`.
unsafe { NonMaxU32::new_unchecked(self | rhs.get()) }
}
}

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The "Safety" argument and code seems to be copied from the nonzero implementaion and is not correct. (At least the comment is wrong)

@Razican Razican modified the milestones: v0.16.0, v0.17.0 Sep 19, 2022
Copy link
Member

@RageKnify RageKnify left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Need to introduce +1/-1 or XOR on accesses. fix the safety comments and correct the safety comments.

@jasonwilliams
Copy link
Member

Hey @CYBAI how is this going do you need any help?

@CYBAI
Copy link
Contributor Author

CYBAI commented Dec 27, 2022

Hey @CYBAI how is this going do you need any help?

Sorry for very late reply 馃檹 I'll try to find some time to address the comments and update the PR after mid-Jan 馃檱

@raskad
Copy link
Member

raskad commented Sep 30, 2023

Added in #3321

@raskad raskad closed this Sep 30, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request Internal Category for changelog
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

6 participants