Add classification functions #80

calebzulawski · 2021-03-06T14:32:37Z

Closes #36.

calebzulawski · 2021-03-06T15:21:16Z

Looks like aarch64 segfaulted during the travis build (not the test). No clue how that happened...

workingjubilee · 2021-03-22T21:34:06Z

That's... odd. This requires rebasing anyways, can you do that and see if it repros?

workingjubilee · 2021-03-23T07:15:02Z

Looks like we do indeed have a particularly gribbly bug!

calebzulawski · 2021-03-23T13:24:19Z

I read somewhere that LLVM segfaults when an assert fails. When I run this in docker (I don't have access to an actual aarch64 machine) it succeeds, so I thought maybe we're running out of memory and a malloc assert is failing.

Lokathor · 2021-03-28T17:12:10Z

I'm unclear why codegen-units=1 is in the build flags on some targets but not all targets, and I'm worried about that not being picked up by downstream, when this crate is built into core/std, or is inlined into a user's program.

If we need to reduce codegen-units for this to build without crashing LLVM, and a user of the crate leads the codegen-units at the default quantity of 256 of 16, will their build explode when they use this crate?

calebzulawski · 2021-03-28T17:49:56Z

I read in an issue elsewhere that codegen-units=1 could fix aarch64 issues, but it didn't here. I just didn't remove it yet. I'm still not completely certain what is causing the issue. I can't replicate it locally with cross, so I think it's particular to the build agent in some way?

calebzulawski · 2021-03-29T01:17:09Z

I think I've figured out the issue, but I'm not sure what's really happening. The problem is the simd_ne intrinsic which is only used in is_nan, is_normal, and is_subnormal. If I replace it with a naive implementation everything compiles fine on aarch64. I think this is probably a bug in LLVM?

workingjubilee · 2021-04-01T20:20:25Z

Following up from discussion: these two examples offer repro of our bug, but only when targeting aarch64, but seemingly only when generating tests, and only in release mode.

#[test]
fn is_normal() {
    assert!(core_simd::SimdF32::<64>::splat(0.).is_normal().to_array()[0]);
}

#[test]
fn bug() {
    use core_simd::{SimdF32, SimdU32};
    let v = SimdF32::<64>::splat(0.);
    let m = v.lanes_eq(SimdF32::splat(0.0)) & v.lanes_eq(SimdF32::splat(0.0));
    assert!(m.to_array()[0]);
}

calebzulawski · 2021-04-03T19:08:18Z

I reduced the maximum lane count from 64 to 32, since the error only occurred with 64-length vectors.

This prevents AVX-512 vectors of u8, but this is really only temporary until we can get an LLVM fix (and get that fixed version into rustc)

workingjubilee · 2021-04-08T21:41:54Z

Alright, looks like this is good now with that. Have you filed the LLVM bug somewhere? I think with that posted somewhere (anywhere) this can be merged.

calebzulawski · 2021-04-08T23:13:05Z

There's an LLVM bug filed (though I'm not 100% sure it's the same bug). I still need to file the rust and stdsimd bugs to track it.

calebzulawski · 2021-04-08T23:44:09Z

Filed #90 and rust-lang/rust#84020.

calebzulawski requested review from workingjubilee and Lokathor March 6, 2021 14:33

calebzulawski marked this pull request as ready for review March 6, 2021 14:56

calebzulawski force-pushed the feature/comparisons branch from 4f2ad9d to 51ca17a Compare March 23, 2021 03:52

calebzulawski marked this pull request as draft March 28, 2021 17:50

calebzulawski force-pushed the feature/comparisons branch from b9888b0 to 8f7e115 Compare March 28, 2021 21:54

workingjubilee mentioned this pull request Apr 2, 2021

Account some warnings to fix CI rust-lang/packed_simd#315

Merged

calebzulawski added 4 commits April 3, 2021 13:54

Add floating-point classification functions

93ce1c1

Various bug fixes

07247a0

Fix normal and subnormal classification

97bbe2d

Reduce maximum lanes from 64 to 32

e6a5309

calebzulawski force-pushed the feature/comparisons branch from 74fc505 to e6a5309 Compare April 3, 2021 18:43

calebzulawski marked this pull request as ready for review April 3, 2021 20:36

workingjubilee merged commit 0682c31 into master Apr 9, 2021

This was referenced Apr 9, 2021

Add reductions #83

Merged

Failing wasm tests #92

Closed

workingjubilee deleted the feature/comparisons branch April 14, 2021 02:55

akiradeveloper mentioned this pull request Sep 26, 2021

Use SIMD in matrix multiplication akiradeveloper/rubikmaster#8

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add classification functions #80

Add classification functions #80

calebzulawski commented Mar 6, 2021 •

edited

Loading

calebzulawski commented Mar 6, 2021

workingjubilee commented Mar 22, 2021

workingjubilee commented Mar 23, 2021

calebzulawski commented Mar 23, 2021

Lokathor commented Mar 28, 2021

calebzulawski commented Mar 28, 2021

calebzulawski commented Mar 29, 2021

workingjubilee commented Apr 1, 2021

calebzulawski commented Apr 3, 2021

workingjubilee commented Apr 8, 2021

calebzulawski commented Apr 8, 2021

calebzulawski commented Apr 8, 2021

Add classification functions #80

Add classification functions #80

Conversation

calebzulawski commented Mar 6, 2021 • edited Loading

calebzulawski commented Mar 6, 2021

workingjubilee commented Mar 22, 2021

workingjubilee commented Mar 23, 2021

calebzulawski commented Mar 23, 2021

Lokathor commented Mar 28, 2021

calebzulawski commented Mar 28, 2021

calebzulawski commented Mar 29, 2021

workingjubilee commented Apr 1, 2021

calebzulawski commented Apr 3, 2021

workingjubilee commented Apr 8, 2021

calebzulawski commented Apr 8, 2021

calebzulawski commented Apr 8, 2021

calebzulawski commented Mar 6, 2021 •

edited

Loading