[WIP] Implements `Rfloat` and `Doubles` using macros #284

Ilia-Kosenkov · 2021-09-11T14:57:02Z

Fixes #280.

This turned out to be a much larger contribution than I expected.
After implementing Doubles, I decided to introduce paste as a dependency (see #283) and then further improved code generation.

Major topics:

Rfloat as a thin NA-aware wrapper around f64. I generalized the macros so that both Rint and Rfloat are generated by the same macros, and the only thing written by hand is TryFrom, which also allows coercion (i.e., floats that can be represented as ints can be converted to Rint).
Doubles. Doubles is an almost exact copy of Integers, so, with the help of paste, both Doubles and Integers are generated using a single macro. I expect this approach to work just as well for other primitive types. If not, at least in these cases we do not write the same code twice.
Moved tests from doc strings to a test module. When members are generated using macros, it is even harder to generate meaningful tests. In this case, I took the test cases out and put them in separate modules (under crate::wrappers::{integers, doubles}::tests). These tests are guarded with #[cfg(test)] and are now executed in all build scenarios (see "S4 doctests potentially fail on Windows", #282).
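To make the first topic concrete, here is a minimal sketch of the idea, not the code from this PR. The macro name `gen_scalar!`, the tuple-struct layout, and the `inner()` accessor are hypothetical, and `Rfloat` here treats any NaN as NA, which is a simplification. Only `TryFrom` is written by hand, and it accepts a float only when it holds an integer value that fits in a non-NA `Rint`.

```rust
use std::convert::TryFrom;

// Hypothetical macro: generates a thin NA-aware wrapper around a primitive.
// `$na` is the sentinel value, `$is_na` is a closure that recognises it.
macro_rules! gen_scalar {
    ($name:ident, $prim:ty, $na:expr, $is_na:expr) => {
        #[derive(Debug, Clone, Copy)]
        pub struct $name(pub $prim);

        impl $name {
            pub fn na() -> Self {
                $name($na)
            }
            pub fn is_na(&self) -> bool {
                ($is_na)(self.0)
            }
            pub fn inner(&self) -> $prim {
                self.0
            }
        }

        impl From<$prim> for $name {
            fn from(v: $prim) -> Self {
                $name(v)
            }
        }
    };
}

// i32::MIN plays the role of NA_integer_; any NaN stands in for NA_real_ here.
gen_scalar!(Rint, i32, i32::MIN, |v: i32| v == i32::MIN);
gen_scalar!(Rfloat, f64, f64::NAN, |v: f64| v.is_nan());

// The only hand-written part: a float converts to Rint if it is NA or a
// whole number inside the non-NA i32 range.
impl TryFrom<Rfloat> for Rint {
    type Error = &'static str;

    fn try_from(x: Rfloat) -> Result<Self, Self::Error> {
        if x.is_na() {
            return Ok(Rint::na());
        }
        let v = x.inner();
        // i32::MIN is reserved for NA, so it is excluded from the valid range.
        if v.fract() != 0.0 || v <= (i32::MIN as f64) || v > (i32::MAX as f64) {
            return Err("value cannot be represented as a non-NA Rint");
        }
        Ok(Rint(v as i32))
    }
}
```

With this sketch, `Rint::try_from(Rfloat::from(3.0))` succeeds while `Rint::try_from(Rfloat::from(3.5))` returns an error.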

Takeaways:

Prefer 'horizontal' development to 'vertical' (i.e., deal with one abstraction layer fully instead of implementing support for one type at a time), which encourages macro use instead of heavy copy-pasting
Prefer meaningful tests in a separate module to reduce comment pollution around impl members
Carefully address edge cases (marked with TODO)

TODO:

Meaningful tests in extendr-api/tests
Meaningful tests in tests/extendrtests
Find a way to keep docs when generating members using macros

clauswilke · 2021-09-11T15:56:47Z

This is great!

One comment about doc strings: I understand the reasoning that they shouldn't be used to test the code, and I agree. However, I think 1-2 examples per function are super helpful and make the code more accessible for a new user. So I wouldn't just delete all these examples. Rather assess whether the example is actually helpful or not.

Ilia-Kosenkov · 2021-09-11T16:01:36Z

@clauswilke, I agree about examples. The challenge is -- when members are generated using macros, we have two options -- generate examples the same way we generate all these methods right now, which can be tricky, or add additional parameters to the macros, which are translated into docs.
This is challenging, and I will look into it (and encourage others who know macros well to do the same). I certainly prefer macros with weaker documentation and no code repetition over heavy code repetition with slightly better docs.

clauswilke · 2021-09-11T16:51:17Z

> The challenge is -- when members are generated using macros, we have two options -- generate examples the same way we generate all these methods right now, which can be tricky, or add additional parameters to the macros, which are translated into docs.

I see. Maybe it's better to write documentation by hand and to just put all the relevant info and examples into a single documentation block for the entire type. In other words, put the documentation and examples for Doubles here:

extendr/extendr-api/src/wrapper/doubles.rs

Lines 4 to 7 in 6549281

    #[derive(Debug, PartialEq, Clone)]
    pub struct Doubles {
        pub(crate) robj: Robj,

Ilia-Kosenkov · 2021-09-11T16:58:59Z

@clauswilke,
Yeah, it is probably better to write the whole documentation block in one place.
However, we can provide some limited support for doctest (and examples) by generating docstrings within the macros.
The last commit 27c0858 demonstrates such an example. The generated test case also participates in the testing.
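For readers who have not seen docstrings generated inside a macro, here is a rough sketch of the technique. It is not taken from commit 27c0858. The macro name `gen_scalar_with_docs!`, the `zero()` method, and the crate name `my_crate` in the generated example are all hypothetical. It relies on `#[doc = concat!(...)]`, which stable Rust accepts since 1.54.

```rust
// Hypothetical macro: builds the doc comment (including a doctest) from the
// type name, so every generated type gets its own example.
macro_rules! gen_scalar_with_docs {
    ($name:ident, $prim:ty, $zero:expr) => {
        #[doc = concat!("A thin wrapper around `", stringify!($prim), "`.")]
        #[derive(Debug, Clone, Copy, PartialEq)]
        pub struct $name(pub $prim);

        impl $name {
            #[doc = concat!("Returns the zero value of `", stringify!($name), "`.")]
            ///
            /// ```
            #[doc = concat!("use my_crate::", stringify!($name), ";")]
            #[doc = concat!(
                "assert_eq!(",
                stringify!($name),
                "::zero(), ",
                stringify!($name),
                "::zero());"
            )]
            /// ```
            pub fn zero() -> Self {
                $name($zero)
            }
        }
    };
}

gen_scalar_with_docs!(Rint, i32, 0);
gen_scalar_with_docs!(Rfloat, f64, 0.0);
```

Each expansion carries its own example, so `cargo test` would pick the generated snippets up as doctests, provided the `use` line names the real crate.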

andy-thomason · 2021-09-14T09:54:51Z

Looking good!

I am usually cautious about using extra dependencies, but the problems with concat_idents! require this for now.
There is a PR to fix this in the language but don't forget to exhale.

It is worth thinking about Logicals/Rbool (aka Bool) but not in this PR.

This shares much with Integers.

We may need to implement more operators in the future (such as &, |, ^) for integer and bool.

Ilia-Kosenkov · 2021-09-14T19:14:30Z

I have identified several issues with Rfloat that I need to address.
The main problem is that NA checks and PartialEq do not work in the current macro-based implementation, because NA is represented with a NaN, and f64::NAN != f64::NAN (there are multiple NaN values, and NA uses one of them, AFAIK).
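A small sketch of why a plain comparison cannot detect NA. It assumes, as an outside fact not stated in this thread, that R encodes NA_real_ as a NaN whose low 32-bit word is 1954. The names `R_NA_REAL_BITS`, `na_real`, and `is_na_real` are hypothetical.

```rust
// Assumed bit pattern of R's NA_real_: exponent all ones (a NaN) with the
// low 32-bit word set to 1954 (0x7A2).
const R_NA_REAL_BITS: u64 = 0x7FF0_0000_0000_07A2;

/// Builds the NA value from its bit pattern.
pub fn na_real() -> f64 {
    f64::from_bits(R_NA_REAL_BITS)
}

/// NA is one particular NaN, so `==` is useless here; the check has to look
/// at the payload instead. Only the low word is inspected.
pub fn is_na_real(x: f64) -> bool {
    x.is_nan() && (x.to_bits() & 0xFFFF_FFFF) == 1954
}
```

Under this assumption `na_real() == na_real()` is false like for any NaN, while `is_na_real` still tells NA apart from an ordinary `f64::NAN`.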

Ilia-Kosenkov · 2021-09-14T20:11:09Z

Rint was implementing PartialEq and Eq by default. This implementation is inconsistent with how R treats NAs. For R, NA != NA (the comparison actually returns NA_logical_), so Eq should not be implemented. PartialEq cannot be derived, because the derived equality would give Rint::na() == Rint::na() (ints can be compared directly), but Rfloat::na() != Rfloat::na() (NA_real_ is a NaN, and f64::NAN != f64::NAN).
Solution - manually implement PartialEq for (Rtype, Rtype) and return false if either of the objects is NA. This can be done in a generic way, using macros.

Downside - can no longer compare NAs directly and in assertions, IsNA::is_na() should be used instead (as it is done in R and in f64 when working with NaNs).
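Roughly, the manual implementation could look like the sketch below. This is an illustration, not the merged code: the `IsNA` trait is a local stand-in, the macro name `gen_partial_eq!` is hypothetical, and `Rfloat` treats any NaN as NA for brevity.

```rust
// Local stand-in for the NA check trait.
pub trait IsNA {
    fn is_na(&self) -> bool;
}

#[derive(Debug, Clone, Copy)]
pub struct Rint(pub i32);

#[derive(Debug, Clone, Copy)]
pub struct Rfloat(pub f64);

impl IsNA for Rint {
    fn is_na(&self) -> bool {
        self.0 == i32::MIN
    }
}

impl IsNA for Rfloat {
    fn is_na(&self) -> bool {
        self.0.is_nan()
    }
}

// Hypothetical macro: equality is false as soon as either side is NA,
// otherwise the wrapped values are compared. Eq is deliberately not
// implemented, because NA == NA does not hold.
macro_rules! gen_partial_eq {
    ($name:ident) => {
        impl PartialEq for $name {
            fn eq(&self, other: &Self) -> bool {
                !self.is_na() && !other.is_na() && self.0 == other.0
            }
        }
    };
}

gen_partial_eq!(Rint);
gen_partial_eq!(Rfloat);
```

In tests this means writing `assert!(x.is_na())` rather than `assert_eq!(x, Rint(i32::MIN))`, since the latter can never pass.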

I also generated some doctests using macros, to test simple behavior (like Rtype::default() == Rtype::default()).

Next stop: bounds check when using elt, doctesting this case, then some integration tests using R package. I will probably finalize it in a day or two and it will be ready for review.
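One way a defensive `elt` could behave is sketched below. The thread does not spell out the chosen policy, so returning NA for an out-of-range index is an assumption here, and the `Vec`-backed `Integers` with its `from_values` constructor is a hypothetical stand-in for the real R-backed vector.

```rust
#[derive(Debug, Clone, Copy)]
pub struct Rint(pub i32);

impl Rint {
    pub fn na() -> Self {
        Rint(i32::MIN)
    }
    pub fn is_na(&self) -> bool {
        self.0 == i32::MIN
    }
}

// Stand-in for the real wrapper, which holds an R object instead of a Vec.
pub struct Integers {
    data: Vec<i32>,
}

impl Integers {
    pub fn from_values(values: &[i32]) -> Self {
        Integers { data: values.to_vec() }
    }

    pub fn len(&self) -> usize {
        self.data.len()
    }

    /// Returns the element at `index`, or NA when `index` is out of range
    /// (assumed policy), instead of reading past the end of the buffer.
    pub fn elt(&self, index: usize) -> Rint {
        match self.data.get(index) {
            Some(&v) => Rint(v),
            None => Rint::na(),
        }
    }
}
```

The alternative would be to panic or return an `Option`, which changes the signature for every caller; returning NA keeps `elt` total.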

clauswilke · 2021-09-14T21:05:57Z

> Downside - can no longer compare NAs directly and in assertions,

I don't see this as a downside. As you say, it's exactly how R handles the same case.

andy-thomason · 2021-09-15T12:23:18Z

I agree that Rint::na() should not compare equal (via PartialEq) to another Rint::na().

This behaviour makes it easy to compare floats.

Ilia-Kosenkov · 2021-09-15T19:26:03Z

tests/extendrtests/src/rust/src/lib.rs

+fn double_scalar(x: f64) -> f64 {
+    x
+}

 // Convert an int scalar to itself
 // x a number
 #[extendr]
-fn int_scalar(x: i32) -> i32 { x }
+fn int_scalar(x: i32) -> i32 {
+    x
+}


These formatting changes were unintentional. I hooked cargo fmt to OnSave in CLion, so when I edited this file the formatting was applied automatically and I did not notice.

yutannihilation

Minor comments.

extendr-api/src/wrapper/macros.rs

Co-authored-by: Hiroaki Yutani <yutani.ini@gmail.com>

clauswilke

I feel this exceeds my level of technical understanding so please take my approval as nothing more than saying I'm very much in favor of this work and am glad you're taking the lead.

One minor comment: I always find it hard to read macro code and I would encourage you to be extra careful with documenting. It may seem all obvious to you today as you're writing this but a few more comments probably wouldn't hurt. At a minimum, I think it would be good for every chunk of macro code (i.e., every impl section) to provide in a comment a concrete example of what it implements. For example: "Implement binary mathematical operators such as + or -: a + b"
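As an illustration of the kind of comment suggested here, a macro chunk might be annotated like this. The macro `gen_binop!` and the NA-on-overflow rule are assumptions made for the sketch, not the PR's actual code.

```rust
use std::ops::{Add, Sub};

#[derive(Debug, Clone, Copy)]
pub struct Rint(pub i32);

impl Rint {
    pub fn na() -> Self {
        Rint(i32::MIN)
    }
    pub fn is_na(&self) -> bool {
        self.0 == i32::MIN
    }
}

// Implement a binary mathematical operator such as + or -: `a + b`.
//
// `gen_binop!(Add, add, checked_add)` expands to `impl Add for Rint`, where
// `a + b` is NA if either operand is NA or if the i32 operation overflows
// (assumed rule), and the plain sum otherwise.
macro_rules! gen_binop {
    ($op_trait:ident, $method:ident, $checked:ident) => {
        impl $op_trait for Rint {
            type Output = Rint;

            fn $method(self, rhs: Rint) -> Rint {
                if self.is_na() || rhs.is_na() {
                    return Rint::na();
                }
                match self.0.$checked(rhs.0) {
                    Some(v) => Rint(v),
                    None => Rint::na(),
                }
            }
        }
    };
}

gen_binop!(Add, add, checked_add);
gen_binop!(Sub, sub, checked_sub);
```

The comment names the operator family, shows one call site, and states the NA rule, which is usually enough to read the expansion without running `cargo expand`.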

* rebase on master
* I hope this explains the drop test.
* More comments for Claus.
* Fix typos
* fmt
* More comments.
* Rashly edit the merge conflict in github.
* fmt
* [WIP] Implements `Rfloat` and `Doubles` using macros (#284)
* Generalizing macros
* Generating 'From'
* Fixed typo
* More macros
* Preliminary implementation of Rfloat
* First Rfloat tests
* Refactored macros
* Split 'impl' macros into two
* Instance methods for Rfloat
* Mixed bianry operators
* Testing Rfloat
* Formatting
* Gnenerating altrep implementations
* Adding support for Doubles
* Comments
* Using 'paste!' to generate vectors
* Implementing 'Doubles' using macros
* Ported 'Integers' to new macros
* Generating default value
* Formatting
* Adding comments to the macros
* Simplified unary operator macros
* Simplified binary operator macros
* [POC] generating doc tests on the fly
* Implementing 'Default' for scalars
* Some docs
* Changed how 'NA's are handled
* Updated scalars tests
* Defensive check on 'vector::elt' + doctests for 'elt'
* Testing 'Doubles'
* Testing out of range
* Testing Integers/Doubles from R
* Updated tests
* Fixed docs
Co-authored-by: Hiroaki Yutani <yutani.ini@gmail.com>
* Note on 'elt' boundary checks [skip ci]
Co-authored-by: Hiroaki Yutani <yutani.ini@gmail.com>
* rebase on master
* I hope this explains the drop test.
* More comments for Claus.
* Fix typos
* fmt
* More comments.
* Rashly edit the merge conflict in github.
* fmt
* Fix merge conflict.
* Merge
Co-authored-by: Ilia <ilia.kosenkov@outlook.com>
Co-authored-by: Hiroaki Yutani <yutani.ini@gmail.com>

Ilia-Kosenkov added 21 commits September 9, 2021 23:14

Generalizing macros

dc071d3

Generating 'From'

0719e51

Fixed typo

7b6e515

More macros

43bb4c8

Preliminary implementation of Rfloat

48dba56

First Rfloat tests

828cf61

Refactored macros

f1c3d46

Split 'impl' macros into two

8f38b44

Instance methods for Rfloat

02f4615

Mixed bianry operators

c104b61

Testing Rfloat

c3c202b

Formatting

d3adf17

Gnenerating altrep implementations

bcdb9db

Adding support for Doubles

a3dda89

Comments

daaa0a7

Using 'paste!' to generate vectors

301532c

Implementing 'Doubles' using macros

45b20e9

Ported 'Integers' to new macros

a14eb76

Generating default value

0b29a2c

Formatting

933ded9

Adding comments to the macros

a2adf43

Ilia-Kosenkov marked this pull request as draft September 11, 2021 15:00

Ilia-Kosenkov added 2 commits September 11, 2021 18:35

Simplified unary operator macros

1507757

Simplified binary operator macros

6549281

Ilia-Kosenkov mentioned this pull request Sep 11, 2021

Proposal: Introduce paste dependency in extendr-api #283

Closed

[POC] generating doc tests on the fly

27c0858

Ilia-Kosenkov added 2 commits September 14, 2021 23:04

Changed how 'NA's are handled

cf20284

Updated scalars tests

8f2eba6

Ilia-Kosenkov added 5 commits September 15, 2021 17:16

Defensive check on 'vector::elt' + doctests for 'elt'

21a4b13

Testing 'Doubles'

ff52b8c

Testing out of range

1e6e608

Testing Integers/Doubles from R

305482b

Updated tests

64dff42

Ilia-Kosenkov force-pushed the doubles-w-paste branch from a5b3e24 to 64dff42 Compare September 15, 2021 19:20

Ilia-Kosenkov commented Sep 15, 2021

Ilia-Kosenkov marked this pull request as ready for review September 15, 2021 19:29

yutannihilation reviewed Sep 16, 2021

Fixed docs

36bf77f

Co-authored-by: Hiroaki Yutani <yutani.ini@gmail.com>

Ilia-Kosenkov requested review from andy-thomason, clauswilke and yutannihilation September 19, 2021 18:30

yutannihilation approved these changes Sep 19, 2021

clauswilke approved these changes Sep 20, 2021

Note on 'elt' boundary checks [skip ci]

4abca00

andy-thomason approved these changes Sep 20, 2021

andy-thomason merged commit 298a31d into extendr:master Sep 20, 2021

This was referenced Sep 23, 2021

Int, Real conversions follow-up #264

Closed

SliceIter<T>::from_slice, SliceIter<T>::next allow unsafe behavior #267

Closed

multimeric mentioned this pull request May 28, 2022

Add proper wrappers for vector types and make names consistent and intuitive #266

Open

Ilia-Kosenkov deleted the doubles-w-paste branch October 8, 2023 10:57
