Fabricate rewrite PR #41

aaronrudkin · 2017-12-19T20:22:25Z

This is the PR that will merge in all of the new level syntax, including the cross_level stuff.

Travis:
Travis is currently failing on a specific test even though R CMD check passes on my local system. @nfultz figures it's a weird Travis quirk, but it's POSSIBLE that there is an actual build issue (see builds 280, 281, 283). I'll be back this evening if I'm needed to address this stuff.

Notes about backwards compatibility:
The one thing that I am not 100% certain about in terms of replicating the existing behaviour of the old level function is the exact conditions under which a new ID label column is added to data. This is not breaking when it comes to any substantive stuff, it's just that a data frame might have an extra ID column relative to some past workflows. I began a discussion with @graemeblair about this in issue #33 -- I just haven't fully doubled back and verified exactly what's going on in the code.

Not done:
Vignettes related to cross-classifying data; draw_discrete changes

Code review / refactoring notes:
Some of the error handling and setup for the level functions is slightly duplicated and could possibly be abstracted into helper functions; the fabricate.R file should be split up into multiple files.

nfultz

Looks excellent and pretty well tested. The stuff with building a working environment could use a second pass (not using lists inside the env could simplify quite a bit of the bookkeeping) but if it's working we can always change it later. 👍 overall.

nfultz · 2017-12-19T22:58:37Z

NAMESPACE

 export(fabricate)
+export(join)


Close to overloading dplyr funs, but barely doesn't ;)

nfultz · 2017-12-19T23:04:43Z

R/cross_classify_helpers.R

+  if(is.null(sigma)) {
+    if(is.atomic(rho) & length(rho)==1) {
+      if(ndim>2 & rho<0) {
+        stop("The correlation matrix must be positive semi-definite. In specific, ",


"Specifically"

omg grammar correction in a code review

nfultz · 2017-12-19T23:05:37Z

R/cross_classify_helpers.R

+  }
+
+  # Can we use the fast package or are we stuck with the slow one?
+  use_f = use_f && requireNamespace("mvnfast", quietly = TRUE)


nfultz · 2017-12-19T23:06:22Z

R/cross_classify_helpers.R

+
+  } else {
+    # Using mvnfast
+    correlated_sn = mvnfast::rmvn(N, ncores = 2, mu, sigma)


getOption("mc.cores")

oh, nuts, this is clearly an artifact from putzing around on my machine. Will fix.
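
A minimal sketch of the getOption("mc.cores") suggestion, under stated assumptions: resolve_cores is a hypothetical helper name, the single-core fallback is an illustrative choice rather than something settled in this review, and the inputs N, mu and sigma are made-up example values.

```r
# Hypothetical helper (not part of the package): read the session-wide
# core count from the "mc.cores" option, falling back to `default`
# when the option is unset.
resolve_cores <- function(default = 1L) {
  as.integer(getOption("mc.cores", default))
}

# Example inputs for illustration only.
N <- 5
mu <- c(0, 0)
sigma <- diag(2)

# Only call mvnfast when it is installed, mirroring the requireNamespace
# check in the diff above.
if (requireNamespace("mvnfast", quietly = TRUE)) {
  correlated_sn <- mvnfast::rmvn(N, mu, sigma, ncores = resolve_cores())
}
```

This way the core count follows whatever the user set with options(mc.cores = ...) instead of a value hard-coded on one machine.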

nfultz · 2017-12-19T23:08:19Z

R/cross_classify_helpers.R

+    # with right_chol to make it correlated.
+    correlated_sn <- matrix(rnorm(N * ndim),
+                            nrow = N,
+                            byrow = TRUE) %*% right_chol


No need for byrow
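
To illustrate why byrow should be unnecessary here (a sketch with made-up sizes, not the package code): byrow only changes where each draw lands in the matrix, and since the rnorm draws are independent and identically distributed, either arrangement is an equally valid N x ndim matrix of standard normals.

```r
set.seed(1)
N <- 4
ndim <- 3
draws <- rnorm(N * ndim)

# Same draws, filled column-wise (the default) and row-wise.
by_col <- matrix(draws, nrow = N)
by_row <- matrix(draws, nrow = N, byrow = TRUE)

# Both layouts have the same shape and contain exactly the same values;
# only the positions differ.
same_shape <- identical(dim(by_col), dim(by_row))
same_values <- identical(sort(as.vector(by_col)), sort(as.vector(by_row)))
```

Dropping byrow changes which specific numbers appear in which cell for a given seed, so any test pinned to exact values would need updating.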

nfultz · 2017-12-19T23:27:18Z

R/fabricate.R

+
+  # User is adding a new level, but already has a working data frame.
+  # Shelf the working data frame and move on
+  if("data_frame_output_" %in% names(working_environment_)) {


This logic is a bit hard to follow, going to meditate on it.

nfultz · 2017-12-19T23:29:04Z

R/fabricate.R

+    # Any obs that matches the level matching this i will be a duplicate of this i.
+    index_maps[
+      working_environment_$data_frame_output_[[ID_label]] == unique_values_of_level[i]
+      ] = i


Yeah I'm really open to a better implementation of this, I'm sure there is one.
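
One possible alternative, offered as a sketch rather than a tested replacement: if the loop assigns i to every row whose ID equals unique_values_of_level[i], base R's match() produces the same mapping in one vectorized call. The ids vector below is made-up example data and assumes no missing IDs.

```r
# Made-up example IDs standing in for
# working_environment_$data_frame_output_[[ID_label]].
ids <- c("b", "a", "b", "c", "a")
unique_values_of_level <- unique(ids)

# Loop version, following the shape of the code in the diff.
index_loop <- integer(length(ids))
for (i in seq_along(unique_values_of_level)) {
  index_loop[ids == unique_values_of_level[i]] <- i
}

# Vectorized version: position of each ID in the vector of unique values.
index_match <- match(ids, unique_values_of_level)
```

Both vectors map each row to the position of its ID among the unique values; match() avoids one full comparison pass per level.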

nfultz · 2017-12-19T23:30:32Z

R/fabricate.R

+  # Move the current working data frame into a package
+  package_df = list(data_frame_output_ = working_environment_$data_frame_output_,
+                    level_ids_ = working_environment_$level_ids_,
+                    variable_names_ = names(working_environment_$data_frame_output_))


this snippet pops up a bit, might want to make a helper / constructor function.
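
A sketch of what such a constructor could look like. The function name package_working_df is hypothetical, and the toy environment below only imitates the fields visible in the diff.

```r
# Hypothetical constructor: bundle the current working data frame and its
# bookkeeping fields into a single list.
package_working_df <- function(working_environment_) {
  list(
    data_frame_output_ = working_environment_$data_frame_output_,
    level_ids_ = working_environment_$level_ids_,
    variable_names_ = names(working_environment_$data_frame_output_)
  )
}

# Toy working environment for illustration only.
working_environment_ <- new.env()
working_environment_$data_frame_output_ <- data.frame(id = 1:3, x = c(0.1, 0.2, 0.3))
working_environment_$level_ids_ <- "id"

package_df <- package_working_df(working_environment_)
```

Each call site that currently builds the list by hand could then call the helper, so a change to the packaged fields happens in one place.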

nfultz · 2017-12-19T23:32:19Z

R/fabricate.R

+  # Loop over the variable name
+
+  variable_names = by$variable_names
+  data_frame_indices = numeric(length(variable_names))


length always returns an int I believe.
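
A small check of that claim, plus one reading of what it implies (the reading is an assumption, since the comment does not spell it out): for ordinary vectors length() returns an integer, so a vector of indices could be preallocated with integer() rather than numeric(). The variable names below are example values.

```r
# Example names for illustration only.
variable_names <- c("a", "b", "c")

n <- length(variable_names)

# numeric() preallocates doubles; integer() preallocates integers,
# which matches the type of index values.
as_double <- numeric(n)
data_frame_indices <- integer(n)
```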

nfultz · 2017-12-19T23:40:17Z

R/cross_classify_helpers.R

+      # What would the indices of the quantiles be if our data was ordered --
+      # if the answer is below 0, set it to 1. round will ensure the tie-
+      # breaking behaviour is random with respect to outcomes
+      ordered_indices = pmax(1,


This can be factored out of the lapply into a separate sweep statement.

nfultz · 2017-12-19T23:54:29Z

tests/testthat/test-crossclassified.R

+
+test_that("Deliberate failures in cross_level", {
+  expect_error(
+    test_next = fabricate(


You should not have a name (test_next) on this expect_error (test-crossclassified line 96).
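
A base R sketch of why the name matters. check_error is a hypothetical stand-in that only copies the leading argument name (object) of testthat's expect_error; it is not testthat's implementation. A named argument that matches no formal parameter is absorbed by `...`, so the expression under test is never evaluated.

```r
# Hypothetical stand-in for an expectation function whose first formal
# argument is `object`, followed by `...`.
check_error <- function(object, ...) {
  if (missing(object)) {
    return("no expression received")
  }
  tryCatch({
    force(object)
    "no error"
  }, error = function(e) "error caught")
}

# Unnamed: the failing expression is bound to `object` and evaluated.
unnamed_result <- check_error(stop("boom"))

# Named with a non-matching name: the expression lands in `...` and is
# never evaluated, so the failure goes unnoticed.
named_result <- check_error(test_next = stop("boom"))
```

Dropping the test_next name, or assigning inside the expression itself, keeps the call bound to the argument that is actually checked.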

aaronrudkin · 2017-12-20T07:10:48Z

I believe I've implemented all of @nfultz's suggestions that were trivial to do, as well as bumped up the test coverage to hit all the code. I've filed the remaining code review suggestions as issues. Provided everyone else is fine, this can be squashed and merged now.

Phew. Feels good to have over a month of hard work finally be merged into master.

coveralls · 2017-12-20T08:25:22Z

Coverage increased (+23.2%) to 98.554% when pulling 289693b on fabricate_rewrite into fd88f3c on master.

coveralls · 2017-12-20T08:48:04Z

Coverage increased (+24.5%) to 99.889% when pulling f33c26d on fabricate_rewrite into fd88f3c on master.

coveralls · 2017-12-20T09:21:21Z

Coverage increased (+24.5%) to 99.889% when pulling ad6b54c on fabricate_rewrite into fd88f3c on master.

aaronrudkin and others added 30 commits November 2, 2017 15:31

Complete documentation of fabricate.R so it makes sense to me.

b6219fd

Detailed documentation of existing level function.

42855e2

Beginning of fabricate rewrite (this will break a build, expect builds to fail.)

ee91b2b

nest_level and stub modify_level functionality. This build will generate a warning.

40b1c85

Major speed improvement on modify level calls in the original codebase, will implement similar strategy in the refactor in next few commits. This will generate a warning or error on the build.

073935c

Implemented modify_level_new and improved speed of several steps. This build should generate a warning.

35d465d

Switched the working environment to an environment for speed gains. Began process of deprecating old files.

c9e4c52

Renamed files to old- prefix and added to rbuildignore

715df58

Renamed new files to final names.

4e097d2

Renamed functions to take over namespace.

089adca

Cutoff to new version of fabricate and the level functions, updated unit tests to reflect new names, bugfixes to correct unit tests

b49a6b1

Update to use add_level instead of nest_level, documentation, and test fixes. Also cleaning up documentation

a72ddea

README.Rmd, update with Getting started example on main Github page.

41e4090

Bugfixes for row names when resampling, bugfix for adding variables to imported data single-level

2416330

Complete rewrite and expansion of vignette.

5e6e95b

Added README.Rmd to .Rbuildignore

5e156f8

Fixes #32 and provisionally implements suggestion 1 for #33

e9277af

Documentation update Nov 20, 2017

f894d77

Merge branch 'master' into fabricate_rewrite

bf27e10

Added draw_binary_icc and did line length trimming on variable_creation_functions.

8b2cde9

Merge branch 'fabricate_rewrite' of github.com:DeclareDesign/fabricatr into fabricate_rewrite

95de0b0

Fixed #35 and sped up ordered data by swapping cut for findInterval

7ebb9e2

Added a likert unit test to draw ordered data.

31797f6

Fixed issues with last set of tests, added draw_normal_icc

b5f5113

Changed documentation to remove math which was causing an error on Travis but not on local builds, related to building the PDF manual.

a61e41d

Merge commit of doc changes from master into fabricate_rewrite

3b66ee0

Fixed documentation to reflect add_level, modify_level, and added documentation for ICC functions, and fixed bug with normal ICC data.

4349550

Documentation push and fixed a bug in draw_normal_icc vignette.

354fd67

Test additions to ICC data and fixed tests for draw_normal_icc

e473942

Renamed cluster_ids to clusters and patched up a few more tests

472c508

aaronrudkin added 17 commits December 1, 2017 16:30

Fixed bug in error handling in handle_n and added more tests.

ce912ca

More test coverage.

1521ba7

Remaining test coverage for draw_normal_icc

49782c1

Test coverage for helper functions including symbol lookahead and get unique variables by level.

2883619

Test coverage for main fabricate and level methods.

284980d

Moved data frame sanity check for imported data into fabricate call rather than level calls.

3858838

Removed an error handler code could never reach and added a few minor tests.

a91e053

Forgot to commit one character typo fix, broke build.

bd28cc2

Removed superfluous data checking code in nest, modify, and add

351fa3b

cross_classify implementation first pass. This build will generate a warning due to a documentation issue.

593dedc

Changes to cross_level syntax and documentation for cross-classified data.

99b6344

First test pass at cross-classified data, fixed a bug in specifying sigma

bd36789

Fix a bug in specifying rho, added tests for all the cross-classifying helpers.

ff97198

Fixed a bug that made all the tests I just wrote not work.

6e44118

Added additional testing for outer wrapper of cross_level

7990a47

A little bit of test coverage in my life, a little bit of bug fixing by my side, a little bit of optimization is all I need, a little bit of CI bugs is what I see

8fc19f8

Version bump due to breaking syntax change.

7c6c307

graemeblair requested a review from nfultz December 19, 2017 20:49

nfultz approved these changes Dec 19, 2017

nfultz reviewed Dec 19, 2017

aaronrudkin and others added 4 commits December 19, 2017 21:21

Fixes for test apparatus to work with testthat 2.0.0

289693b

A few more tests and fixed a bug in adding variables after importing data.

f33c26d

Fixes from nfultz's code review.

ce4080a

Merge branch 'master' into fabricate_rewrite

ad6b54c

graemeblair merged commit 78f56bb into master Dec 20, 2017

graemeblair deleted the fabricate_rewrite branch January 18, 2018 07:23
