Development NumericalDerivatives #7

yuchuan2016 · 2017-07-11T06:01:17Z

@arosenbe , could you kindly take a look at this? This package is simply a translation of this MATLAB library. It should be straightforward. Thanks!

arosenbe

Good stuff @yuchuan2016! A few general comments here, and some more specific stuff below.

Use Jacobian instead of Jacob in the documentation.
Could you explain why you left off the check of whether the func argument is empty?

arosenbe · 2017-07-11T17:23:20Z

NumericalDerivatives/DESCRIPTION

+  person("Matthew", "Gentzkow", email = "gentzkow@stanford.edu", role = c("aut", "cre")), 
+  person("Jesse", "Shapiro", email = "jesse_shapiro_1@brown.edu", role = c("aut"))
+  )
+Description: Functions to calculate Jacob and Hessian matrix numerically.


Maybe

Functions to numerically calculate Jacobian and Hessian matrices.

arosenbe · 2017-07-11T17:23:56Z

NumericalDerivatives/LICENSE

@@ -0,0 +1,7 @@
+Copyright (c) 2016 Matthew Gentzkow, Jesse M. Shapiro


Update the copyright year.

arosenbe · 2017-07-11T17:28:42Z

NumericalDerivatives/R/numHess.R

+#' evaluated at given values of arguments.
+#' @param func The function on which Hessian matrix is calculated.
+#' @param x0 The values of arguments at which the Hessian matrix is evaluated.
+#' @param xTol The tolerance, where a small \code{xTol} corresponds to increased accuracy


Try

where a smaller

arosenbe · 2017-07-11T17:44:54Z

NumericalDerivatives/R/numJacob.R

+#' evaluated at given values of arguments.
+#' @param func The function on which Jacob matrix is calculated.
+#' @param x0 The values of arguments at which the Jacob matrix is evaluated.
+#' @param xTol The tolerance, where a small \code{xTol} corresponds to increased accuracy


Try

where a smaller

arosenbe · 2017-07-11T17:49:52Z

NumericalDerivatives/R/numJacob.R

+        paramdim <- nparam
+    }
+    Jacobian <- matrix(0, noutput, nparam)
+    for (j in 1:nparam) {


I'd like to see a more "R-like" implementation, rather than a direct translation of the Matlab code. Later users will thank you. So, I wouldn't pre-instantiate the Jacobian and fill it in columnwise. Instead, I'd use sapply or lapply followed by do.call(cbind, <list_name>) to create the final Jacobian matrix directly. (Be careful with the dimensions if you use sapply and make sure to try it on single-row matrices)

arosenbe · 2017-07-11T17:50:12Z

NumericalDerivatives/tests/testthat/test-numHess.R

+
+    expect_equal(hess1, truehess1, tolerance = 1e-3)
+    expect_equal(hess2, truehess2, tolerance = 1e-3)
+})


Carriage return

arosenbe · 2017-07-11T17:50:20Z

NumericalDerivatives/tests/testthat.R

+test_check("NumericalDerivatives")
+
+sprintf("Tests end at %s", Sys.time())
+sink()


Carriage return

arosenbe · 2017-07-11T18:15:21Z

NumericalDerivatives/R/numHess.R

+
+    for (i in 1:length(ind_rowvar)) {
+        row <- ind_rowvar[i]
+        for (j in 1:length(ind_colvar)) {


As for the Jacobian, I'd like to see a more R-style implementation using sapply or lapply with do.call(cbind, )

yuchuan2016 · 2017-07-11T20:42:44Z

@arosenbe , thanks for the comments! I have implemented all sugegstions. Could you take a look? About the check of empty function argument, I'm not sure what's the parallel test in R. I don't know how to create an empty function. The closest I can think of is function() NULL. Should I test something like is.null(func())?

arosenbe · 2017-07-12T20:04:10Z

Thanks @yuchuan2016. I like the new sapply syntax a lot! On the empty function check, I think you were initially correct in ignoring it (don't know why I brought it up). Nobody can take the Jacobian of a nonexistent function.

I have one more comment: All arguments should be passed explicitly to f. We shouldn't rely on the parent scope to fill them in for us.

yuchuan2016 · 2017-07-13T16:34:13Z

@arosenbe , thanks! I don't quite understand your last comment. Could you explain it a little? Thanks a lot!

arosenbe · 2017-07-13T19:36:44Z

Yeah, definitely @yuchuan2016! The inner f function takes j as an argument but (at least in numJacob) also depends on x0, xTol, paramdim, and func. The current implementation passes j to f explicitly in the sapply statement, but the other dependencies are filled implicitly by f checking its parent scope (the numJacob function). If the variables weren't found there, then f would check numJacob's parent scope. This can be problematic, because we might end up filling these implicit dependencies with values different from the ones we intended.

A solution is to enumerate these dependencies explicitly as arguments in the f function. These arguments can be filled in the sapply call as arguments after f. A simple example of this is below. Let me know if you need a different description or want to talk in person.

# Good practice (pass y explicitly)
f_good <- function(x, y){
  out <- c(x, y)
  return(out)
}

out_good <- sapply(1:10, f_good, 0)

# Bad practice (check parent scope for implicit dependency on `y`)
f_bad <- function(x){
  out <- c(x, y)
  return(out)
}

y <- 0
out_bad <- sapply(1:10, f_bad)

# Check for equality
all.equal(out_good, out_bad) # True

yuchuan2016 · 2017-07-13T21:03:27Z

@arosenbe , thanks! Your explanation is very clear. I have updated the relevant part.

arosenbe · 2017-07-13T22:31:29Z

Looking better @yuchuan2016! I especially love that the way that the argument positions line up. I do have another comment (sorry!) about your implementation of my suggestion above #7 (comment).

It looks like you moved some steps that only need to be done once (e.g., function evaluation at base argument values) into the inner f function. This results in them being evaluated length(x0) times, which could result in a considerable loss of speed if the function's called a lot, which I expect them to be. It'd be great if you could move these steps outside the f function and just pass their output into the f function.

yuchuan2016 · 2017-07-13T22:54:36Z

@arosenbe , very good point! Thanks. I have modified the function.

arosenbe · 2017-07-14T16:14:36Z

Thanks @yuchuan2016! I'm happy with the format of the code now. I wonder if we don't want to have some error checking though. I was playing around with the code and was surprised by the result below.

# Assumes you've already loaded numJacob
f <- function(x) x[1]^2 + x[2]^3 + x[3]^4
out <- NumericalDerivatives::numJacob(f, c(1, 2), .5)
print(out)
# [1] NA NA

I think this occurs because f(x0) evaluates to NA when too few arguments are passed. Oppositely, when there are too many, we get weird stuff like this

f <- function(x) x[1]^2 + x[2]^3 + x[3]^4
out <- NumericalDerivatives::numJacob(f, c(1, 2, 3, 4), .5)
print(out)
# [1]   2.500  19.000 219.375   0.000

If the Matlab implementation was fine with these edge cases, then I think we can ignore them. Could you check and report back?

yuchuan2016 · 2017-07-14T16:38:54Z

@arosenbe , good point! I think the second case is fine. It just means the function does not depend on the fourth argument, so the derivative is 0. The MATLAB implementation also return a 1*4 matrix.

For the first case, MATLAB function will raise an error "Index exceeds matrix dimensions". The difference comes from that in MARLAB, you have

x=[1;2];
x(3)
Index exceeds matrix dimensions.

While in R, you have

x <- c(1, 2)
x[3]
[1] NA

Can we raise an error if f(x0) is evaluated to be "NA"?

arosenbe · 2017-07-14T17:08:18Z

I think that's a great solution @yuchuan2016.

arosenbe · 2017-07-14T17:27:50Z

@yuchuan2016 I was taking a closer look at numHess, and I have three comments

Not all of the dependencies are enumerated for f_col. It's missing ind_rowvar and f.
We'll need another argument check to avoid specifying rows/columns of the Hessian that don't exist:

f <- function(x) x[1]^2 + x[2]^3 + x[3]^4

x0 <- c(1, 2, 3)
ind_colvar <- length(x0) + 1 # same for row

out <- NumericalDerivatives::numHess(f, x0, .5, ind_colvar = ind_colvar)

print(out)
warnings()

The sapply function is returning different types of objects based on the dimensions of the output. I think this is an issue in numJacob as well, but the example is easier to construct for numHess. I'd like us to ensure that everything is always output as a matrix. Be careful because vectors in R are row vectors, but as.matrix(c(1:10)) is a 10x1 matrix.

f <- function(x) x[1]^2 + x[2]^3 + x[3]^4

x0 <- c(1, 2, 3)

out_row <- NumericalDerivatives::numHess(f, x0, .5, ind_rowvar = 1)
is.matrix(out) # False

out_col <- NumericalDerivatives::numHess(f, x0, .5, ind_colvar = 1)
is.matrix(out) # True

yuchuan2016 · 2017-07-14T18:03:39Z

@arosenbe , thanks! I incorporate your comments. Could you take a look?

arosenbe · 2017-07-19T17:19:17Z

Sorry for my delay in getting back to you @yuchuan2016! Your commit above does address my comments. However, I think it introduces a new bug in numHess. The sapply function seems to concatenate entire matrices when the target function returns a vector.

fun_vctr <- function(x){
  out <- c(x[1]^2, x[2]^2, x[3]^2)
}

fun_sclr <- function(x){
  out <- c(x[1]^2 + x[2]^2 + x[3]^2)
}

dim(numHess(fun_sclr, c(2, 3, 4), .5)) # Square
dim(numHess(fun_vctr, c(2, 3, 4), .5)) # Not square

yuchuan2016 · 2017-07-19T17:43:28Z

@arosenbe , thanks! The MATLAB version does not allow a vector func. If you pass a vector function, it will throw an error

Assignment has more non-singleton rhs dimensions than non-singleton subscripts

since we assign hess(i,j) a vector value.

Do you think we want to allow numHess to handle a vector-value function? If so, should the output be a three-dimension array, and the length of the first dimension is the length of function output? Or we can also raise an error if length(f0)>0, as the original function?

arosenbe · 2017-07-19T17:54:35Z

Good thought going back to the MATLAB @yuchuan2016! I think we want to be as faithful to that implementation as possible. So let's raise an error if length(f0)>1 for numHess. I do think we should add this to the documentation. Maybe something like

The function numerically calculates the Hessian matrix of a given ~~vector~~scalar-valued function.

Do you know if this same requirement is put on numJacob?

yuchuan2016 · 2017-07-19T18:06:51Z

@arosenbe , actually numJacob allows vector-valued functions. It returns a row vector for a scalar-valued function, and a matrix for a vector-valued function with the ith row corresponding to the ith element of the function output. The current R function does the same thing.

Do you mean "a given scalar-valued function" in your previous comment?

arosenbe · 2017-07-19T18:08:08Z

Yes I did @yuchuan2016.

yuchuan2016 · 2017-07-20T00:32:44Z

@arosenbe , thanks! I'm going to merge to master if no other issues arise.

yuchuan2016 added 3 commits July 10, 2017 21:59

#6 add numJacob function

962da58

#6 add numHess function

cf20308

#5 small edit

a81763f

yuchuan2016 requested a review from arosenbe July 11, 2017 06:01

yuchuan2016 mentioned this pull request Jul 11, 2017

Migrate numerical_derivatives from MATLAB to R #6

Closed

arosenbe reviewed Jul 11, 2017

View reviewed changes

yuchuan2016 added 2 commits July 11, 2017 12:22

#7 small fix for PR

ce84078

#7 change to sapply in response to PR

9e0404c

yuchuan2016 added 2 commits July 13, 2017 13:54

#7 respond to PR

778bddc

#7 small edit

d5991a2

#7 response to PR

5816c70

#7 add pre-check

6e68515

#7 response to PR

96f60e7

#7 add error in numHes when input is vector function

9b08942

yuchuan2016 merged commit 3bb1c82 into master Jul 20, 2017

yuchuan2016 deleted the development-numerical_derivatives branch July 20, 2017 16:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Development NumericalDerivatives #7

Development NumericalDerivatives #7

yuchuan2016 commented Jul 11, 2017

arosenbe left a comment

arosenbe Jul 11, 2017

arosenbe Jul 11, 2017

arosenbe Jul 11, 2017

arosenbe Jul 11, 2017

arosenbe Jul 11, 2017

arosenbe Jul 11, 2017

arosenbe Jul 11, 2017

arosenbe Jul 11, 2017

yuchuan2016 commented Jul 11, 2017

arosenbe commented Jul 12, 2017

yuchuan2016 commented Jul 13, 2017

arosenbe commented Jul 13, 2017

yuchuan2016 commented Jul 13, 2017

arosenbe commented Jul 13, 2017

yuchuan2016 commented Jul 13, 2017

arosenbe commented Jul 14, 2017 •

edited

Loading

yuchuan2016 commented Jul 14, 2017

arosenbe commented Jul 14, 2017

arosenbe commented Jul 14, 2017

yuchuan2016 commented Jul 14, 2017

arosenbe commented Jul 19, 2017 •

edited

Loading

yuchuan2016 commented Jul 19, 2017

arosenbe commented Jul 19, 2017 •

edited

Loading

yuchuan2016 commented Jul 19, 2017

arosenbe commented Jul 19, 2017

yuchuan2016 commented Jul 20, 2017

		@@ -0,0 +1,7 @@
		Copyright (c) 2016 Matthew Gentzkow, Jesse M. Shapiro

Development NumericalDerivatives #7

Development NumericalDerivatives #7

Conversation

yuchuan2016 commented Jul 11, 2017

arosenbe left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

yuchuan2016 commented Jul 11, 2017

arosenbe commented Jul 12, 2017

yuchuan2016 commented Jul 13, 2017

arosenbe commented Jul 13, 2017

yuchuan2016 commented Jul 13, 2017

arosenbe commented Jul 13, 2017

yuchuan2016 commented Jul 13, 2017

arosenbe commented Jul 14, 2017 • edited Loading

yuchuan2016 commented Jul 14, 2017

arosenbe commented Jul 14, 2017

arosenbe commented Jul 14, 2017

yuchuan2016 commented Jul 14, 2017

arosenbe commented Jul 19, 2017 • edited Loading

yuchuan2016 commented Jul 19, 2017

arosenbe commented Jul 19, 2017 • edited Loading

yuchuan2016 commented Jul 19, 2017

arosenbe commented Jul 19, 2017

yuchuan2016 commented Jul 20, 2017

arosenbe commented Jul 14, 2017 •

edited

Loading

arosenbe commented Jul 19, 2017 •

edited

Loading

arosenbe commented Jul 19, 2017 •

edited

Loading