diveristy(): dealing with NA's #187

baxter-jeremy · 2016-08-02T10:01:17Z

Hi,

I am very new to diversity/ecological statistical analysis (day 1 in fact!). Thank you for a very useful package and the documentation. A quick comment/observation (and given my lack of ecological experience I am not sure if this is a feature): Consider a data frame named thedf with counts of various species (captured as columns) where each row is a particular site, then
diversity(thedf)
will return the Shannon diversity measure, but a particular site on a particular day (i.e. a row of the df) might be lost/missing for some reason, i.e. the original data might be correctly coded as NA for that entire row in the data frame, but diversity() will return
apply( -x*xlog(x,exp(1), margin=1,sum,na.rm=TRUE)
where x <- sweep(thedf, 1, total= apply(df,1,sum), "/")
i.e. 0 (zero)
Should diversity() not return a NA? Or should there not at least be a warning:

theNAindicies <- which(is.na(rowSums(thedf)))
if ( length(theNAindicies) == 0 ) { warning( "rows of missing data" ) }

Thank you.
Jeremy

The text was updated successfully, but these errors were encountered:

jarioksa · 2016-08-04T09:54:25Z

diversity is a pretty simple function that does not check its input, but that is left as user's responsibility. For instance, it does not even check that input are non-negative. Giving zero-diversity for NA abundances does not look too bad. It would not be too complicated to fix both of these features (reject negative input, give NA if observation has any NA), but this can make diversity slower -- and the function can be called millions of times in simulations. Got to see this.

used to give diversity=0 if any observations were NA. Reported as issue #187 in github

gavinsimpson · 2016-08-04T16:49:32Z

@jarioksa Both changes sound useful, from a user point of view. If such checks slow this down to the extent that the negatively impact simulations it sounds like we need a diversity.fit() function that does the actual diversity calculations on known good data, and that we make diversity() more of a user function. (All in the sense of lm and lm.fit).

jarioksa · 2016-08-04T17:21:49Z

microbenchmark showed no consistent difference in moderate data sets (BCI, Oribatid mites). Haven't merged this yet, but it seemed to work both for a single site and multi-site data sets.

jarioksa · 2016-08-05T08:18:23Z

Solved with commit 014b250. This commit also checks that data are non-negative.

used to give diversity=0 if any observations were NA. Reported as issue #187 in github (cherry picked from commit 5859e3a)

jarioksa pushed a commit that referenced this issue Aug 4, 2016

diversity() is NA if observation had NA values

5859e3a

used to give diversity=0 if any observations were NA. Reported as issue #187 in github

jarioksa self-assigned this Aug 4, 2016

jarioksa pushed a commit that referenced this issue Aug 5, 2016

Merge branch 'issue-#187'

014b250

jarioksa closed this as completed Aug 5, 2016

jarioksa added this to the 2.4-1 milestone Aug 22, 2016

jarioksa mentioned this issue Aug 23, 2016

bug fix release 2.4-1? #194

Closed

jarioksa pushed a commit that referenced this issue Aug 30, 2016

diversity() is NA if observation had NA values

19fa3be

used to give diversity=0 if any observations were NA. Reported as issue #187 in github (cherry picked from commit 5859e3a)

jarioksa mentioned this issue Jun 22, 2017

diversity function strange method of NA assignment #239

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

diveristy(): dealing with NA's #187

diveristy(): dealing with NA's #187

baxter-jeremy commented Aug 2, 2016 •

edited by jarioksa

jarioksa commented Aug 4, 2016

gavinsimpson commented Aug 4, 2016

jarioksa commented Aug 4, 2016 •

edited

jarioksa commented Aug 5, 2016

diveristy(): dealing with NA's #187

diveristy(): dealing with NA's #187

Comments

baxter-jeremy commented Aug 2, 2016 • edited by jarioksa

jarioksa commented Aug 4, 2016

gavinsimpson commented Aug 4, 2016

jarioksa commented Aug 4, 2016 • edited

jarioksa commented Aug 5, 2016

baxter-jeremy commented Aug 2, 2016 •

edited by jarioksa

jarioksa commented Aug 4, 2016 •

edited