yaImpute: Nearest Neighbor Observation Imputation and Evaluation Tools
================================================
 - Performs nearest neighbor-based imputation using one or more alternative approaches to processing multivariate data.
 - These include methods based on canonical correlation analysis, canonical correspondence analysis, and a multivariate adaptation of the random forest classification and regression techniques of Leo Breiman and Adele Cutler.
 - Additional methods are also offered.
 - The package includes functions for comparing the results from running alternative techniques, detecting imputation targets that are notably distant from reference observations, detecting and correcting for bias, bootstrapping and building ensemble imputations, and mapping results.
 
### About "yaImpute"

Package yaImpute was built to provide an integrated set of tools designed to meet specific challenges in forestry. It provides alternative methods for finding neighbors, integrates a fast search method, and introduces a novel and experimental application of randomForest. 
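
As a minimal sketch of what "alternative methods for finding neighbors" can look like in practice, the example below runs `yai` on the built-in `iris` data with three of the methods documented in the package manual (`euclidean`, `mahalanobis`, and `msn`). The split into 50 reference rows, the choice of X- and Y-variables, and the seed are assumptions made purely for illustration.

```r
library(yaImpute)

data(iris)
set.seed(42)                          # arbitrary seed, only for reproducibility

# Assumption for illustration: 50 randomly chosen rows act as references
# (Y observed); the remaining 100 rows are targets (Y unknown).
refs <- sample(rownames(iris), 50)
x <- iris[, 1:2]                      # X-variables, known for every row
y <- iris[refs, 3:4]                  # Y-variables, known for references only

# Same data, three ways of measuring nearness in X-space.
methods <- c("euclidean", "mahalanobis", "msn")
fits <- lapply(methods, function(m) yai(x = x, y = y, method = m))
names(fits) <- methods

# Nearest reference chosen for each target under each method
# (one row per target, one column per method).
nearest <- sapply(fits, function(f) f$neiIdsTrgs[, 1])
head(nearest)
```

Rows where the three columns disagree are targets whose nearest reference depends on the distance method, which is the situation the package's comparison tools are meant to help with.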

A function for computing the error statistics suggested by Stage and Crookston (2007) is also included. 

We anticipate that progress in this field will continue, particularly in discovering better X-variables and transformations that strengthen the essential requirement for applying these methods: that there be a relationship between the X- and Y-variables.
 

```r
install.packages("yaImpute")
```

```
Installing package into '/home/nbcommon/R'
(as 'lib' is unspecified)
```


```r
library(yaImpute)
```


### Classes:
1. ***`yai`*** is a list returned by function `yai` that contains elements as listed in the manual entry for the package. 

    Of special note here are two pairs of data frames: 
    `neiDstTrgs` holds the distances between each target observation (identified by row name) and its k reference observations (there are k columns), and `neiIdsTrgs` is the corresponding data frame of reference identifications. `neiDstRefs` and `neiIdsRefs` are the counterparts for references.

2. ***`impute.yai`*** is a data frame of imputed values. The row names are target observation identifications and the columns are variables: X-variables, Y-variables, both, or new variables (ancillary data) supplied in the call to `impute.yai`. 

    When observed values are included, additional variables are added with `.o` appended as a suffix to the original name. 
    An attribute holding the scaling factors for each variable is attached; these factors are used in computing the scaled rmsd.

3. ***`compare.yai`*** is a data frame of scaled root mean square differences. Rows are variables, and there is one column for each imputation result passed as an argument.
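
The three classes above can be seen together in a short session. This is a sketch rather than a recommended workflow: it uses the built-in `iris` data, and the 50-row reference split, the variable choices, `k = 2`, and the seed are all assumptions made for illustration.

```r
library(yaImpute)

data(iris)
set.seed(1)                           # arbitrary seed, only for reproducibility

# Assumption for illustration: 50 random rows are references, the rest targets.
refs <- sample(rownames(iris), 50)
x <- iris[, 1:2]                      # X-variables for all observations
y <- iris[refs, 3:4]                  # Y-variables for references only

# 1. yai objects: one per method, each storing k = 2 neighbors.
msn <- yai(x = x, y = y, method = "msn", k = 2)
mal <- yai(x = x, y = y, method = "mahalanobis", k = 2)

class(msn)                            # "yai"
head(msn$neiDstTrgs)                  # distances: one row per target, k columns
head(msn$neiIdsTrgs)                  # identifications of the matching references

# 2. impute.yai: imputed values; columns ending in ".o" hold observed values.
imp <- impute(msn)
head(imp)

# 3. compare.yai: scaled rmsd, one row per variable, one column per object.
cmp <- compare.yai(msn, mal)
cmp
```

Passing the `yai` objects straight to `compare.yai` keeps the example short; imputation results from `impute` can be passed instead when the imputation settings need to differ from the defaults.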
