NetNPMLE

An R package for implementing Neural-Network based NPMLE, Net-NPMLE in short.

Abstract

Net-NPMLE is a flexible nonparametric maxium likelihood estimator (NPMLE) based on deep neural network for latent mixture model. Net-NPMLE is free of tuning hyperparameters and is capable of estimating a smooth or shapely discrete shape of latent distribution.

Installation

To run the Net-NPMLE smoothly, there are several pre-requisites needed to be installed before the R package. The main of Net-NPMLE is worte in Python, especially Pytorch library and we strongly recommend using CUDA (GPU-Based tool) to train Net-NPMLE which can be accelerated a lot than using CPU.

Python 3.7 or above
Pytroch 1.11.0 or above
NAVID CUDA 10.2 or above

In R, we also need reticulate package to run Python in R and devtools to install R package from github.

install.package("reticulate")
install.package("devtools")

Now, use the following code to install NetNPMLE package.

library(devtools)
install_github(repo = "shijiew97/NetNPMLE")
library(NetNPMLE)

Main function

There are two main functions in the Net-NPMLE package, which is detailed specified below.

Net_NPMLE aims to give out Net-NPMLE estimator in uni-variate mixture model. Currently Net-NPMLE supports mixutre models such as Gaussian-location, Poisson-mixture, Lognormal-location, Gumbel-location, Gaussian-scale, Binomial-prob.
Net_NPMLE2 provides bi-variate Net-NPMLE estimator in bi-variate mixture model. Currently bi-variate Net-NPMLE supports Gaussian location-scale mixture model.

Example (1) : Lognormal-location (Beta) mixture.

As a simple example of NetNPMLE pacakge, we consider a Lognorm-location mixture: $\mathbf{Y} \mid \theta \sim \text{Log-normal}(\theta, 1/5) \text{ and } \theta \sim \text{Beta}(3,2)$ where latent distribution follows $\text{Beta}(3,2)$ have a support of $[0,1]$ and $n=2,000$.

Seed <- 128783;set.seed(Seed);dist <- "LogGaussian";param <- 0.2
n <- 2000;L <- 5;num_it <- 4000;n_grid <- 100
theta <- rbeta(n, 3, 2);Y = rlnorm(n, theta, param)
net_npmle <- Net_NPMLE(Y=Y, param=param, dist=dist, n=n, num_it=num_it, n_grid=n_grid)
plot(net_npmle$support, net_npmle$prob, col=rgb(1,0,0,0.8), type='l',
     xlab='support', ylab='mass', lwd=3, ylim=c(0, 0.5), cex.axis=1.85, cex.lab=1.85)

Example (2) : Gaussian-location (Uniform) mixture.

Here we also consider Efron's $\widehat{g}$ estimator in a Gaussian-location (Uniform) mixutre: $\mathbf{Y} \mid \theta \sim \mathcal{N}(\theta,1) \text{ and } \theta \sim \mathcal{U}\text{nif}(-2,2)$ where $n=2,000$.

Seed <- 128783;set.seed(Seed);dist <- "Gaussian";param <- 1.0
n <- 2000;L <- 5;num_it <- 4000;n_grid <- 100
theta <- runif(n, -2, 2);Y = theta + rnorm(n, 0, param)
net_npmle <- Net_NPMLE(Y=Y, param=param, dist=dist, n=n, num_it=num_it, n_grid=n_grid)
plot(net_npmle$support, net_npmle$prob, col=rgb(1,0,0,0.8), type='l',
     xlab='support', ylab='mass', lwd=3, ylim=c(0,0.2), cex.axis=1.85, cex.lab=1.85)
efnpmle = deconvolveR::deconv(tau=net_npmle$support, X=Y, deltaAt=0, family="Normal", pDegree=5, c0=1.0)
efnpmle20 = deconvolveR::deconv(tau=net_npmle$support, X=Y, deltaAt=0, family="Normal", pDegree=20, c0=0.5)
lines(net_npmle$support, efnpmle$stats[,"g"], type="l", xlab="", ylab="", col=rgb(0,0.5,0.5), lty=1, lwd=2)
lines(net_npmle$support, efnpmle20$stats[,"g"], type="l", xlab="", ylab="", col=rgb(0,0,1), lty=1, lwd=2)
legend("topright", c("Net-NPMLE","Efron(5)","Efron(20)"),
        col=c(rgb(1,0,0),rgb(0,0.5,0.5),rgb(0,0,1)), lty=c(1,1,1),
        lwd=c(3,2,2,2,2), bty="n", cex=1.0, x.intersp=0.8, y.intersp=0.8)

Example (3) : Bi-variate Gaussian location-scale mixture

In this case, we explore the application of bi-variate Net-NPMLE in a Gaussion location-scale mixture model: $\pi(\mu,\sigma^2) \sim \text{Normal-inverse-gamma}(\mu=1,\sigma=1,\text{shape}=2,\text{scale}=0.5)$.

Seed <- 128783;set.seed(Seed);dist <- "Gaussian2s";param <- 0.5
n <- 1000;L <- 5;num_it <- 4000;n_grid <- 50;p <- 2
mu <- 1;sigma <- 1; shape <- 2;scale <- 0.5
Y <- matrix(0, n, p)
theta1 <- MCMCpack::rinvgamma(n, shape, scale)
theta2 <- rnorm(n, mu, sd = sqrt(theta1 * sigma^2))
for(i in 1:n){Y[i,] <- theta2[i]+theta1[i]*rnorm(p)}
net_npmle <- Net_NPMLE2(Y=Y, param=param, dist=dist, n=n, num_it=num_it, n_grid=n_grid, p=p)
mu_support <- unique(net_npmle$support[,1])
sig_support <- unique(net_npmle$support[,2])
mu_prob <- rep(0, n_grid);sigma_prob <- rep(0, n_grid)
for(i in 1:n_grid){
     mu_prob[i] <- sum( net_npmle$prob[net_npmle$support[,1]==mu_support[i]] )
     sigma_prob[i] <- sum( net_npmle$prob[net_npmle$support[,2]==sig_support[i]] )
}
plot(mu_support, mu_prob, col=rgb(1,0,0,0.8), type='l', xlim=c(-3,4),
     xlab='support', ylab='mass', lwd=3, ylim=c(0,0.35), cex.axis=1.85, cex.lab=1.85)
lines(sig_support, sigma_prob, col=rgb(0,0,1,0.8), type='l', lwd=3, lty=2)
legend(x=2.0, y=0.385, c(expression(mu),expression(sigma^2)),
       col=c(rgb(1,0,0),rgb(0,0,1)), lty=c(1,2), lwd=c(3,3),
       bty="n", x.intersp=0.5, y.intersp=0.8, seg.len=1.0, cex=1.85)

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
Image		Image
R		R
inst		inst
man		man
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
DESCRIPTION		DESCRIPTION
NAMESPACE		NAMESPACE
NetNPMLE.Rproj		NetNPMLE.Rproj
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Image

Image

R

R

inst

inst

man

man

.Rbuildignore

.Rbuildignore

.gitignore

.gitignore

DESCRIPTION

DESCRIPTION

NAMESPACE

NAMESPACE

NetNPMLE.Rproj

NetNPMLE.Rproj

README.md

README.md

Repository files navigation

NetNPMLE

Abstract

Installation

Main function

Example (1) : Lognormal-location (Beta) mixture.

Example (2) : Gaussian-location (Uniform) mixture.

Example (3) : Bi-variate Gaussian location-scale mixture

About

Releases

Packages

Languages

shijiew97/NetNPMLE

Folders and files

Latest commit

History

Repository files navigation

NetNPMLE

Abstract

Installation

Main function

Example (1) : Lognormal-location (Beta) mixture.

Example (2) : Gaussian-location (Uniform) mixture.

Example (3) : Bi-variate Gaussian location-scale mixture

About

Resources

Stars

Watchers

Forks

Languages