Nonparametric Comparisons of Multiple Distributions under Uniform Stochastic Ordering

This repository contains R programs for the article “Nonparametric Comparisons of Multiple Distributions under Uniform Stochastic Ordering.”

Prior to using R programs on this repository, please download the main R program EGJ_USO_Library.R.

Part 1. Reproducing simulation results in the manuscript

Since both distinguishing distribution methods and GOF tests depend on the ODC between consecutive distributions, it suffices to generate random samples from the ODCs with the first distribution assigned to be uniformly distributed. In the manuscripts, we consider G_q with q between 0 and 1 for star-shaped ODC and G_q for non-star-shaped ODC. See the top-left figure in Figure 1. The sequence of the ODCs K_delta from Wang and Tang (2020) on the right of Figure 1 is for power curve comparison. We provide R codes for generating the random samples from G_q rUSO.samples.R The computation times in the following are based on a computer with a 3.0GHz processor and 64GB of memory.

R codes for ODC plots in Figure 1: Figure_1_ODCs.R.

1.1. Table 1: Equal Distribution Test under USO with k=3.

The R codes to reproduce Table 1 are attached: Testing_Equality_k3.R. The calculation took approximately 10 minutes.

1.2. Table 2: Size and Power of Goodness-of-fit Tests for USO with k=3.

Testing_GOF_k3.R provides the size and power studies with k=3 sample with sample size n=200 which requires 6 hours approximately.

1.3. Figure 2: Power Curves Goodness-of-fit Tests for USO with k=3.

The power curves comparison in Figure 2, (R codes Testing_GOF_k3_PC.R) with k=3 samples and equal sample sizes n=200, requires 6 hours totally on a computer with a 3.0GHz processor and 64GB of memory.

1.4. Table 3: Distinguishing Distributions under USO with k=3.

The R codes to reproduce Table 3 is attached: Testing_Jump_k3.R. The calculation took approximately 10 minutes.

Part 2. Reproducing simulation results in Web Appendix

In addition to the simulation results in the manuscript, more simulations results are provided in the supplementary materials with R codes attached in the followings.

2.1. Tables: Equal Distribution Test under USO with k=4,5.

Other than k=3 samples, we applied the distinghishing distribution methods to samples k=4, and k=5 with sample sizes n=60,100,and 200. All the calculations took less than 10 minuntes.

2.2. Tables: Goodness-of-fit Tests for USO with k=4,5.

We provide the size and power study for k=4, and k=5 samples with sample sizes n=60, 100, and 200. All the calculations took less than 10 minuntes.

2.3. Figures: Power Curves for Goodness-of-fit Tests for USO with k=4,5.

We also consider more settings for power curves comparison for GOF tests with k=4,5 samples and sample sizes n= 200 with R codes attached.

For k=4 with sample sizes n=200 which took approximately 8 hours on a computer with a 3.0GHz processor and 64GB of memory, respectively.
For k=5 with sample sizes n=200 which took approximately 10 hours, respectively.

2.4. Tables: Distinguishing Distributions under USO with k=4,5.

Other than k=3 samples, we applied the distinghishing distribution methods to samples k=4, and k=5 with sample sizes n=60,100,and 200. All the calculations took less than 10 minuntes.

Part 3. MFAP4 data analysis

We applied both distinguishing distribution methods and GOF tests to microfibrillar-associated protein 4 (MFAP4) data with clinical cohort characteristics and MFAP4 serum levels from Bracht et. al. (2016) in MFAP4.xlsx. We grouped the MFAP4 levels in fibrosis stages and saved in R data form data_MFAP4 in MFAP4.Rdata.

Here we provide the empirical estimators and estimators under USO for ODCs between consecutive fibrosis stages with R codes attached: Figure_3.

3.1. Table 3 part 1: Equal test for USO for MFAP4 levels

The first part (first 3 rows) of Table 3 provides the differences of distributions from equality in L_p norm with p=1,2, and supremum norms, respectively. The thresholds for each L_p differences are provided to determine if the consecutive distributions are distinct.

3.2. Table 3 part 2: Goodness-of-fit test for USO for MFAP4 levels

The second part (last 3 rows) of Table 3 provides the departures of consecutive distributions from USO in L_p norm with p=1,2, and supremum norms, respectively. The critical values boot.cv.Skps and boot.cv.Wkps for the cumulated test statistics Skp and Wkp, respectively, are provided. The Bonferroni-corrected critical values are saved in function Bon.cvs with L_p norms Bon.cv.p1, $Bon.cv.p2, and Bon.cv.ps.

3.3. Table 3 part 3: Distinghishing Fibrosis stages from MFAP4 levels

The jump detection methods, including J_p^0 and J_p^* with p=1,2, and infinity, for MFAP4 data are coded in MFAP4_Jump_Detection.R

Reference:

Thilo Bracht, Christian Mölleken, Maike Ahrens, Gereon Poschmann, Anders Schlosser, Martin Eisenacher, Kai Stühler, Helmut E. Meyer, Wolff H. Schmiegel, Uffe Holmskov, Grith L. Sorensen and Barbara Sitek (2016). Evaluation of the biomarker candidate MFAP4 for non-invasive assessment of hepatic fibrosis in hepatitis C patients. Journal of Translational Medicine. 14:201.
Dewei Wang, Chuan-Fa Tang, and Joshua M. Tebbs (2020). More powerful goodness-of-fit tests for uniform stochastic ordering. Computational Statistics & Data Analysis. 144:106898.
Chuan-Fa Tang, Dewei Wang, and Joshua M. Tebbs (2017). Nonparametric goodness-of-fit tests for uniform stochastic ordering The Annals of Statistics. 45:2565-2589.

Name		Name	Last commit message	Last commit date
Latest commit History 184 Commits
.DS_Store		.DS_Store
Bonferroni_corrected_critical_values.R		Bonferroni_corrected_critical_values.R
DR.R		DR.R
EGJ_USO_Library.r		EGJ_USO_Library.r
Figure_1_ODC_Plot.pdf		Figure_1_ODC_Plot.pdf
Figure_1_ODC_Plot.png		Figure_1_ODC_Plot.png
Figure_1_ODCs_Plot.r		Figure_1_ODCs_Plot.r
Figure_2_GOF_PowerCurves_k3_200.R		Figure_2_GOF_PowerCurves_k3_200.R
Figure_2_GOF_PowerCurves_k3_200.pdf		Figure_2_GOF_PowerCurves_k3_200.pdf
Figure_2_GOF_PowerCurves_k3_200.png		Figure_2_GOF_PowerCurves_k3_200.png
Figure_3_MFAP4.R		Figure_3_MFAP4.R
Figure_3_MFAP4.pdf		Figure_3_MFAP4.pdf
Figure_3_MFAP4.png		Figure_3_MFAP4.png
GOFCV.R		GOFCV.R
JDCV.R		JDCV.R
ME-B.R		ME-B.R
MEB_EqCV.R		MEB_EqCV.R
MEB_GOFCV.R		MEB_GOFCV.R
MFAP4.Rdata		MFAP4.Rdata
MFAP4.xlsx		MFAP4.xlsx
MFAP4_Jump_Detection.R		MFAP4_Jump_Detection.R
MUSOLibrary.R		MUSOLibrary.R
ODCandDDs.pdf		ODCandDDs.pdf
ODCandDDs.png		ODCandDDs.png
README.md		README.md
Supp_Figure_GOF_PowerCurves_k4_200.pdf		Supp_Figure_GOF_PowerCurves_k4_200.pdf
Supp_Figure_GOF_PowerCurves_k5_200.pdf		Supp_Figure_GOF_PowerCurves_k5_200.pdf
Testing_Equality_k3.R		Testing_Equality_k3.R
Testing_Equality_k4.R		Testing_Equality_k4.R
Testing_Equality_k5.R		Testing_Equality_k5.R
Testing_GOF_k3.R		Testing_GOF_k3.R
Testing_GOF_k3_PC.R		Testing_GOF_k3_PC.R
Testing_GOF_k4.R		Testing_GOF_k4.R
Testing_GOF_k4_PC.R		Testing_GOF_k4_PC.R
Testing_GOF_k5.R		Testing_GOF_k5.R
Testing_GOF_k5_PC.R		Testing_GOF_k5_PC.R
Testing_Jump_k3.R		Testing_Jump_k3.R
Testing_Jump_k4.R		Testing_Jump_k4.R
Testing_Jump_k5.R		Testing_Jump_k5.R
do_EqualityTests.R		do_EqualityTests.R
do_GOFTests.R		do_GOFTests.R
do_JumpDetection.R		do_JumpDetection.R
rUSO_samples.R		rUSO_samples.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Nonparametric Comparisons of Multiple Distributions under Uniform Stochastic Ordering

Part 1. Reproducing simulation results in the manuscript

1.1. Table 1: Equal Distribution Test under USO with k=3.

1.2. Table 2: Size and Power of Goodness-of-fit Tests for USO with k=3.

1.3. Figure 2: Power Curves Goodness-of-fit Tests for USO with k=3.

1.4. Table 3: Distinguishing Distributions under USO with k=3.

Part 2. Reproducing simulation results in Web Appendix

2.1. Tables: Equal Distribution Test under USO with k=4,5.

2.2. Tables: Goodness-of-fit Tests for USO with k=4,5.

2.3. Figures: Power Curves for Goodness-of-fit Tests for USO with k=4,5.

2.4. Tables: Distinguishing Distributions under USO with k=4,5.

Part 3. MFAP4 data analysis

3.1. Table 3 part 1: Equal test for USO for MFAP4 levels

3.2. Table 3 part 2: Goodness-of-fit test for USO for MFAP4 levels

3.3. Table 3 part 3: Distinghishing Fibrosis stages from MFAP4 levels

Reference:

About

Releases

Packages

Languages

cftang9/MSUSO

Folders and files

Latest commit

History

Repository files navigation

Nonparametric Comparisons of Multiple Distributions under Uniform Stochastic Ordering

Part 1. Reproducing simulation results in the manuscript

1.1. Table 1: Equal Distribution Test under USO with k=3.

1.2. Table 2: Size and Power of Goodness-of-fit Tests for USO with k=3.

1.3. Figure 2: Power Curves Goodness-of-fit Tests for USO with k=3.

1.4. Table 3: Distinguishing Distributions under USO with k=3.

Part 2. Reproducing simulation results in Web Appendix

2.1. Tables: Equal Distribution Test under USO with k=4,5.

2.2. Tables: Goodness-of-fit Tests for USO with k=4,5.

2.3. Figures: Power Curves for Goodness-of-fit Tests for USO with k=4,5.

2.4. Tables: Distinguishing Distributions under USO with k=4,5.

Part 3. MFAP4 data analysis

3.1. Table 3 part 1: Equal test for USO for MFAP4 levels

3.2. Table 3 part 2: Goodness-of-fit test for USO for MFAP4 levels

3.3. Table 3 part 3: Distinghishing Fibrosis stages from MFAP4 levels

Reference:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages