This repository provides an accessible, well-documented, and research-ready implementation of QRGMM (Quantile-Regression-Based Generative Metamodeling). It serves both as a reproducibility package for the paper “Learning to Simulate: Generative Metamodeling via Quantile Regression” (https://arxiv.org/abs/2311.17797) and as a general-purpose codebase that researchers and practitioners can easily adapt to new simulation-based learning problems.
The primary goal of this codebase is to make QRGMM easy to use, easy to adapt, and computationally efficient for simulation-based learning and real-time decision-making problems. To that end, we expose the full modeling pipeline, implementation details, and practical tricks required for fast and reliable conditional sample generation.
QRGMM is a generative metamodeling framework for learning conditional output distributions of complex stochastic simulators using quantile regression, with the goal of building a fast “simulator of a simulator.” Given a dataset of covariate–output pairs generated by the simulator, QRGMM works by:

- Fitting conditional quantile models on a grid of quantile levels $\tau_j = j/m$, $j = 1, \ldots, m-1$, via quantile regression;
- Constructing an approximate conditional quantile function $\widehat{Q}(\tau \mid x)$ by linearly interpolating the fitted quantile values between adjacent grid points;
- Generating samples via inverse transform sampling: plugging uniform random variables $u \sim \mathrm{Unif}(0,1)$ into $\hat{y} = \widehat{Q}(u \mid x)$, which enables fast online conditional sample generation.
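Written out, the interpolation step takes the following form, where $\hat{q}_j(x)$ (our notation here) denotes the fitted conditional quantile at level $\tau_j$: for $\tau \in [\tau_j, \tau_{j+1}]$,

$$
\widehat{Q}(\tau \mid x) = \hat{q}_j(x) + \frac{\tau - \tau_j}{\tau_{j+1} - \tau_j}\left(\hat{q}_{j+1}(x) - \hat{q}_j(x)\right).
$$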
Key characteristics:

- Fully nonparametric conditional distribution learning, without imposing a specific parametric form on the underlying conditional distribution.
- More accurate and stable distributional learning than GAN-, diffusion-, and rectified-flow-based methods in the one-dimensional and low-dimensional output regimes most common in stochastic simulation and operations research.
- Extremely fast online sample generation: less than $0.01$ seconds to generate $10^5$ conditional observations.
- Naturally compatible with simulation-based learning and real-time decision-making problems.
This codebase includes multiple QRGMM variants discussed in the paper:

- **Standard QRGMM**: the classical linear quantile-regression-based generative metamodel.
- **Basis-function-based QRGMM**: the linear quantile-regression-based generative metamodel augmented with basis functions.
- **QRGMM-R (Rearranged QRGMM)**: enforces monotonicity across quantiles to eliminate quantile crossing.
- **Neural-network-based QRGMM (QRNN-GMM)**: uses quantile regression neural networks for high-dimensional and nonlinear settings.
- **Multi-output QRGMM**: supports vector-valued simulator outputs.
The repository is organized into two complementary layers:
End-to-end pipelines and the full source code used to reproduce all numerical experiments in the paper:
```
Experiments/
├── Artificial_Test_Problems/
├── Esophageal_Cancer_Simulator/
└── Bank_Simulator/
```
Each pipeline includes:

- data generation from simulators;
- quantile regression fitting;
- QRGMM training and sampling;
- comparisons with CWGAN, Diffusion, and Rectified Flow;
- Jupyter notebooks for tables and figures.
For a detailed, experiment-by-experiment description of the replication pipelines, execution order, environment setup, and implementation details, please refer to README.pdf.
Reusable implementations of QRGMM and its variants:
```
QRGMM/
├── Vanilla_Python/
├── Basis_Function_Python/
├── Basis_Function_MATLAB/
├── QRGMM_R/                    # Rearranged QRGMM (QRGMM-R)
└── MutiOutput_NeuralNetwork/
```
This structure organizes the QRGMM implementations used in the paper's numerical experiments in a unified and modular manner. Each directory corresponds to a specific QRGMM variant or implementation setting and provides a complete end-to-end pipeline, including data preparation, QRGMM fitting, and online conditional sample generation. For details of the underlying datasets, experimental settings, and application-specific configurations, see the paper and README.pdf.
We recommend using Anaconda to ensure reproducibility.
```bash
conda env create -n QRGMM -f environment_QRGMM_history.yml
conda activate QRGMM
```
Main dependencies:

- Python ≥ 3.9
- numpy, scipy, pandas, matplotlib
- statsmodels (quantile regression)
- torch (for CWGAN, Diffusion, Rectified Flow)
- joblib (parallel quantile fitting)
R and MATLAB are used for specific components, including quantile regression estimation and simulation procedures.
A minimal QRGMM workflow consists of:

1. Generate training data from a simulator
2. Fit quantile regressions on a predefined grid
3. Store the quantile coefficients
4. Generate conditional samples using QRGMM
Conceptually:

Simulator → Quantile Regression → Quantile Coefficients → Fast Sampling

Both the Python and MATLAB implementations follow this same logic, making the framework easy to understand and modify.
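As a minimal, self-contained sketch of this workflow in Python, the snippet below fits linear quantile regressions with statsmodels and then samples by inverse transform. The toy simulator, grid size `m`, and all variable names are ours for illustration; they are not the repository's API.

```python
import numpy as np
import statsmodels.api as sm

# Toy stochastic simulator (illustrative): Y | x ~ Normal(2x, (1 + x)^2)
rng = np.random.default_rng(0)
n, m = 2000, 100                         # training size, quantile grid size
x = rng.uniform(0, 1, size=n)
y = 2 * x + (1 + x) * rng.standard_normal(n)

# Steps 1-2: fit linear quantile regressions on the grid tau_j = j/m
X = sm.add_constant(x)                   # design matrix with intercept
taus = np.arange(1, m) / m
coefs = np.vstack([
    sm.QuantReg(y, X).fit(q=tau).params  # one coefficient vector per level
    for tau in taus
])                                       # shape: (m-1, d+1)

# Step 3: pad both ends so interpolation needs no boundary checks
fast = np.vstack([coefs[0], coefs, coefs[-1]])

# Step 4: inverse transform sampling at a new covariate value
x_star = np.array([1.0, 0.5])            # [intercept, x]
u = rng.uniform(size=10)
le = 1.0 / m                             # quantile grid spacing
order = (u // le).astype(int)            # lower grid index per draw
w = (u - order * le) / le                # interpolation weight in [0, 1)
q_lo, q_hi = fast[order] @ x_star, fast[order + 1] @ x_star
samples = q_lo * (1 - w) + q_hi * w
print(samples)
```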
A key focus of this codebase is fast online sample generation, which is critical when QRGMM is used for real-time decision-making problems. The implementations therefore emphasize vectorized computation, parallelism, and the avoidance of explicit loops and conditional branching.
The online QRGMM sampling algorithm is implemented using pure matrix operations, avoiding explicit for loops and if–else branching. Given a batch of covariates (one row per sample to be generated), all samples are produced in a single vectorized pass.
A key trick is to map uniform random variables directly to quantile indices via integer division. For a quantile grid with spacing `le = 1/m`, the lower grid index for a draw `u` is simply `u // le`, and the remainder `u - (u // le) * le` determines the interpolation weight. For example, with `m = 100` (so `le = 0.01`) and `u = 0.537`, the index is 53 and the interpolation weight is 0.7.
An illustrative implementation is:

```python
import numpy as np

# Assumed globals, prepared offline:
#   le         - quantile grid spacing, 1/m
#   d          - covariate dimension (the design matrix has d+1 columns,
#                including the intercept)
#   fastmodels - padded coefficient matrix with m+1 rows; columns 1..d+1
#                hold the regression coefficients (column 0 is metadata,
#                e.g. the quantile level)

def QRGMM(X):
    output_size = X.shape[0]
    u = np.random.rand(output_size)        # one uniform draw per sample
    order = (u // le).astype(np.uint64)    # lower quantile grid index
    alpha = u - order * le                 # offset within the grid cell
    b1 = fastmodels[order, 1:(d + 2)]      # coefficients at the lower level
    b2 = fastmodels[order + 1, 1:(d + 2)]  # coefficients at the upper level
    w = alpha.reshape(-1, 1) / le          # interpolation weight in [0, 1)
    b = b1 * (1 - w) + b2 * w              # interpolated coefficient vectors
    sample = np.sum(b * X, axis=1)         # row-wise inner product with X
    return sample
```

This design avoids explicit for loops and if–else branching and enables efficient batch sampling.
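For example, a hypothetical call (assuming `le`, `d`, and `fastmodels` have been set up as above, with `d = 1`) generates $10^5$ samples at a single covariate point in one shot:

```python
# Hypothetical usage: 10^5 samples at x = 0.5, rows of [intercept, x]
X_batch = np.tile([1.0, 0.5], (100_000, 1))
samples = QRGMM(X_batch)
```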
To avoid special-case handling at the boundary quantiles, the quantile regression coefficient matrix is expanded at both ends by duplicating the first and last quantile coefficients. This allows interpolation to be handled uniformly for all samples without conditional checks, and enables fully vectorized computation throughout the online sampling stage.
```python
# Pad the (m-1) x (d+2) coefficient matrix to (m+1) x (d+2) by duplicating
# the first and last rows, so boundary draws need no special handling.
fastmodels = np.zeros((nmodels.shape[0] + 2, nmodels.shape[1]))
fastmodels[0, :] = nmodels[0, :]        # duplicate the lowest quantile level
fastmodels[1:-1, :] = nmodels[:, :]     # original fitted coefficients
fastmodels[-1, :] = nmodels[-1, :]      # duplicate the highest quantile level
```
Quantile regression models at different quantile levels are mutually independent. As a result, the offline training stage naturally supports parallel computation. In the Python implementation, this is achieved using joblib, while in the R implementation it is handled via future.apply. Parallelization substantially reduces training time when a fine quantile grid is used.
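As a hedged sketch of what joblib-based parallel fitting can look like (the function and variable names here are ours, and the repository's actual training scripts may differ):

```python
import numpy as np
import statsmodels.api as sm
from joblib import Parallel, delayed

def fit_one_level(tau, y, X):
    # Fit a single linear quantile regression at level tau.
    return sm.QuantReg(y, X).fit(q=tau).params

# Illustrative data; in practice (y, X) come from the simulator.
rng = np.random.default_rng(1)
X = sm.add_constant(rng.uniform(0, 1, size=500))
y = X[:, 1] + rng.standard_normal(500)

m = 100
taus = np.arange(1, m) / m
# The m-1 fits are mutually independent, so they parallelize embarrassingly.
coef_list = Parallel(n_jobs=-1)(
    delayed(fit_one_level)(tau, y, X) for tau in taus
)
coefs = np.vstack(coef_list)   # shape: (m-1, d+1)
```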
In applications where a large number of observations (e.g., $10^5$) must be generated at a single fixed covariate vector $x^\ast$, the conditional quantile curve at $x^\ast$ can be evaluated once, augmented at both ends, and then reused for all draws. This approach avoids processing each random draw of the uniform random variable individually and further improves computational efficiency.
```python
def QRGMM_xstar(x, k):
    # QRGMM online algorithm for a fixed covariate vector:
    # input covariates x of shape (1, d+1), output a vector of k samples.
    # Assumes globals m, le (= 1/m), d, and the coefficient matrix nmodels.
    quantile_curve = np.reshape(np.dot(nmodels[:, 1:(d + 2)], x.T), -1)
    # Augment both ends so interpolation needs no boundary checks.
    quantile_curve_augmented = np.zeros(m + 1)
    quantile_curve_augmented[0] = quantile_curve[0]
    quantile_curve_augmented[1:m] = quantile_curve
    quantile_curve_augmented[m] = quantile_curve[-1]
    # Vectorized inverse transform sampling on the precomputed curve.
    u = np.random.rand(k)
    order = (u // le).astype(np.uint64)
    alpha = u - order * le
    q1 = quantile_curve_augmented[order]
    q2 = quantile_curve_augmented[order + 1]
    q = q1 * (1 - alpha / le) + q2 * (alpha / le)
    return q
```

In QRNN-based QRGMM, especially in the multi-output setting, a major computational bottleneck arises from standard inverse transform sampling: to generate samples, one typically needs to evaluate all quantile models in order to construct the full conditional quantile curve, and then perform interpolation. Moreover, in the general setting where samples are associated with different covariates, each distinct conditioning input and distinct quantile level may require a separate call to qrnn.predict(), making it impossible to completely eliminate looping over samples or quantile levels.
This limitation persists even in the most favorable case where all samples share the same covariates x. While a single precomputed quantile curve can be reused for the first output dimension, once the first output dimension has been generated, each sample's realized value becomes part of its conditioning input for the next dimension; the conditioning inputs then differ across samples, and the precomputed curve can no longer be shared.
Our implementation resolves this bottleneck by combining on-demand computation with inverted indexing. Instead of first evaluating the entire quantile curve, we generate the uniform random variables up front, use them to determine which quantile grid indices each sample actually needs, and evaluate only the corresponding quantile models.
Concretely, samples are grouped (“bucketed”) according to their lower and upper quantile level indices. For each sample, only the two adjacent quantile models (the lower and upper bounds) are required for interpolation; thus, each sample is passed through the QRNN at most twice, rather than through all ntau fitted quantile models.
Moreover, because qrnn.predict() can efficiently evaluate the same quantile level for a batch of different covariates in a single call, QRNN predictions are performed once per quantile level on a batched subset, rather than once per sample. As a result, the only remaining explicit loop runs over the ntau quantile levels, not over individual samples.
In the special but common case where a large number of samples (e.g., $10^5$) share the same covariate vector, the quantile curve is evaluated once, using a single loop over the ntau quantile levels, and then reused for every draw, as in predicting_x0 below.
```r
# Generate one sample per row of testx, assuming all rows share the same
# covariates: the quantile curve is evaluated once and reused for every draw.
# Assumes globals: interval (quantile grid spacing) and ntau (number of levels).
predicting_x0 <- function(testx, fitmodel){
  ntestx <- nrow(testx)
  u <- runif(ntestx)
  low_ind <- pmax(1, u %/% interval)          # lower quantile grid index
  up_ind  <- pmin(ntau, u %/% interval + 1)   # upper quantile grid index
  weight  <- (u %% interval) / interval       # interpolation weight in [0, 1)
  # Evaluate the full quantile curve once at the shared covariate vector.
  testx0 <- matrix(testx[1, ], 1, ncol(testx))
  quantile_curve <- matrix(0, ntau, 1)
  for (i in 1:ntau) {
    quantile_curve[i] <- qrnn.predict(testx0, fitmodel[[i]])
  }
  low_geny <- quantile_curve[low_ind]
  up_geny  <- quantile_curve[up_ind]
  geny <- low_geny + weight * (up_geny - low_geny)
  return(geny)
}
```
```r
# General case: each sample has its own covariates. Samples are bucketed by
# their lower/upper quantile indices, and qrnn.predict() is called once per
# quantile level on the batched subset of samples that need it.
predicting <- function(testx, fitmodel){
  ntestx <- nrow(testx)
  u <- runif(ntestx)
  low_ind <- pmax(1, u %/% interval)          # lower quantile grid index
  up_ind  <- pmin(ntau, u %/% interval + 1)   # upper quantile grid index
  weight  <- (u %% interval) / interval       # interpolation weight in [0, 1)
  low_geny <- matrix(0, ntestx, 1)
  up_geny  <- matrix(0, ntestx, 1)
  for (i in 1:ntau) {
    # Batched prediction: evaluate level i only for the samples that need it.
    low_geny[which(low_ind == i)] <- qrnn.predict(testx[which(low_ind == i), ], fitmodel[[i]])
    up_geny[which(up_ind == i)]   <- qrnn.predict(testx[which(up_ind == i), ], fitmodel[[i]])
  }
  geny <- low_geny + weight * (up_geny - low_geny)
  return(geny)
}
```

All experimental results reported in the paper can be reproduced using this repository. Random seeds are fixed throughout, and all figures and tables are generated via documented Jupyter notebooks.
If you use this codebase, please cite:
Hong, L. J., Hou, Y., Zhang, Q., et al. (2023). Learning to simulate: Generative metamodeling via quantile regression. arXiv preprint arXiv:2311.17797.
For questions, suggestions, or bug reports, please contact:
Qingkai Zhang (Email: 22110690021@m.fudan.edu.cn)