Chebsampling

Inverse transform sampling with Chebyshev polynomial approximation

General remarks

These codes provide a Fortran 90 implementation of inverse transform sampling with Chebyshev polynomial approximation in one and two dimensions. Cumulative and probability distribution functions are approximated by Chebyshev polynomials, which significantly speeds up the evaluation of functions in root finding. Implementation on massively parallel computers with MPI allows for generation of large sample size. For our application, we use this sampling algorithm to load particles in particle-in-cell simulations. Similarly, this algorithm can be used to initialize other distributions in Monte Carlo simulations, molecular dynamics simulations, and gravitational simulations. The practical use of this method is demonstrated through concrete examples in space plasmas. Compared with the classical rejection sampling in low dimensions, our sampling algorithm is particularly efficient when the distribution function is highly localized into a small fraction of the domain.

Program structure

The core of Chebsampling is an iterative bisection root-finding algorithm with functions represented by Chebyshev polynomials, which is contained in the module invsampling (invsampling.f90). In one dimension, the sampling algorithm is wrapped in the subroutine inv_sampling_1D (available in module invsampling). In two dimensions, samples are generated by drawing from marignal and conditional distribution functions using the subroutine inv_sampling_1D repetitively, which is wrapped in the subroutine pp_inv_sampling_2D (available in module invsampling). The discrete Chebyshev tranform in module invsampling uses trusted fast Fourier transform code inherited from the UPIC framework (libmfft1.f, libmfft1_h.f90, modmfft1.f90; see https://github.com/UCLA-Plasma-Simulation-Group/UPIC-2.0.git).

The domain decomposition for MPI parallelization is done in the module ppush2 (ppush2.f90). In this module, subroutine pfedges2 decomposes the domain such that the number of particles in each partition is approximately the same; subroutine pdcomp2 decomposes the domain such that the number of grids in each partition is approximately the same. We use the subroutine pfedges2 in our application for better load balance. But it is also possible to pdcomp2 if uniform paritition is preferred.

The library pplib2 (pplib2.f90) for basic MPI communications is inherited from the UPIC framework.

The target distribution data is generated by the module distr (distr.f90). Users can add new subroutines in this module to generate other distributions. Alternatively, because these target distribution data are defined on grids. Users can easily modify this module to read realistic distribution data from external files.

The workflow of our sampling algorithm is wrapped in main.f90. For input, the number of particles (npx, npy) and the number of grids (ncx, ncy) are taken from the namelist 'input1.nml'. For output, the deposited density data and the target density data are stored in 'data.bin'.

A detailed description of all subroutines in the codes is available in 'manual.pdf'.

Compilation and execution

For prerequisites, users are suggested to install Linux environment with GNU compilers, openmpi and make.

A makefile is available to compile the code using either GNU or Intel compilers. To compile the code, type 'make' or 'make chebsampling'.

To run the code, type 'echo $i | mpiexec -np 16 chebsampling > timing$(printf "%03d" $i)'. Here 'i' is from 1 to 6 for the six types of target distributions built in the module distr (see main.f90 and distr.f90 for details).

A docker container to run our code and plot the output data is available at https://codeocean.com/capsule/0988490/tree/v2, where the enviroment with all the required softwares has been readily set up.

References

This sampling algorithm and its applications in space plasmas are documented in 'inv_cheb_sampling.pdf', which has been published in Journal of Geophysical Research https://doi.org/10.1029/2021JA030031 (arxiv: https://doi.org/10.48550/arXiv.2202.08203). Useful references in developing our software are given in the manuscript. In particular, we benefit a lot from the textbook 'Approximation Theory and Approximation Practice' by Nick Trefethen.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Chebsampling

General remarks

Program structure

Compilation and execution

References

About

Uh oh!

Releases 1

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
distr.f90		distr.f90
dtimer.c		dtimer.c
input1.nml		input1.nml
inv_sampling_cheb.pdf		inv_sampling_cheb.pdf
invsampling.f90		invsampling.f90
job.sampling.mpi		job.sampling.mpi
libmfft1.f		libmfft1.f
libmfft1_h.f90		libmfft1_h.f90
main.f90		main.f90
manual.pdf		manual.pdf
modmfft1.f90		modmfft1.f90
pplib2.f90		pplib2.f90
ppush2.f90		ppush2.f90

License

phyax/Chebsampling

Folders and files

Latest commit

History

Repository files navigation

Chebsampling

General remarks

Program structure

Compilation and execution

References

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages