 ## Multicriteria Decision Analysis for Benefit-Risk Analysis of Medicinal Products
 
### Data and Deterministic Model

The data consist of $p$ variables observed on $n$ individuals. Let $Y_{ij}$ represent the observation of the $j$-th variable on the $i$-th individual ($i=1,\dots,n$, $j=1,\dots,p$). Furthermore, denote the column vector of observations corresponding to the $j$-th variable by $Y_{\cdot j}=(Y_{1j},\dots,Y_{nj})^T$ and the row vector of data on the $i$-th individual by $Y_{i\cdot}=(Y_{i1},\dots,Y_{ip})$, and let $Y$ be the $n\times p$ matrix of all the observations.
 
The data typically consist of continuous measurements, binary variables or counts. Each variable is assigned a distribution, most often the Normal, Bernoulli or Poisson, depending on the variable type. The standard MCDA procedure in the context of benefit-risk analysis is to approve the use of the drug with the highest score. Denoting by $\mu_j=E(Y_{ij})$ the mean of the $j$-th variable, the score of drug $d$ is defined as

$$
S_d(\mu)=\sum_{j=1}^p w_j h_j(\mu_j),
$$

where ... Since each $\mu_j$ is unknown, an estimate based on the data, denoted by 
$\hat{\mu}_j=\hat{\mu}_j\left(Y_{\cdot j}\right)$, is used instead, giving the following estimate of the drug score:

\begin{equation}
\label{drug score}
\hat{S}_d(Y)=\sum_{j=1}^p w_j h_j\left( \hat{\mu}_j\right).
\end{equation}
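
As a minimal sketch of the plug-in estimate above, the snippet below computes $\hat{S}_d(Y)$ from a data matrix. It assumes the sample mean is used for each $\hat{\mu}_j$ (the text only says "an estimate"); the function name `plug_in_score`, the weights and the value functions are hypothetical and chosen purely for illustration.

```python
import numpy as np

def plug_in_score(Y, weights, value_functions):
    """Plug-in drug score: sum_j w_j * h_j(mu_hat_j).

    Assumption: mu_hat_j is the sample mean of column j of Y.
    """
    Y = np.asarray(Y, dtype=float)
    mu_hat = Y.mean(axis=0)  # one estimate per variable
    return sum(w * h(m) for w, h, m in zip(weights, value_functions, mu_hat))

# Toy data: n = 4 individuals, p = 2 variables (one continuous, one binary).
Y = np.array([[1.0, 0.0],
              [2.0, 1.0],
              [3.0, 1.0],
              [2.0, 0.0]])
weights = [0.6, 0.4]                    # hypothetical weights w_j
value_functions = [lambda m: m / 4.0,   # hypothetical linear h_1 (benefit)
                   lambda m: 1.0 - m]   # hypothetical h_2 (lower event rate is better)

score = plug_in_score(Y, weights, value_functions)
print(score)
```

The result is a single number with no measure of uncertainty attached, which is the limitation the next section addresses.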

### Statistical Inference

Note that the formula above does not take into account the uncertainty in the estimates $\hat{\mu}_j$. Hence it cannot be used on its own to draw statistical inference on the drug score. To address this, Wen et al (2014) adopt a Normal approximation to the distribution of each $Y_{i\cdot}$ and use the Delta method to derive the asymptotic distribution of $\hat{S}_d(Y)$. This facilitates the construction of asymptotic confidence intervals and hypothesis tests, although a Monte Carlo approach was also suggested. **KK notes: doesn't allow for mixed data correlation for binary variables, approximate, non-parsimonious**
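
For orientation, the generic first-order Delta-method variance of the plug-in score has the form below. This is the textbook statement, assuming independent rows $Y_{i\cdot}$, differentiable $h_j$ with derivative $h_j'$, and $\hat{\Sigma}_{jk}$ denoting the sample covariance between variables $j$ and $k$ (symbols introduced here for illustration); it is not necessarily the exact expression used by Wen et al (2014).

$$
\widehat{\mathrm{Var}}\left(\hat{S}_d(Y)\right)\approx \frac{1}{n}\sum_{j=1}^p\sum_{k=1}^p w_j w_k\, h_j'\left(\hat{\mu}_j\right) h_k'\left(\hat{\mu}_k\right)\hat{\Sigma}_{jk}.
$$

The off-diagonal terms $j\neq k$ are where the correlation between variables enters.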

A Bayesian approach was adopted in Phillips et al (??? Modelling Rosi for ....). The parameters $\mu_j$ were assumed to be independent a posteriori, so the joint posterior is the product of the individual $\mu_j$ posteriors. The latter were obtained by assigning a conjugate prior to each $\mu_j$ depending on the variable type. Being able to sample from the joint posterior of the $\mu_j$'s makes it possible to draw samples from the posterior of $S_d$ and to calculate quantities such as the posterior probability of one drug having a better score than another. **KK notes: independence assumption, posterior vs predictive distribution**.
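
The independent-posterior scheme described above could be sketched as follows. This is an illustration under stated assumptions, not the cited authors' implementation: the priors (Beta(1, 1) for Bernoulli, Gamma(1, 1) for Poisson, a flat prior with the variance fixed at its sample estimate for the Normal), the simulated data, the weights and the value functions are all hypothetical.

```python
import numpy as np

def sample_independent_posteriors(Y, kinds, n_draws, rng):
    """Draw each mu_j independently from a simple posterior for its variable type."""
    n, p = Y.shape
    draws = np.empty((n_draws, p))
    for j, kind in enumerate(kinds):
        y = Y[:, j]
        if kind == "bernoulli":    # Beta(1, 1) prior -> Beta posterior
            draws[:, j] = rng.beta(1 + y.sum(), 1 + n - y.sum(), size=n_draws)
        elif kind == "poisson":    # Gamma(1, rate 1) prior -> Gamma posterior
            draws[:, j] = rng.gamma(shape=1 + y.sum(), scale=1.0 / (1 + n), size=n_draws)
        elif kind == "normal":     # flat prior, variance fixed at the sample estimate
            draws[:, j] = rng.normal(y.mean(), y.std(ddof=1) / np.sqrt(n), size=n_draws)
        else:
            raise ValueError(f"unknown variable type: {kind}")
    return draws

def score_draws(mu_draws, weights, value_functions):
    """Evaluate S_d(mu) = sum_j w_j h_j(mu_j) for every posterior draw."""
    terms = [w * h(mu_draws[:, j])
             for j, (w, h) in enumerate(zip(weights, value_functions))]
    return np.sum(terms, axis=0)

def simulate(n, benefit_mean, rng):
    """Hypothetical trial arm: one continuous benefit, one binary and one count risk."""
    return np.column_stack([rng.normal(benefit_mean, 1.0, n),
                            rng.binomial(1, 0.2, n),
                            rng.poisson(1.0, n)])

rng = np.random.default_rng(0)
kinds = ["normal", "bernoulli", "poisson"]
weights = [0.5, 0.3, 0.2]                                          # hypothetical
value_functions = [lambda m: m, lambda m: 1.0 - m, lambda m: -m]   # hypothetical

Y_a, Y_b = simulate(200, 1.5, rng), simulate(200, 0.5, rng)
S_a = score_draws(sample_independent_posteriors(Y_a, kinds, 5000, rng), weights, value_functions)
S_b = score_draws(sample_independent_posteriors(Y_b, kinds, 5000, rng), weights, value_functions)
prob_a_better = np.mean(S_a > S_b)   # Monte Carlo estimate of P(S_a > S_b | data)
```

Because each column is sampled on its own, any correlation between the variables is ignored by construction, which is the independence assumption flagged in the note above.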



Check also [here](https://onlinelibrary.wiley.com/doi/epdf/10.1002/pds.3880) and [here](https://spiral.imperial.ac.uk/handle/10044/1/29021), although probably not too statistical. Maybe useful for the previous part though.


### Proposed approach

Our proposed approach retains the desirable characteristics of the existing framework, such as handling parameter uncertainty, and improves on it in the following ways:

 1. Accounts for dependence while allowing for mixed data, without relying on asymptotic normality approximations.
 2. Allows using the posterior predictive distribution.
 3. Addresses issues of model goodness of fit and parsimony.
 
It consists of the following steps:
 
  1. Assign a model to $Y$ that allows for dependence and sample from the posterior of the parameter vector, which includes the $\mu_j$'s.
  2. Plug each posterior MCMC sample $k$ of the $\mu_j$'s into $S_d(\mu)$ to obtain a sample from the posterior of $S_d(\mu)$. These samples can be used to ... 
  3. For each posterior MCMC sample $k$ of the parameters, draw a $Y_i^{(k)}$ given the value of the parameters to obtain a sample from the posterior predictive distribution. These samples can be used to calculate quantities such as the probability of one medicinal product having a higher score than the other.
  4. Define the individual score 
  
  $$
  s_d(Y_{i\cdot})=\sum_{j=1}^p w_j h_j\left( Y_{ij}\right).
  $$
  
  Note that for linear $h_j(\cdot)$ we get $E\left[ s_d(Y_{i\cdot})\right]=S_d(\mu)$.
  
  5. For each posterior MCMC sample $k$ of the parameters, draw a $Y_i^{(k)}$ given the value of the parameters to obtain a sample from the posterior predictive distribution. These samples can be used to calculate quantities such as the probability of one medicinal product having a higher score than the other.
  
  
**KK:** Not sure about 4 and 5. We may not use them.
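
To make the difference between the parameter-level score and the individual (posterior predictive) score concrete, here is a rough sketch of steps 2 and 5. It assumes a multivariate Normal model for $Y_{i\cdot}$ and identity $h_j$, and it uses synthetic stand-ins for the MCMC output (`fake_posterior_draws` is a hypothetical placeholder, as are the weights and centers); in practice the draws would come from the sampler in step 1.

```python
import numpy as np

rng = np.random.default_rng(1)
K, p = 2000, 3
weights = np.array([0.5, 0.3, 0.2])   # hypothetical weights; h_j taken as identity

def fake_posterior_draws(center, rng):
    """Placeholder for MCMC output: K draws of (mu, Sigma). Not a real posterior."""
    mu = center + 0.05 * rng.standard_normal((K, p))
    Sigma = np.tile(0.5 * np.eye(p) + 0.5, (K, 1, 1))   # 1 on diagonal, 0.5 off-diagonal
    return mu, Sigma

def predictive_draws(mu, Sigma, rng):
    """Step 5: one Y^(k) ~ N(mu^(k), Sigma^(k)) for each posterior draw k."""
    chol = np.linalg.cholesky(Sigma)            # batched Cholesky, shape (K, p, p)
    z = rng.standard_normal(mu.shape)
    return mu + np.einsum("kij,kj->ki", chol, z)

mu_a, Sigma_a = fake_posterior_draws(np.array([1.0, 0.5, 0.2]), rng)
mu_b, Sigma_b = fake_posterior_draws(np.array([0.8, 0.5, 0.2]), rng)

# Step 2: posterior draws of the parameter-level score S_d(mu).
S_a, S_b = mu_a @ weights, mu_b @ weights
p_param = np.mean(S_a > S_b)

# Step 5: posterior predictive draws of the individual score s_d(Y_i).
s_a = predictive_draws(mu_a, Sigma_a, rng) @ weights
s_b = predictive_draws(mu_b, Sigma_b, rng) @ weights
p_pred = np.mean(s_a > s_b)
```

With these stand-in draws the parameter-level probability comes out close to one while the predictive probability stays near one half, because the predictive comparison also carries the between-individual variability in $\Sigma$.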


## Motivating Example

Wen et al (2014) provide some justification for including the correlations. Further motivation, from the Bayesian angle, is provided by the following example.

 1. Assume $Y_i\sim N(\mu,\Sigma)$ where $\mu=(\mu_1,\dots,\mu_p)$ and $\Sigma$ is a $p\times p$ covariance matrix (i.e. symmetric and positive definite). Further assume linear $h_j(\cdot)$'s. Without loss of generality we can write 
 
 $$
S_d(\mu)=\sum_{j=1}^p w_j \mu_j.
 $$
 
 2. Assign some vague priors on $\mu$, $\Sigma$. The posterior for $\mu$ conditional on $\Sigma$ is the following Normal distribution:
 
 ...
 
 The marginal posterior of $\mu$ is a multivariate $t$ with ... degrees of freedom, location ... and scale matrix ....
 
 3. The posterior of $S_d(\mu)$ conditional on $\Sigma$ is the following Normal distribution ... (**KK:** You can check the Delta method of Wen et al (2014) here, i.e. standard properties of the multivariate Normal).
 
 4. In the case of diagonal $\Sigma$ we get ...
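
The step from the posterior of $\mu$ to the posterior of $S_d(\mu)$ in item 3 relies only on the closure of the multivariate Normal under linear maps. As a sketch, write the conditional posterior generically as $\mu\mid\Sigma,Y\sim N(m,V)$, where $m$ and $V$ are placeholder symbols (not defined in the text) standing for the location and covariance left unspecified above. Then

$$
S_d(\mu)\mid\Sigma,Y\sim N\left(\sum_{j=1}^p w_j m_j,\ \sum_{j=1}^p\sum_{k=1}^p w_j w_k V_{jk}\right),
$$

and if $V$ is diagonal the variance reduces to $\sum_{j=1}^p w_j^2 V_{jj}$. The two variances differ by the cross terms $2\sum_{j<k} w_j w_k V_{jk}$, which is the quantity an analytic contrast between the two cases would examine.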
 
To contrast the two posteriors of $S_d(\mu)$ analytically.

To also provide a numerical illustration as below:

 - Take $n=500$, $p=5$, $\Sigma$ having $1$'s in the diagonal and $\rho=0.5$ in the off-diagonal. Generate data from this model.
 - Set up some reasonable weights and $h(\cdot)$'s to explicitly define $S_d(\mu)$.
 - Derive the posterior distributions $S_d(\mu)$ based on the simulated data above.
 - Using the posterior distributions and Monte Carlo, calculate for each case (diagonal or not) the probability $P(S_d(\mu)< c)$ (if large scores are bad) or $P(S_d(\mu)> c)$ (if large scores are good), for a meaningful choice of $c$. 
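
One possible sketch of the numerical illustration outlined above is given below. It makes several simplifying assumptions that the text does not fix: a flat prior on $\mu$ with $\Sigma$ treated as known, so that $\mu\mid\Sigma,Y\sim N(\bar{Y},\Sigma/n)$; identity $h_j$; and hypothetical values for the true means `mu_true`, the weights `w` and the threshold `c`. The "diagonal" case simply drops the off-diagonal entries of $\Sigma$.

```python
import numpy as np

rng = np.random.default_rng(2024)
n, p, rho = 500, 5, 0.5
Sigma = np.full((p, p), rho) + (1 - rho) * np.eye(p)   # 1 on diagonal, rho off-diagonal
mu_true = np.array([1.0, 0.8, 0.6, 0.4, 0.2])          # hypothetical true means
w = np.full(p, 1.0 / p)                                # hypothetical equal weights
c = 0.65                                               # hypothetical threshold

# Generate data from the model.
Y = rng.multivariate_normal(mu_true, Sigma, size=n)
ybar = Y.mean(axis=0)

def score_posterior_draws(cov_mu, n_draws=20000):
    """Monte Carlo draws of S_d(mu) = w' mu when mu | Y ~ N(ybar, cov_mu).

    Assumption: flat prior on mu and known covariance.
    """
    mu_draws = rng.multivariate_normal(ybar, cov_mu, size=n_draws)
    return mu_draws @ w

Sigma_diag = np.diag(np.diag(Sigma))
S_full = score_posterior_draws(Sigma / n)        # accounts for correlation
S_diag = score_posterior_draws(Sigma_diag / n)   # ignores correlation

prob_full = np.mean(S_full > c)   # P(S_d(mu) > c) with dependence
prob_diag = np.mean(S_diag > c)   # P(S_d(mu) > c) under independence

# Closed-form posterior standard deviations for comparison.
sd_full = np.sqrt(w @ Sigma @ w / n)
sd_diag = np.sqrt(w @ Sigma_diag @ w / n)
print(prob_full, prob_diag, sd_full, sd_diag)
```

Under these settings the posterior standard deviation of the score is larger by a factor of $\sqrt{3}$ when the correlation is retained, so tail probabilities such as $P(S_d(\mu)>c)$ can differ noticeably between the two cases.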
