# Model Update

### Description

Model Update keeps the digital model in sync with the real system by updating its layout and the parameters of its stochastic distributions. There are two types of model update:
1. **Logic update** - Updating the layout and logic of the system (the .json file of the digital model). This is done by regenerating the digital model using **model generation** techniques.
2. **Input update** - Updating the stochastic parameters of the system, such as the processing time distributions. These parameters are calculated per machine from the history of processing times **(Xr)** available in the real_log.
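
As a rough illustration of the input update, the sketch below groups processing times by machine and estimates the parameters of each machine's distribution. The log layout (a list of `(machine_id, processing_time)` pairs), the sample values, the helper names, and the choice of a normal distribution are all assumptions made for this example; the actual real_log schema and distribution family may differ.

```python
from collections import defaultdict
from statistics import fmean, pstdev

# Hypothetical real_log excerpt: (machine_id, processing_time_in_seconds).
# The actual real_log schema may differ; adapt the parsing accordingly.
real_log = [
    ("M1", 10.2), ("M2", 5.1), ("M1", 9.8), ("M2", 4.9),
    ("M1", 10.5), ("M2", 5.3), ("M1", 9.5), ("M2", 4.7),
]


def group_by_machine(log):
    """Collect the processing-time history Xr of every machine."""
    history = defaultdict(list)
    for machine_id, processing_time in log:
        history[machine_id].append(processing_time)
    return dict(history)


def fit_normal_mle(xr):
    """Maximum likelihood estimates (mu, sigma) of a normal distribution."""
    # The MLE of sigma divides by n, which is what pstdev computes.
    return fmean(xr), pstdev(xr)


params = {m: fit_normal_mle(xr) for m, xr in group_by_machine(real_log).items()}
for machine_id, (mu, sigma) in params.items():
    print(f"{machine_id}: mu={mu:.3f}, sigma={sigma:.3f}")
```

With only a handful of observations per machine the estimates are noisy, so the fitted parameters should be checked with a goodness-of-fit test before they are written back to the digital model.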

## Input update

### Testing Modules

##### Assumption
We assume that processing time distribution is an inherent property of the machine, independent of its environment.

##### Assumption
We assume that the dataset is drawn from a single distribution. For datasets that follow a mixture of distributions, there are several types of mixture models, such as the Gaussian mixture model (GMM) and the finite mixture model (FMM), each with its own strengths and limitations. Mixture models can be a powerful tool for analyzing datasets with complex underlying distributions, but their application requires a solid understanding of statistical modeling and estimation techniques.

##### Available test methods
1. Kolmogorov-Smirnov test (sensitive to differences in the center of the distribution)
2. Anderson-Darling test (sensitive to differences in the tails of the distribution)
3. Maximum likelihood estimation (for parameter estimation with a known distribution type)
4. Kernel density estimation (for estimating the PDF without knowing the distribution type)
5. Shapiro-Wilk test (for normality, suited to small datasets)
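
To make item 4 more concrete, here is a minimal sketch of a Gaussian kernel density estimate written with the standard library only. The function name `make_gaussian_kde`, the sample values, and the rule-of-thumb bandwidth are assumptions for illustration; in practice a library routine such as `scipy.stats.gaussian_kde` would likely be used instead.

```python
import math
from statistics import stdev


def make_gaussian_kde(samples, bandwidth=None):
    """Return a function that estimates the PDF of `samples` with Gaussian kernels."""
    n = len(samples)
    if bandwidth is None:
        # Rule-of-thumb bandwidth for roughly bell-shaped data (an assumption;
        # multimodal data usually needs a smaller, tuned bandwidth).
        bandwidth = 1.06 * stdev(samples) * n ** (-1 / 5)

    def pdf(x):
        # Average of n Gaussian bumps, one centered on each observation.
        total = sum(math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples)
        return total / (n * bandwidth * math.sqrt(2 * math.pi))

    return pdf


# Hypothetical processing times of one machine (seconds).
xr = [9.5, 9.8, 10.0, 10.1, 10.2, 10.4, 10.5, 11.0]
pdf = make_gaussian_kde(xr)
print(f"estimated density at 10.0 s: {pdf(10.0):.3f}")
```

The returned `pdf` can be evaluated on a grid and plotted against a fitted parametric density to see whether the assumed distribution type is plausible.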

##### Small sample size
Datasets with fewer than 30 data points can be considered small samples. The following academic sources discuss sample size and statistical tests for determining the distribution of a dataset:

1. Everitt, B. S. (2002). The Cambridge Dictionary of Statistics. Cambridge University Press.
2. Wilcox, R. R. (2017). Introduction to robust estimation and hypothesis testing. Academic Press.
3. Harwell, M. R., Stone, C. A., & Hsu, T. C. (1996). Sample size planning for statistical power and accuracy in parameter estimation. Boca Raton: CRC Press.
4. McDonald, J. H. (2014). Handbook of Biological Statistics. Sparky House Publishing.
5. NIST/SEMATECH e-Handbook of Statistical Methods, http://www.itl.nist.gov/div898/handbook/, accessed on January 11, 2023.

These sources provide general guidelines only; the appropriate sample size and statistical test should be chosen based on the specific research question and context.

##### Assumptions and limitations of the Kolmogorov-Smirnov (KS) and Anderson-Darling (AD) tests

1. In their standard form, both the KS test and the AD test assume that the distribution being tested is continuous; applying either test to discrete data requires a modified version or adjusted critical values.
2. The AD test is known to be more powerful than the KS test for identifying deviations from the hypothesized distribution in the tails of the distribution, but the KS test is more sensitive to differences in the center of the distribution.
3. Both tests assume that the sample data is independent and identically distributed.
4. Both tests assume that the sample data is a random sample from the population being tested.

The choice of which test to use depends on the specific context of the analysis and the distribution being tested. In general, it is recommended to use both tests and compare their results to gain a better understanding of the distribution being tested.
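
The comparison of both tests can be sketched as follows, using only the standard library. The example checks new processing times against a fully specified normal distribution, for instance the one currently stored in the digital model. The distribution parameters, the sample values, and the function names are hypothetical. The 5% critical values are the commonly tabulated ones for a fully specified continuous distribution; they do not apply when the parameters are estimated from the same sample that is being tested.

```python
import math
from statistics import NormalDist


def ks_statistic(samples, cdf):
    """Kolmogorov-Smirnov D: largest gap between empirical and hypothesized CDF."""
    xs = sorted(samples)
    n = len(xs)
    return max(
        max((i + 1) / n - cdf(x), cdf(x) - i / n)
        for i, x in enumerate(xs)
    )


def ad_statistic(samples, cdf):
    """Anderson-Darling A^2: squared CDF gap, weighted towards the tails."""
    xs = sorted(samples)
    n = len(xs)
    # Note: math.log fails if cdf returns exactly 0 or 1 (extreme outliers).
    total = sum(
        (2 * i + 1) * (math.log(cdf(xs[i])) + math.log(1.0 - cdf(xs[n - 1 - i])))
        for i in range(n)
    )
    return -n - total / n


# Hypothetical distribution currently stored in the digital model.
model = NormalDist(mu=10.0, sigma=0.5)
# Hypothetical new processing times of the same machine (seconds).
xr = [9.4, 9.7, 9.9, 10.0, 10.1, 10.3, 10.4, 10.8, 9.6, 10.2]

n = len(xr)
d = ks_statistic(xr, model.cdf)
a2 = ad_statistic(xr, model.cdf)

# 5% critical values, valid only for a fully specified continuous distribution.
KS_CRIT = 1.358  # compared with the finite-sample corrected D below
AD_CRIT = 2.492

ks_reject = d * (math.sqrt(n) + 0.12 + 0.11 / math.sqrt(n)) > KS_CRIT
ad_reject = a2 > AD_CRIT
print(f"KS: D={d:.4f}, reject={ks_reject}")
print(f"AD: A2={a2:.4f}, reject={ad_reject}")
```

If the two tests disagree, the disagreement itself is informative: a rejection by AD alone points to a mismatch in the tails, while a rejection by KS alone points to a shift near the center. Library implementations such as `scipy.stats.kstest` and `scipy.stats.anderson` are the usual choice outside of a sketch like this one.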