sensitivity

Selecting a dataset size for machine learning is a challenging open problem. Typically, there is a relationship between training dataset size and model performance, especially for nonlinear models. However, the caveat is that such a relation might not exist for some models and datasets.

A sensitivity analysis forms the basis for testing different model types and model configurations in addition to the ones chosen at the outset to address a given problem. This helps in evaluating a better model algorithm and decide on a rough estimate of the training data needed to build the predictive model, apart from the statistical heuristics such as number of classes (classification problem), number of input features or the number of model parameters that are considered.

A good read on the effectiveness of larger datasets in deep learning models: https://arxiv.org/abs/1707.02968

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
notebooks		notebooks
1707.02968.pdf		1707.02968.pdf
README.md		README.md
requirement.txt		requirement.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

notebooks

notebooks

1707.02968.pdf

1707.02968.pdf

README.md

README.md

requirement.txt

requirement.txt

Repository files navigation

sensitivity

About

Releases

Packages

Contributors 2

Languages

ranja-sarkar/sensitivity

Folders and files

Latest commit

History

Repository files navigation

sensitivity

About

Topics

Resources

Stars

Watchers

Forks

Languages