
NFLT journal repository

This repository contains binary-class, multi-class, and regression datasets, alongside the R scripts used to show empirically that the No Free Lunch Theorem (NFLT) of statistical machine learning indeed holds for every learning problem considered.

Abstract

In this paper, we provide a substantial empirical demonstration of the statistical machine learning result known as the No Free Lunch Theorem (NFLT). We compare the predictive performances of a wide variety of machine learning algorithms/methods on a wide variety of qualitatively and quantitatively different datasets. Our work provides strong evidence in favor of the NFLT by using an overall ranking of methods and their corresponding learning machines, revealing that none of the learning machines considered predictively outperforms all the others on all the widely different datasets analyzed. It is noteworthy, however, that while the evidence from the various datasets and methods supports the NFLT rather emphatically, some learning machines, such as Random Forest, Adaptive Boosting, and Support Vector Machines (SVM), tend to yield predictive performances that are almost always among the best.
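To make the ranking idea concrete, the R sketch below shows one way such an overall ranking could be computed: learning machines are ranked within each dataset by their test-set score, and the ranks are then averaged across datasets. The `scores` matrix and its values are purely hypothetical illustrations, not results from the paper.

```r
# Hypothetical matrix of test-set scores (rows = datasets, columns = learning
# machines), where smaller values mean better performance (e.g. misclassification
# rate). The numbers are illustrative only.
scores <- matrix(
  c(0.10, 0.12, 0.15,
    0.22, 0.18, 0.25,
    0.05, 0.07, 0.06),
  nrow = 3, byrow = TRUE,
  dimnames = list(
    c("dataset_A", "dataset_B", "dataset_C"),
    c("RandomForest", "AdaBoost", "SVM")
  )
)

# Rank the machines within each dataset (1 = best), then average the ranks
# across datasets to obtain an overall ranking of the methods.
per_dataset_ranks <- t(apply(scores, 1, rank))
overall_ranking   <- sort(colMeans(per_dataset_ranks))
overall_ranking
```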

Keywords: Learning Machine, Generalization, Bayes Risk, Predictive Performance, No Free Lunch Theorem (NFLT), Empirical Evidence, Statistical Learning, Data Science, Dataset, Function Space, Random Split, Score Function.

Implementation

To provide tangible practical evidence that the NFLT is indeed valid, we trained fifteen (15) different models, spanning linear and non-linear as well as parametric and non-parametric methods, on different binary-class, multi-class, and regression datasets, using 80% of each dataset for training and the remaining 20% for assessing model performance. Evaluation of the models on the test set (misclassification rate) showed that each learning model performs differently across the various datasets involved. A sketch of this protocol is given below.
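The following R sketch illustrates the protocol for a single learner on a single dataset: a random 80/20 split, model fitting on the training portion, and the misclassification rate computed on the held-out 20%. The use of the randomForest package and a two-class subset of the built-in iris data is an assumption made here purely for illustration; the repository's own R scripts should be consulted for the exact models and datasets.

```r
# Minimal sketch of the 80/20 evaluation protocol, not the repository's exact scripts.
library(randomForest)  # assumed available; any of the 15 learners could be slotted in

set.seed(2020)
# Illustrative binary-class data frame `df` with outcome column `y`
df <- droplevels(subset(iris, Species != "setosa"))
names(df)[names(df) == "Species"] <- "y"

# Random 80/20 split: 80% for training, 20% held out for assessing performance.
train_idx <- sample(seq_len(nrow(df)), size = floor(0.8 * nrow(df)))
train_set <- df[train_idx, ]
test_set  <- df[-train_idx, ]

# Fit one learning machine (Random Forest here) and predict on the test set.
fit  <- randomForest(y ~ ., data = train_set)
pred <- predict(fit, newdata = test_set)

# Misclassification rate on the held-out 20%.
misclassification_rate <- mean(pred != test_set$y)
misclassification_rate
```

Repeating this step for each of the fifteen learners on each dataset, and comparing the resulting test-set scores, is what reveals that no single machine dominates across all datasets.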