Skip to content
Please note that GitHub no longer supports Internet Explorer.

We recommend upgrading to the latest Microsoft Edge, Google Chrome, or Firefox.

Learn more
Real numbers, data science and chaos: How to fit any dataset with a single parameter
Jupyter Notebook TeX Other
Branch: master
Clone or download
Ranlot Merge pull request #3 from jamesmyatt/download-cifar10-binder
Use postBuild to download cifar10 data for binder
Latest commit 677ea41 May 27, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
resources created a README Apr 30, 2019
.gitignore
1904.12320.pdf PDF from arXiv Apr 30, 2019
README.md Add binder link May 27, 2019
animalShapes.ipynb
cifar10.ipynb notebooks to reproduce all plots Apr 27, 2019
dyadicDecoder.section1.ipynb
generalization.timeSeries.ipynb notebooks to reproduce all plots Apr 27, 2019
helloWorld.audio.ipynb notebooks to reproduce all plots Apr 27, 2019
helperFunctions.py notebooks to reproduce all plots Apr 27, 2019
logisticDecoder.section2.ipynb notebooks to reproduce all plots Apr 27, 2019
main.tex
postBuild Use postBuild to download cifar10 data for binder May 27, 2019
requirements.txt

README.md

Real numbers, data science and chaos: How to fit any dataset with a single parameter

All details and more examples can be found in the accompanying arXiv:1904.12320 paper (also hosted here).

Binder

We show how any dataset of any modality (time-series, images, sound...) can be approximated by a well-behaved (continuous, differentiable...) scalar function with a single real-valued parameter:

Building upon elementary concepts from chaos theory, we adopt a pedagogical approach demonstrating how to adjust this parameter in order to achieve arbitrary precision fit to all samples of the data. Targeting an audience of data scientists with a taste for the curious and unusual, the results presented here expand on previous similar observations regarding expressiveness power and generalization of machine learning models.

As a real number, the parameter α is non-terminating and its capacity to encode an infinite amount of information is used to translate any arbitrary dataset into a single numerical value.

As such, there is no reason to expect this model to provide any kind of generalization to data outside of its training samples as demonstrated by the time series below:

You can’t perform that action at this time.