This is a place to hold some simple matplotlib code which creates hypothetical learning curves. These learning curves live within a new framework which Jiani Yan and I have created which is in response to the recent rise in the application of machine and deep learning methods. Our preprint is entitled `On the Unknowable Limits to Prediction', and can be found on arXiV here:
A very quick standfirst for this work is:
We critique the classic dichotomization of prediction error into reducible and irreducible components, noting that certain types of error can be eliminated at differential speeds. We propose an improved analytical framework that better distinguishes epistemic from aleatoric uncertainty, emphasizing that predictability depends on information sets. We also caution against premature claims of unpredictability.
See ./src
for code which makes the figure, and ./output
for the figure itself. Within ./src
, we have both interactive jupyter notebooks (the .iypnb
s) which make the two figures in the Correspondence (Figure 1 and Supplementary Information Figure 1), and a Python file (the figure_helpers.py
) which contains helper functions. A requirements.txt
accompanies this repository, and can be used to install the necessary packages. It contains nothing more than a standard Anaconda build.
If you would be interested in citing this work, please use the following BibTeX entry:
@misc{yan2024unknowablelimitsprediction,
title={On the Unknowable Limits to Prediction},
author={Jiani Yan and Charles Rahal},
year={2024},
eprint={2411.19223},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2411.19223},
}
or copy and paste this Chicago style citation:
Yan, Jiani, and Charles Rahal. "On the Unknowable Limits to Prediction." arXiv preprint arXiv:2411.19223 (2024).
There is no data.
This work is made available under a GNU General Public License v3.0.