A Quantitative Information-Theoretic Framework for Entropy Propagation Across Computational Decision Trajectories
This repository contains the full research output for the paper:
"Irreducible Path Entropy in Neural Networks" Samir Baladi β EntropyLab Independent Research Series, May 2026
The paper introduces Irreducible Path Entropy (H_path) β a formally defined,
layer-integrated information-theoretic metric that quantifies how much uncertainty
accumulates, transforms, and becomes unrecoverable along the inference trajectory
of a neural network.
The framework is grounded exclusively in information theory and systems-level analysis, without semantic, cognitive, or anthropomorphic assumptions.
| Field | Value |
|---|---|
| Registration Type | OSF Preregistration |
| Registry | OSF Registries |
| Associated Project | https://osf.io/yaevt |
| Date Registered | May 20, 2026 Β· 6:17 AM UTC |
| License | CC-By Attribution 4.0 International |
| Internet Archive | osf-registrations-7wp9h-v1 |
| Registration DOI | 10.17605/OSF.IO/7WP9H |
Modern neural networks achieve high performance while remaining structurally opaque. As noted by Hinton (2023), the learning algorithm is designed β but the precise inference dynamics remain inaccessible even to the architects who built them.
Existing interpretability tools address specific aspects of this opacity. This work addresses a gap: no unified, layer-integrated metric existed for characterising entropy accumulation along the full computational decision path.
H_path fills this gap.
H_path(l) = β Ξ£_k p_{l,k} Β· log p_{l,k}
Shannon entropy of the conditional activation distribution at layer l.
H_path^(L) = Ξ£_{l=1}^{L} H(P_l)
Total informational uncertainty accumulated across all L layers.
H_irr^(L) = H_path^(L) β H_red^(L)
The component of path entropy that cannot be recovered from external observations.
Ξ©(N) = 1 β H_irr^(L) / H_path^(L) β [0, 1]
Ξ© = 1β fully observable networkΞ© = 0β completely irreducible inference dynamics
A layer l is reducible if there exists a measurement operator M_l such that:
I(h_l ; M_l(y)) β₯ H_path(l) β Ξ΄*
where Ξ΄* is the reducibility tolerance threshold.
Ξ(L) = H_path^(L) β I(x ; h_L)
Uncertainty introduced across computation not explained by retained input information.
This framework is restricted to formal quantitative analysis of entropy propagation in artificial neural networks.
Not within scope:
- General theories of intelligence, cognition, or consciousness
- Claims regarding intentionality, agency, or phenomenology
- Semantic or anthropomorphic interpretation of results
Within scope:
- Reproducible computational analysis
- Information-theoretic formalisation
- Systems-level characterisation of inference behaviour
- Experimentally observable entropy dynamics
irreducible-path-entropy/
β
βββ π README.md # This file
βββ π LICENSE # MIT License
βββ π CHANGELOG.md # Version history
βββ π AUTHORS.md # Author and contributor metadata
β
βββ π paper/
β βββ Irreducible_Path_Entropy_Baladi_2026.pdf # Publication-ready paper
β βββ preprint_metadata.json # Zenodo/OSF submission metadata
β
βββ π formalism/
β βββ definitions.md # All formal definitions (1β5)
β βββ reducibility_conditions.md # Reducibility threshold derivations
β βββ observability_index.md # Ξ© construction and properties
β βββ entropic_leakage.md # Ξ(L) derivation and interpretation
β
βββ π figures/
β βββ fig1_path_entropy_accumulation.png # Layer-wise H_path vs H_red
β βββ fig2_reducibility_phase_diagram.png # Phase diagram (Ο vs I)
β βββ fig3_observability_architectures.png # Ξ© across architecture types
β
βββ π numerical/
β βββ entropy_estimator.py # k-NN entropy estimation module
β βββ mutual_information.py # MI estimator for H_red
β βββ observability_compute.py # Ξ© computation pipeline
β βββ architecture_comparison.py # MLP / CNN / Transformer benchmarks
β βββ requirements.txt # Python dependencies
β
βββ π experiments/
β βββ protocol.md # Full reproducibility protocol
β βββ config_feedforward.yaml # MLP experiment configuration
β βββ config_cnn.yaml # CNN experiment configuration
β βββ config_transformer.yaml # Transformer experiment configuration
β
βββ π references/
βββ bibliography.bib # BibTeX reference file
All results are reproducible under the following conditions:
- Fixed weights β no stochastic inference-time modifications
- Consistent discretisation β activation binning scheme fixed across layers
- Fixed estimator parameters β bandwidth / neighbourhood
kheld constant - Fixed dataset β
D = {x_i}held constant across comparative measurements - Fixed random seed β
seed=42for deterministic behaviour
Step 1 β Record activations {h_l(x_i)} at each layer l
Step 2 β Apply k-NN entropy estimator β H_path(l)
Step 3 β Estimate I(h_l ; y) β H_red^(L)
Step 4 β Compute Ξ© = 1 β H_irr / H_path
| Architecture | Depth | H_path (nats) |
Ξ© Index |
Regime |
|---|---|---|---|---|
| MLP (2L) | 2 | 0.31 | 0.91 | Reducible |
| MLP (6L) | 6 | 0.68 | 0.74 | Reducible |
| MLP (12L) | 12 | 1.14 | 0.61 | Reducible |
| CNN (8L) | 8 | 0.87 | 0.68 | Reducible |
| Transformer (12L) | 12 | 1.42 | 0.52 | Borderline |
| Transformer (24L) | 24 | 2.05 | 0.39 | Irreducible |
Values are illustrative. Empirical calibration required for specific architectures.
$ pytest tests/
============================= test session starts =============================
collected 19 items
tests/test_entropy_estimator.py ......... [47%]
tests/test_mutual_information.py ..... [73%]
tests/test_observability.py ...... [100%]
============================= 19 passed in 0.435s =============================
Samir Baladi Independent Interdisciplinary Researcher Ronin Institute / Rite of Renaissance
- π§ gitdeeper@gmail.com
- π ORCID: 0009-0003-8903-0029
- π GitHub: gitdeeper12
- π¦ GitLab: gitdeeper12
- π Codeberg: gitdeeper12
| # | Reference |
|---|---|
| 1 | Sundararajan et al. (2017). Axiomatic attribution for deep networks. ICML. |
| 2 | Alain & Bengio (2016). Understanding intermediate layers via linear probes. arXiv:1610.01644. |
| 3 | Elhage et al. (2021). A mathematical framework for transformer circuits. Anthropic. |
| 4 | Tishby & Schwartz-Ziv (2017). Opening the black box via information. arXiv:1703.00810. |
| 5 | Kozachenko & Leonenko (1987). Sample estimate of entropy of a random vector. PIT. |
| 6 | Cover & Thomas (2006). Elements of Information Theory (2nd ed.). Wiley. |
| 7 | Hinton, G. (2023). Interview. 60 Minutes, CBS News. |
| 8 | Baladi, S. (2026). ENTRO-OMEGA: Unified Adaptive Stabiliser. DOI: 10.5281/zenodo.19562999. |
| Resource | Link |
|---|---|
| π Zenodo Preprint | doi.org/10.5281/zenodo.20222840 |
| π OSF Registration | doi.org/10.17605/OSF.IO/7WP9H |
| π¦ PyPI Package | pypi.org/project/entropath |
| π GitHub Repository | github.com/gitdeeper12/ENTRO-PATH |
| π¦ GitLab Mirror | gitlab.com/gitdeeper12/ENTRO-PATH |
| πͺ£ Bitbucket Mirror | bitbucket.org/gitdeeper-12/ENTRO-PATH |
| π Codeberg Mirror | codeberg.org/gitdeeper12/ENTRO-PATH |
| ποΈ ENTRO-OMEGA (E-LAB-10) | doi.org/10.5281/zenodo.19562999 |
| π¬ OSF Project | osf.io/yaevt |
| π Internet Archive | archive.org/details/osf-registrations-7wp9h-v1 |
| π ORCID Profile | orcid.org/0009-0003-8903-0029 |
This project is released under the MIT License. See LICENSE for full terms.
The OSF registration is released under CC-By Attribution 4.0 International.
EntropyLab Independent Research Series Β· May 2026 Information Theory Β· Neural Network Interpretability Β· Entropy Dynamics
Registration DOI: 10.17605/OSF.IO/7WP9H Β· Preprint DOI: 10.5281/zenodo.20222840