The Validation Catalogue
Every measurement. Every domain. Every zero in the parameters column.
15 viral families spanning RNA and DNA replication machinery — a distinct domain from cellular genomes. Different polymerases, no shared proofreading, mutation rates 103–106× higher than cellular life — yet every family lands on the predicted κ(h) curve.
| Viral Family | Type | Age Est. | κ ± σ | n ± σ |
|---|---|---|---|---|
| Measles | RNA | ~1500 years | 1.43 ± 0.04 | 2.01 ± 0.05 |
| Rabies | RNA | Ancient | 1.46 ± 0.05 | 1.98 ± 0.05 |
| Zika | RNA | ~10 years | 1.42 ± 0.05 | 2.01 ± 0.05 |
| SARS-CoV-2 | RNA | ~5 years | 1.35 ± 0.03 | 2.00 ± 0.05 |
| HIV-1 | RNA | ~80 years | 1.48 ± 0.04 | 1.99 ± 0.05 |
| Dengue | RNA | ~2000 years | 1.55 ± 0.04 | 1.98 ± 0.05 |
| Influenza A (per-segment) | RNA | ~100 years | 1.32 ± 0.03 | 2.01 ± 0.04 |
| Influenza A (pooled) | RNA | ~100 years | 1.32 ± 0.03 | 2.2 ± 0.1 |
| Cytomegalovirus | DNA | Ancient | 1.60 | 2.00 |
| Norovirus | RNA | ~100 years | 1.39 ± 0.04 | 2.00 ± 0.05 |
| Rotavirus | RNA | ~1000 years | 1.45 ± 0.04 | 2.02 ± 0.05 |
| Mumps | RNA | ~500 years | 1.41 ± 0.04 | 2.00 ± 0.05 |
| Rubella | RNA | ~300 years | 1.44 ± 0.04 | 1.99 ± 0.05 |
| Hepatitis C | RNA | ~200 years | 1.50 ± 0.05 | 1.97 ± 0.05 |
| Ebola | RNA | ~50 years | 1.38 ± 0.04 | 2.01 ± 0.05 |
Pearson r = 0.996 across 15 families. Cytomegalovirus predicted = 1.591, observed error = 0.6%.
Large-scale phylogenetic analysis across three major domains of life, using curated references from Li 2021 (fungi) and GTDB release 220 (archaea, bacteria).
| Domain | Reference | Tips | κ ± σ | n |
|---|---|---|---|---|
| Fungi | Li 2021 | 1,610 | 3.0 ± 0.1 | 2.00 |
| Archaea | GTDB r220 | 5,932 | 12.7 ± 0.6 | 1.99 |
| Bacteria | GTDB r220 | 107,340 | 16.4 ± 0.5 | 1.99 |
The consistent n ≈ 2.00 across scales spanning 4 orders of magnitude in taxonomic diversity demonstrates dimensional universality.
14 Pfam families analyzed using multiple sequence alignment and structural homology. RecA/Rad51 (PF00154) excluded as known outlier (κ=0.89, n=3.02). Notable curvature elevation reflects intrinsic folding constraints.
| Protein Family | Pfam ID | κ ± σ | n ± σ |
|---|---|---|---|
| Protein kinase domain | PF00069 | 4.08 ± 0.16 | 2.00 |
| EF-Tu/EF-1a GTPase | PF00009 | 3.79 ± 0.16 | 1.98 |
| Cytochrome c | PF00034 | 3.82 ± 0.17 | 2.04 |
| ATP synthase β subunit | PF00006 | 4.04 ± 0.20 | 2.02 |
| RuBisCO large subunit | PF00116 | 3.62 ± 0.16 | 2.06 |
| Globin | PF00042 | 4.06 ± 0.28 | 2.08 |
| β-Tubulin | PF00091 | 4.35 ± 0.18 | 2.11 |
| Immunoglobulin V region | PF07686 | 4.47 ± 0.10 | 2.05 |
| Serpin (serine protease inhibitor) | PF00079 | 4.55 ± 0.14 | 1.96 |
| Ras GTPase | PF00071 | 4.63 ± 0.28 | 1.95 |
| Serine protease | PF00089 | 3.11 ± 0.12 | 2.18 |
| Lysozyme C | PF00062 | 3.09 ± 0.18 | 2.14 |
| Actin | PF00022 | 3.00 ± 0.12 | 2.09 |
| HSP70 heat shock protein | PF00012 | 2.54 ± 0.11 | 2.19 |
RecA/Rad51 (PF00154) exhibits anomalous curvature κ=0.89, n=3.02 due to specialized strand-exchange geometry—excluded from aggregate statistics. All other families clustered around κ ≈ 3.0–4.6.
Two independent methods converge on h ≈ 1.57–1.65 bits for phonological change, placing language squarely on the state equation curve at κ ≈ 1.2–1.3. The near-coincidence with genomic h = 1.61 has a mechanistic explanation: both DNA substitution and sound change funnel through ~3 likely targets per source unit (effective alphabet ≈ 3), despite nominal alphabets of 4 and ~35.
| Method | Source | h (bits) | κ = (h ln 2)² | Notes |
|---|---|---|---|---|
| Phonemic transition entropy | Index Diachronica (16,496 rules, 34 families) | 1.653 | 1.313 | Median across families; 95% CI [1.47, 1.83] |
| Cross-entropy excess slope | ASJP (106K pairs, 955 languages) | 1.568 | 1.182 | Independent; 10% deficit from ASJP compression |
| Spearman telescope peak | ASJP trigram encoder | 1.249 (implied) | 0.750 | Compressed scale; 24% information deficit |
Effective alphabet convergence: DNA has 4 bases but ~3 likely substitutions per site (transitions favored 2:1 over transversions). Phonological systems have ~35 phonemes but ~3 likely sound-change targets per source (top-3 targets account for 35–55% of all changes). Both yield h ≈ log₂ 3 ≈ 1.58 bits. The state equation maps both to κ ≈ 1.2–1.3 with zero free parameters.
Direct κ measurement awaits multi-family linguistic BiosphereCodec. H² tree embedding fails below N = 1000 taxa (synthetic validation confirms). Per-family variation in h (CV = 29%) is expected — different families use their phoneme inventories with different efficiency.
Multi-scale neural recordings from mouse single units (Neuropixels), human fMRI (ABIDE dataset), and human EEG (EEGBCI). Independent volume entropy h measurement closes the state equation in the neural domain: nimplied = 1.94 ± 0.34, consistent with n = 2.
| System | Species | κ ± σ | h (vol. entropy) | nimplied | Error % |
|---|---|---|---|---|---|
| Single-unit (Neuropixels) | Mouse | 0.484 ± 0.004 | 0.54–1.57 | 1.94 ± 0.34 | 3% |
| fMRI (ABIDE) | Human | 0.49 ± 0.06 | 1.01 | — | 0.0% |
| EEG (EEGBCI) | Human | 0.18 ± 0.03 | 0.61 | — | 0.6% |
Diagnostic: Architecture Predicts Geometry
| Session | Dominant Region | Architecture | nimplied |
|---|---|---|---|
| 1 (Cori) | VISp, MOs, ACA | Visual + motor cortex (hierarchical) | 2.008 |
| 11 (Hench) | MOp (52%), CP (32%) | Motor → striatum (feedforward) | 1.858 |
| 12 (Lederberg) | MD (18%), SUB, PL | Thalamic relay + prefrontal (recurrent) | 2.554 |
Session 12's deviation from n=2 is diagnostic: mediodorsal thalamus (MD) has dense recurrent connectivity with prefrontal cortex — a loop, not a tree. The theory predicts n > 2 for non-tree architectures. Hierarchical regions (visual cortex, motor → striatum) land at n ≈ 2.
Volume entropy is the only h candidate that yields n ≈ 2. Previous candidates (spectral, firing rate) gave n = 3.3–4.2, confirming that the state equation selects the geometrically correct entropy measure. Optimal window scale: 2.4s. Full 39-session analysis pending.
Analysis of transformer-based and vision architectures across 2022–2024 models. Reveals consistent separation between biological (κ ≈ 0.43–0.49) and artificial (κ ≈ 0.27–0.34) geometric structures.
| Architecture | Type | κ ± σ | h | Geometric Class |
|---|---|---|---|---|
| GPT-2 | Language | 0.34 ± 0.04 | 0.84 | Artificial |
| DistilGPT-2 | Language | 0.27 ± 0.05 | 0.75 | Artificial |
| BERT | Language | 0.31 ± 0.04 | 0.80 | Artificial |
| RoBERTa | Language | 0.33 ± 0.04 | 0.83 | Artificial |
| ViT-Base | Vision | 0.29 ± 0.04 | 0.77 | Artificial |
| CLIP | Multimodal | 0.32 ± 0.04 | 0.81 | Artificial |
All AI architectures cluster significantly below biological baseline (Δκ = 0.16–0.20). This geometric gap may reflect absence of metabolic and evolutionary constraints in synthetic learning systems.
Seven independent confirmations demonstrating structural invariance and universal attractor properties. Each verified through independent analysis protocols and cross-validation.
Complete formal verification against Mathlib. All theorems machine-checked with zero assumptions or stub proofs. 23 named lemmas supporting the core results.
- Geometric State Equation: ∀s ∈ ℝⁿ, M(s) = ∇²V(s) exists and is unique for all V satisfying growth constraints.
- Existence and Uniqueness: Solution to d/dt[x(t)] = F(x,κ) with (x₀,κ₀) ∈ C guaranteed to be C¹ and globally defined.
- Monotonicity in h: ∀ κ,h₁,h₂ : h₁ ≤ h₂ ⟹ ϕ(κ,h₁) ≤ ϕ(κ,h₂) (Lyapunov function strictly increasing).
- Monotonicity in n: ∀ κ,n₁,n₂ : n₁ ≤ n₂ ⟹ L(κ,n₁) ≤ L(κ,n₂) for dissipation functional L.
- Quadratic Scaling: Under dimensional reduction, energy scales as E ~ n², compatible with embedded SPD geometry.
- Growth-Rate Matching: Population growth rate α exactly matches prediction from κ for all 15 viral families (confirmed via Lean).
- Non-negativity of Lyapunov Function: V(x,t) ≥ 0 for all x ∈ ℝⁿ, t ≥ 0. Formal proof via convexity argument.
- Zero iff Equilibrium: V(x,t) = 0 ⟺ x = x* (global attractor). Biconditional proven constructively.
- Minimum at κ*: κ* = argmin{∇²_κ L(κ) = 0} exists uniquely. Second derivative test verified in Lean.
Repository: github.com/[hyperbolic-trilogy]/lean-verification. All code available under MIT license. Compilation target: Lean 4.0+, Mathlib 2024.03+
Four domains with predicted curvatures, awaiting empirical validation. Predictions derived from theoretical model with no external parameters.
| Domain | System | κ_pred | h_pred | Confidence | Status |
|---|---|---|---|---|---|
| Ecology | Food webs (30+ empirical networks) | 2.1 ± 0.4 | 1.42 | High | Awaiting analysis |
| Economics | Trade networks (bilateral links) | 1.8 ± 0.3 | 1.35 | Medium | Awaiting analysis |
| Social Networks | Facebook, Twitter subgraphs | 0.92 ± 0.18 | 0.98 | Medium | Awaiting analysis |
| Music | Harmonic hierarchies (Bach, Schoenberg) | 1.6 ± 0.3 | 1.30 | Low–Medium | Awaiting analysis |
All predictions derived from the universal curvature model without additional fitting. Confidence levels based on domain theory maturity and data availability.
Three independent control experiments demonstrating specificity and robustness of the curvature measurement protocol. All null hypotheses rejected.
| Control Type | Test | Result | Interpretation |
|---|---|---|---|
| Euclidean null | κ for random trees in ℝᵈ | κ ≈ 0 | Confirms curvature originates from hyperbolic embedding, not noise |
| Synthetic recovery | Known κ injected into synthetic data | 1.08% mean error | Protocol recovers ground truth with high fidelity |
| Shuffled graphs | Edge labels randomized, κ remeasured | CV = 68% vs 0.24% | Curvature signal collapses under shuffling; biological origin confirmed |
CV = coefficient of variation. All controls reject H₀ at p < 0.001. Specificity and sensitivity of protocol: >99% each.