Hardening: Verifiability Asymmetry
Family 2 of the Hardening overview. All cheap robustness runs on one asymmetry: verification costing less than production. Where it holds, a constant amount of checking catches large distortions; where it fails, nothing else in the toolkit is cheap. The lever, its limit, the constructions with their cheapest attacks, and two worked bounds with numbers.
The lever
Section titled “The lever”If checking a claim is cheaper than producing it, you can spot-check a small fraction and still catch a large lie — the asymmetry behind SAT, proof-carrying code (Necula 1997), succinct verifiable computation (Pinocchio, Parno et al. 2013), and the PCP theorem, under which a proof can be verified by reading random bits (Arora & Safra 1998; Arora et al. 1998; Dinur 2007). The hardening move is to force the asymmetry into existence — require a structured, checkable artifact (a decomposition, a model, a held-out prediction) where one didn’t exist.
The limit
Section titled “The limit”The asymmetry has to exist. For machine-checkable proofs it does (Lean); for general estimates it may not — you cannot spot-check a bare probability, and the field has no PCP-analogue for quantitative reasoning yet. Worse, forcing a decomposition is itself gameable: a dishonest producer routes the load-bearing claim into the one node that isn’t checkable, leaving the spot-check to graze only honest scaffolding (the decomposition-routing attack). And for the genuinely unresolvable residue — claims with too few checkable connections — this family is silent, which is precisely the territory judge-grounded protocols exist for.
Constructions
Section titled “Constructions”| Construction | Bound / estimate | Defends against | Cheapest attack (≈ cost) | Maturity · source |
|---|---|---|---|---|
| PCP-analogue for estimates | detection ; [heuristic] | fabricated estimate spread across a decomposition | force — concentrate the lie in one node (missed w.p. ) | speculative · Arora & Safra 1998 |
| Forced precision (Quine–Duhem) | defended-falsehood prior [heuristic] | bold-but-vague claims | choose a claim with no checkable connections () (≈ free) | speculative · — |
| Decomposition disclosure | check nodes of a stated graph [heuristic] | hidden load-bearing assumptions | put the load-bearing claim in an un-checkable leaf (low) | prototyped · Necula 1997 |
| Entailment-web check | related questions must cohere [standard shape] | confident-wrong that can’t stay coherent | maintain a globally coherent but wrong worldview (high — the point) | prototyped · (consistency line) |
| Held-out retrodiction | scored by held-out log-loss, not self-report [standard shape] | rhetorical worldview with no predictive content | exploit contamination — the “held-out” facts are in training (≈ free if the bank leaks) | prototyped · Mallen 2023 |
| Retrodiction banks (decay-accounted) | each item priced by leak probability [heuristic] | contamination eroding a test set | infer outcomes from correlated public knowledge (low) | prototyped · — |
| Throttle-vs-unlimited divergence | gap between tiny- and large-compute answers [heuristic] | heuristic/persuasion-driven answers | cache / canonicalize so both budgets return the same answer (≈ free) | speculative · — |
Worked bounds
Section titled “Worked bounds”A PCP-analogue for estimates. Require a structured decomposition (a Fermi tree, a Squiggle model) as a certificate and spot-check a fraction of its nodes. If a distortion of size can be forced to corrupt at least nodes, detection is [heuristic]. Checking only of nodes catches a lie spread over nodes with probability — but a lie concentrated in a single node () is missed with probability . So the whole construction lives or dies on whether the task can be encoded to force large — the open research question, and the dual of counterexample-shrinking in Consistency Evaluations.
Forced precision — the Quine–Duhem cost. Demand a bold claim be cashed into a precise operationalization and a model linking it to checkable facts. A false claim can always be saved by adjusting auxiliaries, but keeping it coherent at resolution costs improbable auxiliary commitments, and the defended position’s joint prior decays like [heuristic] — e.g. five independent auxiliaries each 30% plausible drag the position to , forcing a confident liar to either lower confidence or expose a contradiction. The cheapest escape, per the limit, is a claim with : no checkable connections to begin with.
Open questions
Section titled “Open questions”- Does a non-trivial PCP-analogue for estimates exist, and what is for realistic decompositions?
- Who controls the decomposition? If the producer does, it can route load-bearing claims into un-checkable nodes — can the checker force a decomposition it trusts?
- How much of a real research portfolio is verifiable at all, versus residing in the unresolvable residue?