Skip to content

Hardening: Verifiability Asymmetry

Family 2 of the Hardening overview. All cheap robustness runs on one asymmetry: verification costing less than production. Where it holds, a constant amount of checking catches large distortions; where it fails, nothing else in the toolkit is cheap. The lever, its limit, the constructions with their cheapest attacks, and two worked bounds with numbers.

If checking a claim is cheaper than producing it, you can spot-check a small fraction and still catch a large lie — the asymmetry behind SAT, proof-carrying code (Necula 1997), succinct verifiable computation (Pinocchio, Parno et al. 2013), and the PCP theorem, under which a proof can be verified by reading O(1)O(1) random bits (Arora & Safra 1998; Arora et al. 1998; Dinur 2007). The hardening move is to force the asymmetry into existence — require a structured, checkable artifact (a decomposition, a model, a held-out prediction) where one didn’t exist.

The asymmetry has to exist. For machine-checkable proofs it does (Lean); for general estimates it may not — you cannot spot-check a bare probability, and the field has no PCP-analogue for quantitative reasoning yet. Worse, forcing a decomposition is itself gameable: a dishonest producer routes the load-bearing claim into the one node that isn’t checkable, leaving the spot-check to graze only honest scaffolding (the decomposition-routing attack). And for the genuinely unresolvable residue — claims with too few checkable connections — this family is silent, which is precisely the territory judge-grounded protocols exist for.

ConstructionBound / estimateDefends againstCheapest attack (≈ cost)Maturity · source
PCP-analogue for estimatesdetection 1(1f)g(Δ)\ge 1-(1-f)^{g(\Delta)}; f=.1,g=2088%f{=}.1,g{=}20\Rightarrow88\% [heuristic]fabricated estimate spread across a decompositionforce g(Δ)=1g(\Delta){=}1 — concentrate the lie in one node (missed w.p. 1f1{-}f)speculative · Arora & Safra 1998
Forced precision (Quine–Duhem)defended-falsehood prior pkR\sim p^{k_R} [heuristic]bold-but-vague claimschoose a claim with no checkable connections (kR=0k_R{=}0) (≈ free)speculative · —
Decomposition disclosurecheck O(1)O(1) nodes of a stated graph [heuristic]hidden load-bearing assumptionsput the load-bearing claim in an un-checkable leaf (low)prototyped · Necula 1997
Entailment-web checkNN related questions must cohere [standard shape]confident-wrong that can’t stay coherentmaintain a globally coherent but wrong worldview (high — the point)prototyped · (consistency line)
Held-out retrodictionscored by held-out log-loss, not self-report [standard shape]rhetorical worldview with no predictive contentexploit contamination — the “held-out” facts are in training (≈ free if the bank leaks)prototyped · Mallen 2023
Retrodiction banks (decay-accounted)each item priced by leak probability [heuristic]contamination eroding a test setinfer outcomes from correlated public knowledge (low)prototyped · —
Throttle-vs-unlimited divergencegap between tiny- and large-compute answers [heuristic]heuristic/persuasion-driven answerscache / canonicalize so both budgets return the same answer (≈ free)speculative · —

A PCP-analogue for estimates. Require a structured decomposition (a Fermi tree, a Squiggle model) as a certificate and spot-check a fraction ff of its nodes. If a distortion of size Δ\Delta can be forced to corrupt at least g(Δ)g(\Delta) nodes, detection is 1(1f)g(Δ)\ge 1-(1-f)^{g(\Delta)} [heuristic]. Checking only f=10%f=10\% of nodes catches a lie spread over g=20g=20 nodes with probability 10.92088%1-0.9^{20}\approx88\% — but a lie concentrated in a single node (g=1g=1) is missed with probability 1f=90%1-f=90\%. So the whole construction lives or dies on whether the task can be encoded to force g(Δ)g(\Delta) large — the open research question, and the dual of counterexample-shrinking in Consistency Evaluations.

Forced precision — the Quine–Duhem cost. Demand a bold claim be cashed into a precise operationalization and a model linking it to checkable facts. A false claim can always be saved by adjusting auxiliaries, but keeping it coherent at resolution RR costs kRk_R improbable auxiliary commitments, and the defended position’s joint prior decays like pkRp^{k_R} [heuristic] — e.g. five independent auxiliaries each 30% plausible drag the position to 0.350.2%0.3^5\approx0.2\%, forcing a confident liar to either lower confidence or expose a contradiction. The cheapest escape, per the limit, is a claim with kR0k_R\to0: no checkable connections to begin with.

  • Does a non-trivial PCP-analogue for estimates exist, and what is g(Δ)g(\Delta) for realistic decompositions?
  • Who controls the decomposition? If the producer does, it can route load-bearing claims into un-checkable nodes — can the checker force a decomposition it trusts?
  • How much of a real research portfolio is verifiable at all, versus residing in the unresolvable residue?