Skip to content

Glossary & Notation

Every term this book mints or borrows, and every recurring symbol, with a one-line definition and a link to the page that owns it. When prose and glossary disagree, the owning page wins; please file the disagreement.

TermOne-line definitionDefined in
Robust reasoning processes (RRP)The proposed field: the study and engineering of procedures that turn effort and evidence into trustworthy conclusions, and keep doing so under adversarial pressureRobust Reasoning Processes
Reasoning processA procedure — peer review, a prediction market, an LLM pipeline — that consumes effort and evidence and emits claims, estimates, or evaluationsThe Core Model
JudgeThe consumer of reasoning, who cannot check the work object-level and must decide how much to updateThe Core Model
Epistemic weightA process’s likelihood-ratio profile: how strongly its outputs discriminate truth from falsehood, conditional on its incentive environmentThe Core Model
Loss pipelineThe heuristic decomposition of delivered value into codification, evaluator, and interpretation lossesThe Core Model
Cost per validated bitThe field’s figure of merit: the price of information that survived checking, not the price of textThe Core Model
Corruption cost curveThe minimum an adversary of given capability must spend to distort a process’s output by a given amountThe Core Model
Corruption surplusA participant’s maximum expected gain from deviating from honest effort — what attackers harvestThe Core Model
Robustness conditionCorrupting the process costs more than the distortion is worth — evaluated per threat model, never threat-model-freeThe Core Model
Incentive auditMeasuring a corruption cost curve empirically: pay red-team producers to corrupt the process and record the priceThe Core Model
Grading schemeEvery formalism carries [exact], [standard shape], or [heuristic] — trust each exactly as far as its gradeThe Core Model
Epistemic Impact Analysis (EIA)Pricing information by how it changes a calibrated agent’s beliefs, weighted by a utility function, validated at resolutionEpistemic Impact Analysis
ProfundityHow load-bearing a changed belief is — how much an update propagates to downstream questionsEpistemic Impact Analysis
Falsehood nullificationThe demand that false claims contribute zero or negative value — the hard problem of verification, restated as a desideratumEpistemic Impact Analysis
Question portfolioA utility function represented as a few hundred importance-weighted resolvable questionsConstructing Utility Functions
Relative value functionsRepresenting value as pairwise ratio distributions rather than absolute units, preserving correlationConstructing Utility Functions
GroundingWhat a protocol’s incentives terminate in: a judge’s verdict, external resolution, internal coherence, or peer agreementWhat Grounds an Oversight Protocol?
Producer/consumer prediction gameAn information producer trades against a calibrated reference model and profits by moving its beliefs and being validated at resolutionWhat Grounds an Oversight Protocol?
RetrodictionScoring models on known facts hidden from them — resolution without waiting, if contamination can be controlledWhat Grounds an Oversight Protocol?
Epistemic selection protocolsCommitting now to a process for choosing the most-trusted resolver at resolution timeThe Reliability Ladder
Consistency batteryA suite of checks that a system’s estimates obey the constraints any rational belief set must — necessary, never sufficientConsistency Evaluations
Dutch book / arbitrage metricInconsistency priced as the guaranteed profit extractable from a system’s own estimatesConsistency Evaluations
Opinion fuzzingSampling judgments across prompts, models, and personas and treating the variance structure as signalConsistency Evaluations
Strong reasonerAn AI system whose judgments deserve substantial weight — in the limit, deference — within its domain; warranted trust, not raw capability (and not “reasoning model”)What Is a Strong Reasoner?
LLM-based epistemic processThe unit of analysis: model plus scaffolding plus protocols, not the bare modelWhat Is a Strong Reasoner?
Reliability ladderFive application tiers, each deployable only when its verification machinery exists and survives optimization against itThe Reliability Ladder
Output-metered oversightDon’t audit the research process; meter the output’s validated epistemic impact against a question portfolioOverseeing Automated Research
Consumer agentThe calibrated reference model whose belief state over the portfolio is “the book”Overseeing Automated Research
Object loop / meta loopProducers maximize validated impact against VV; an institutionally separated party audits and re-weights VV itselfEpistemic Impact Analysis
Resolution layerThe enforcement of falsehood nullification: resolvers, retrodiction, consistency checks, randomized audits. “The resolution layer is the system.”Overseeing Automated Research
Deception affordanceA form’s perceived likelihood ratio divided by its actual cost-to-fake one; operationally, its false-side belief swing relative to its true-side swing — near one, the form transmits persuasion, not informationUntrustworthy Sources
Built-in biasesA process’s systematic distortions with no adversary at all — corruption an attacker gets at zero marginal costThe Process Catalogue
Decision-relative (goal) biasA process’s systematic lean toward one resolution of a specific decision DD (nearest standard term: directional bias) — as opposed to broad, decision-independent biases; should be ≈0 on decisions it has no information aboutThe Process Catalogue
Label-swap neutralityThe unsupervised check for goal bias: swap the options of a no-information decision; residual output asymmetry is the biasThe Process Catalogue
Deception preconditionsThe conjunction (verification gap, reproduction gap, dependence, goal-divergence, never-resolves, stakes) all required for deception to be a live risk — break any one to defuse it; read as the defender’s attack-surface-reduction checklistUntrustworthy Sources
Residual attack surfaceThe few channels that survive design-time hardening — questions turning on an irreplaceable advantage, that never resolve, where goals diverge; where the runtime test must carry the loadUntrustworthy Sources
Irreplaceable advantageA source’s output-relevant epistemic edge with no Blackwell-sufficient trusted substitute at feasible cost (replacement cost unbounded, though its value stays finite) — the core of the residual attack surface, where deception is genuinely dangerousUntrustworthy Sources
Counterfactual-deceiver testThe listener’s runtime rule: use a message exactly to the degree a motivated liar in the source’s position could not have produced it as cheaply if the claim were falseUntrustworthy Sources
The empty quadrantCheap to run and expensive to corrupt — the region of the process map the AI era needs filledThe Process Catalogue
SymbolMeaningHome
π\pia reasoning processThe Core Model
JJthe judgeThe Core Model
UUthe judge’s utility function over information, represented as a weighted question portfolioThe Core Model, Constructing Utility Functions
Vsource,VdeliveredV_{source}, V_{delivered}information value before and after the loss pipelineThe Core Model
Lcod,Leval,LinterpL_{cod}, L_{eval}, L_{interp}codification, evaluator, and interpretation lossesThe Core Model
cprocess,cinterpc_{process}, c_{interp}, nnprocess cost, per-consumer interpretation cost, number of consumersThe Core Model
Cπ(Δ,g,t)C_\pi(\Delta, g, t)corruption cost curve: minimum adversary spend to distort π\pi‘s output by Δ\Delta at capability gg, time ttThe Core Model
Δ\Deltadistortion, measured in UU-unitsThe Core Model
ggadversary capabilityThe Core Model
V(I,A,U)V(I, A, U)EIA’s value of information set II to agent AA under utility function UUEpistemic Impact Analysis
IIa set of information (data, arguments, a paper)Epistemic Impact Analysis
AAthe calibrated consumer agent — itself a reasoning process π\pi in the core model’s sense, playing the judge’s belief-keeperEpistemic Impact Analysis
QQ, QQ'an estimation question; a semantically equivalent rephrasingConsistency Evaluations
M(Q,t)M(Q, t)the evaluated model’s answer to QQ at time ttConsistency Evaluations
qq, qXq_Xa forecast; the forecast after observing source XXThe Core Model
I(Q;X)I(Q; X)mutual information between question QQ and source XX — the decision-free value of a source, per questionThe Core Model
τ\tauthe listener’s credence that a source is honest rather than strategic — underspecified as a scalar, refined into decision-indexed goal-divergenceUntrustworthy Sources
bπ(D)b_\pi(D)decision-relative (goal) bias: π\pi‘s systematic lean toward one resolution of decision DD, holding truth-relevant information fixedThe Process Catalogue