Poster for “The Key to a Stable Governance Layer” contrasting problem-first governance with wrapper-first control design, showing classical figures, core equilibrium clarity, policy controls, and RATIUM.AI governance framing.

The Key to a Stable AI Governance Layer

Problem-to-Permission Derivation Completeness as a Necessary Condition for Operational Governance

Maturity: Advanced candidate integrative governance-design theory and measurement framework; empirically unvalidated

Abstract

Artificial-intelligence governance is frequently inferred from the presence of visible controls: policies, evaluation suites, risk scores, human-review procedures, audit trails, dashboards, approval committees, release gates, rollback provisions, and organizational management systems. These instruments may be necessary. Their presence does not establish that the governed system is connected to a coherent decision problem, that observed signals represent relevant failure mechanisms, that metrics support justified thresholds, that decision authority is operationally real, or that a nominal governance decision can change what the system is permitted to do.

This article develops Problem-to-Permission Derivation Completeness (PPDC) as a candidate integrative governance-design theory and measurement framework. Its unit of analysis is the bounded Governance Derivation Instance: a specified governance object, domain and purpose, AI or workflow configuration, credible baseline, and evaluation period. A governance regime is derivationally complete only where a defined problem model can be traced through an explicit failure structure, observable signals, interpretable metrics, justified decision thresholds, an authorized decision locus, a gate determination, an implemented permission state, and a review, replay, correction, remedy, and reauthorization path.

For governance instance g, the canonical node structure is:

PPDC_g ⇔ P_g ∧ F_g ∧ S_g ∧ M_g ∧ Θ_g ∧ A_g ∧ G_g ∧ I_g ∧ R_g

where P denotes the problem model; F, failure structure; S, observable signals; M, interpretable metrics; Θ, materiality and decision thresholds; A, authorized decision locus; G, gate determination; I, implemented permission state; and R, review, replay, correction, remedy, and reauthorization capacity.

Node presence is insufficient. Every adjacent transition must possess a documented, reviewable, and defeasible derivation relation. Let N_g=(P,F,S,M,Θ,A,G,I,R) and let τ(x_i,x_(i+1)) denote a valid derivation relation. Then:

PPDC_g ⇔ ( ∧ _(x ∈ N_g)x) ∧ ( ∧ _(i=1)^(|N_g|-1)τ(x_i,x_(i+1)))

A metric that is not connected to a failure mechanism is an orphaned measurement. A threshold without evidential and materiality rationale is an administrative convention. A gate without real authority is symbolic. A decision without implementation is non-operative. Feedback, appeal, audit, or human review without the capacity to alter the relevant decision regime constitutes procedural openness without operative correction.

PPDC draws together—but does not claim priority over—requirements traceability, system-theoretic hazard analysis, assurance cases, runtime assurance, AI risk-management frameworks, organizational authority theory, management-system standards, and algorithmic contestability. These traditions already establish major components of problem-based engineering, traceability, evidence-supported claims, monitored state transition, risk treatment, formal versus effective authority, and contestation. The residual contribution proposed here is their integration into one governance chain whose endpoint is an implemented and reviewable permission state, together with cross-cutting tests of whose problem is governed, whose need is operationalized, who bears the burden, who controls valid correction, and whether the public grammar preserves or conceals the derivation.

The article distinguishes four independent dimensions: derivational completeness, substantive adequacy, operational completion, and governance reliability. Derivational completeness does not prove that the problem model is correct. A substantively justified gate does not prove that it was implemented. One implemented correction does not establish reliable correction under repeated pressure. PPDC is therefore not a validation claim for any governance architecture, including LoopGuard-AI. It is a theory-construction and measurement-design proposal whose independent status depends on whether it improves classification, prediction, traceability, and correction beyond adjacent frameworks.

Keywords: AI governance; derivation traceability; permission state; governance reliability; algorithmic contestability; runtime assurance; assurance cases; decision authority; correction sovereignty; LoopGuard-AI; Central Equilibrium Problem

—

Part I — Canonical Theory

1. Controls Without Derivation

AI governance commonly becomes visible through artifacts. An organization publishes a policy, creates a model card, conducts red-team exercises, records evaluator scores, adds a human-review checkpoint, maintains an incident log, establishes a safety committee, or places a release gate between evaluation and deployment. These structures may be valuable. Some are indispensable. None is self-interpreting.

A risk score does not explain why the measured condition constitutes a relevant risk. An evaluator disagreement does not determine whether disagreement should produce further testing, restricted operation, suspension, or release. A human-review requirement does not establish that the reviewer possesses relevant evidence, appropriate competence, or authority to alter the system. An audit record does not establish that its findings can change a threshold, policy, configuration, permission, or future release decision.

The category error is therefore not the use of controls. It is the inference:

Governance Artifacts ⇒ Governance

That inference is invalid.

A system may contain extensive observation without a justified evaluation model. It may contain evaluation without a decision threshold. It may contain a threshold without competent authority. It may contain formal authority without implementation control. It may contain a gate label without a change in permission. It may contain post-event documentation without a path for revising the mechanism that generated the event.

The resulting system need not be fraudulent or useless. It may be administratively serious and technically sophisticated. The narrower diagnosis is that it contains more governance form than governance derivation.

1.1 Engineering and institutional antecedents

The requirement that controls remain connected to upstream purposes and failure structures has established antecedents. Requirements management already demands traceability between stakeholder expectations, technical requirements, design, verification, and change. NASA defines bidirectional traceability as the ability to trace a requirement to both its parent expectation and its allocated descendants, and treats requirements management as a lifecycle process linking expectations, requirements, design documents, and test procedures (NASA, 2023).

System-Theoretic Process Analysis begins from unacceptable losses, hazards, control structures, unsafe control actions, loss scenarios, and safety constraints rather than from generic safeguards added after design (Leveson and Thomas, 2018). Assurance cases organize claims, arguments, context, assumptions, and evidence, making explicit why a system property should be believed rather than merely accumulating documents (Rushby et al., 2015; SCSC Assurance Case Working Group, 2021). Dynamic assurance extends that logic by connecting design-time assurance claims to operational updates and runtime evidence (Denney, Menzies, and Pai, 2023). Runtime-assurance architectures connect monitored property violations to operational control transfer, supplying a concrete precedent for the movement from signal to threshold to authority transfer and implemented state change (Slagel et al., 2024). The NIST AI Risk Management Framework connects governance, context mapping, measurement, and management across the AI lifecycle (Tabassi, 2023). ISO/IEC 42001 and ISO/IEC 23894 provide organizational management and risk-management structures for responsible AI use (ISO/IEC, 2023a, 2023b). Empirical contestability research further shows that affected decision subjects require not only explanations but information, procedural support, responsibility attribution, and sites at which a decision can actually be challenged (Yurrita et al., 2025).

PPDC does not claim to discover any of these components. It asks whether they are integrated sufficiently to show a reviewable path from a human or institutional decision problem to an implemented permission state.

1.2 The residual problem

Existing approaches may establish that requirements are linked to system components, hazards are linked to unsafe control actions, assurance claims are linked to evidence, monitored violations are linked to controller transitions, risks are mapped and managed, responsibilities are assigned, and affected persons possess explanation or appeal. Yet a governance regime may still fail to answer the complete sequence:

Whose decision problem is the system entering?
Which human, institutional, or public need is operationalized?
What is the bounded governance object?
What failure mechanism threatens the relevant purpose?
Which observable signal represents that mechanism?
How is the signal converted into an interpretable metric?
What evidence justifies the threshold?
Who possesses formal decision authority?
Who possesses real control over the decision?
Which gate follows?
Was the corresponding permission state implemented?
Can valid criticism alter the derivation rather than merely the individual output?

The residual object is the integrity of the derivation by which governance becomes decision-bearing.

2. Governance Object and Governance Derivation Instance

Governance cannot be evaluated in the abstract. It must be indexed to an object capable of receiving an operational state.

A governance object is the smallest bounded system state, event, action, configuration, or decision path to which an enforceable permission determination can be applied. It may be a model output, agent action, tool call, external transaction, workflow transition, release candidate, model version, permission set, persistent-memory configuration, deployment scope, human–AI decision procedure, or complete AI-mediated decision regime.

The correct scale depends on causal completeness. An individual output may be the proper object where it can be allowed, restricted, withheld, or withdrawn independently. It is not the proper object where the relevant harm arises from repeated routing, ranking, memory, tool use, accumulated classification, or institutional reliance. Conversely, “the entire organization” is usually too broad where one release, workflow, or permission set can resolve the identified failure.

The object-selection rule is:

Select the smallest state that is both causally complete with respect to the specified failure and capable of receiving an implemented permission determination.

A single analytical unit is a Governance Derivation Instance:

g=(o,d,a,b,T)

where:

o = bounded governance object;
d = domain and evaluative purpose;
a = AI system, workflow, or governance configuration;
b = credible baseline or alternative;
T = evaluation period.

The same output may be low-impact in one domain and high-impact in another. The same agent action may be acceptable in a sandbox and unacceptable in production. The same uncertainty may justify SHIP for reversible drafting assistance and HOLD for an irreversible medical, employment, financial, or infrastructural action. Classification therefore requires explicit coordinates.

3. Permission State and Gate Determination

A permission state specifies what the governance object is operationally allowed to do.

Structured table:

Permission state: SHIP

Operative meaning: The object may proceed under defined conditions.

Permission state: RESTRICT

Operative meaning: The object may proceed only under specified limitations.

Permission state: HOLD

Operative meaning: The object may not proceed until identified conditions are resolved.

Permission state: ROLLBACK

Operative meaning: A prior permission, deployment, configuration, action path, or system state must be reversed, disabled, downgraded, withdrawn, or restored to a safer prior state.

These are normalized state families rather than an exhaustive legal vocabulary. A domain may use authorize, approve, condition, suspend, revoke, terminate, remand, quarantine, or require reconsideration. The question is whether the determination changes the object’s operative state.

A gate determination is the authorized decision that a permission state should apply. An implemented permission state is the resulting change in the governed object or its environment.

G ⇏ I

A correct HOLD determination does not constitute operational completion if deployment continues. A ROLLBACK recommendation does not constitute rollback if the prior configuration remains active. A RESTRICT decision does not constitute restriction if permissions, tool access, user exposure, or downstream reliance are unchanged.

4. Canonical Definition of PPDC

For Governance Derivation Instance g=(o,d,a,b,T), Problem-to-Permission Derivation Completeness exists where:

a bounded and reviewable problem model is defined;
a failure structure is derived from that model;
observable signals are linked to the failure structure;
metrics interpret those signals under specified rules;
materiality and decision thresholds are justified;
a competent and authorized decision locus exists;
a gate determination is generated under the applicable evidence and policy conditions;
the corresponding permission state is implemented; and
the decision, evidence, derivation, implementation, and subsequent outcomes remain reviewable, replayable, correctable, remediable, and subject to reauthorization.

Formally:

PPDC_g ⇔ P_g ∧ F_g ∧ S_g ∧ M_g ∧ Θ_g ∧ A_g ∧ G_g ∧ I_g ∧ R_g

No component is independently sufficient. A strong problem model without observability cannot govern runtime events. Extensive signals without an explicit failure model produce measurement load. Valid metrics without thresholds remain descriptive. Thresholds without authority remain recommendations. Authority without implementation control remains formal. Implementation without review can become arbitrary control.

The Problem-to-Permission Derivation Requirement is the governance obligation to demonstrate PPDC before a material permission is assigned, expanded, renewed, or restored. The requirement is procedural and epistemic: it requires a defensible path to permission, not agreement with one substantive theory of safety, justice, or institutional purpose.

4.1 Derivation relations

Let:

N_g=(P,F,S,M,Θ,A,G,I,R)

and let:

τ(x_i,x_(i+1))

denote a valid, documented, and reviewable derivation relation between adjacent components. Then:

PPDC_g ⇔ ( ∧ _(x ∈ N_g)x) ∧ ( ∧ _(i=1)^(|N_g|-1)τ(x_i,x_(i+1)))

A derivation relation should state, where the domain permits:

source of the upstream claim;
responsible owner;
derivation rule or rationale;
supporting evidence;
relevant assumptions;
uncertainty and incompleteness;
rejected alternatives;
validity and invalidation conditions;
version and date;
backward traceability;
forward traceability.

Forward traceability asks what governance consequence follows from a problem claim, failure mechanism, signal, metric, or threshold. Backward traceability asks from which problem claim and failure mechanism a metric, threshold, authority allocation, or gate was derived. Without forward traceability, a problem model may remain intellectually elaborate but operationally inert. Without backward traceability, a control may be operational but conceptually arbitrary. The chain specifies logical dependence, not a mandatory temporal or bureaucratic sequence; iteration, parallel processing, and institutional compression remain admissible where the derivation can still be reconstructed.

5. Four Independent Validation Dimensions

PPDC distinguishes four dimensions that are frequently collapsed.

5.1 Derivational completeness

Are the necessary components present, and are their transitions traceable, justified, reviewable, and defeasible?

5.2 Substantive adequacy

Are the problem model, failure hypotheses, evidence, metrics, thresholds, authority choices, and permission determination actually defensible?

A perfectly traceable derivation may still begin from a distorted need profile, invalid proxy, weak evidence, or unjust threshold.

Derivational Completeness ⇏ Substantive Adequacy

5.3 Operational completion

Did the authorized gate produce the corresponding state change before the decision opportunity closed?

Gate Determination ⇏ Implemented Permission State

5.4 Governance reliability

Does corrective capacity remain functional across repeated events, pressure, changing incentives, organizational turnover, uncertainty, drift, and conflict?

Implemented Correction ⇏ Governance Reliability

A governance regime may therefore be derivationally complete but substantively wrong, substantively correct but operationally incomplete, or operationally effective once but unreliable over time.

6. Governance Maturity and Claim Boundary

A useful maturity ladder is:

Structured table:

Level: G0 — Governance Artifacts

Description: Policies, dashboards, audits, reviews, model cards, or committees exist.

Level: G1 — Derivational Traceability

Description: The path from problem model to proposed permission state can be reconstructed.

Level: G2 — Executable Gate Capacity

Description: A competent authority can issue a binding SHIP, RESTRICT, HOLD, or ROLLBACK determination.

Level: G3 — Implemented Permission Control

Description: Gate determinations produce verifiable state changes.

Level: G4 — Governance Reliability

Description: Corrective capacity persists under repeated pressure, uncertainty, drift, and incentive conflict.

PPDC is proposed as a candidate necessary condition for stable governance, not as a sufficient condition. Part I establishes only that a bounded and testable distinction can be made between the presence of governance instruments and the completeness of the derivation connecting a decision problem to an implemented and reviewable permission state.

The framework does not claim that every governance process must be temporally linear, that all evidence must be public, that every affected party should possess veto authority, that traceability guarantees correctness, that four gates are universally optimal, or that LoopGuard-AI satisfies PPDC in production.

—

Part II — Upstream Problem Definition

7. Interested Parties and Need Profiles

A governance derivation cannot begin from capability alone. The statement that a system can rank, predict, summarize, plan, retrieve, classify, recommend, generate, or act does not identify the decision problem into which that capability will enter. It does not identify whose condition should improve, whose workload should decline, whose exposure may increase, or whose understanding and correction capacity may be altered.

The logically prior question is not:

What can the AI system do?

It is:

Whose decision problem is the AI system entering?

An interested party is an actor, group, institution, role, or functional position whose objective, exposure, authority, vulnerability, burden, or need is materially affected by the governance object. It may be the organization purchasing the system, the operational team using it, the person interacting with its interface, the person classified or ranked by it, the institution responsible for compliance, downstream actors relying on its records, or a public whose resources, rights, knowledge, or infrastructure are altered.

Let:

IP_g=(ip_1,ip_2,ldots,ip_n)

Each materially relevant party may be represented as:

ip_i=(r_i,e_i,b_i,h_i,k_i,a_i,c_i)

where r_i is functional role; e_i, evaluative interest; b_i, expected benefit; h_i, potential harm or burden; k_i, access to knowledge; a_i, formal and effective authority; and c_i, correction capacity. The representation is not intended to reduce persons to numerical profiles. Its purpose is to prevent the analysis from treating the system as though it served one undifferentiated user.

A mature problem model should distinguish:

the primary intended beneficiary;
secondary beneficiaries;
exposed or burden-bearing parties;
hidden operative beneficiaries whose needs dominate design although another party is presented publicly as principal.

These roles may overlap. A clinician may obtain useful support while acquiring verification obligations. A citizen may obtain faster service while losing access to meaningful human reconsideration. A company may reduce direct execution cost while increasing configuration and governance burden.

A need profile is the structured set of needs that the AI-mediated configuration is intended to address for an interested party. Needs may include access, continuity, safety, accuracy, explanation, correction, recognition, opportunity, privacy, autonomy, workload reduction, resource allocation, procedural consistency, legitimacy, reversibility, timely decision, or protection from arbitrary treatment.

The same technical capability may serve different need profiles. A routing system may serve an institution’s need to allocate scarce capacity and an exposed person’s need to obtain access. An explanation system may serve the institution’s need to document a decision and the affected person’s need to understand evidence, challenge error, and identify remedy. The relevant question is:

Which need is the function instrumentally subordinate to?

7.1 Need-profile substitution

Need-profile substitution occurs when the need of one interested party is silently replaced by the metric, proxy, or operational objective of another.

Let NP_i denote the declared need profile of party i, and Q_a the need actually optimized by configuration a. A candidate finding exists where:

NPS_(i,a) ⇔ NP_i ≠ Q_a

and the difference is materially consequential, insufficiently disclosed, obscured by the governing metric, and not subject to a proportionate correction path.

A recruitment system may be introduced to improve applicant consideration while operationally optimizing recruiter review volume. A customer-service system may be framed as improving service while principally deflecting human escalation. The failure is not that the administrative side benefits. It arises where the administrative proxy is presented as though it exhausts the exposed-side need.

8. Decision Problem and Correction Path

The decision problem is not synonymous with the task assigned to the AI system. “Rank applications,” “summarize evidence,” “detect risk,” or “route requests” describes an activity. A decision problem identifies the state requiring judgment, alternatives, affected parties, evidence, uncertainty, consequences of error, authority, reversibility, and correction.

A bounded decision problem may be represented as:

DP_g=(Ω,o,X,E,U,H,t_L,A_r,C_p)

where:

Ω = evaluative purpose;
o = governance object;
X = feasible decisions or permission states;
E = evidence structure;
U = uncertainty;
H = harm and burden profile;
t_L = decision-relevant deadline;
A_r = authority relations;
C_p = correction path.

A task specification asks what input is provided, what output is required, and what performance target applies. A decision-problem specification additionally asks why the task should be performed, whose problem it addresses, what happens if the output is wrong, who may rely on it, who may override it, what makes it reversible, and what correction remains after implementation.

Capability-first design follows:

Capability → Use Case → Workflow → Evaluation → Governance

Problem-first design follows:

Interested Party → Need Profile → Decision Problem → Correction Path → Permitted Capability

Exploratory research may legitimately begin from capability discovery. The governance problem begins when experimental capability becomes operational authority without a corresponding decision-problem specification. The problem-first principle can be summarized as:

(A)_(agent)=f(IP,NP,DP,CP)

where the permitted agentic architecture (A)_(agent) is a function of the interested party, need profile, decision problem, and correction path (Dunavich, 2026a).

A correction path is the process through which a valid trigger, criticism, anomaly, failure, appeal, or evidence change can alter the governed state. It should be specified before deployment rather than added only as incident response.

A minimum correction path is:

Trigger Recognition → Standing → Evidence Access → Competent Review → Judgment → Gate Authority → Implementation

These functions may be compressed into one actor or distributed across institutions. Their distribution is not pathological by itself. The question is whether the handoffs form a reliable path to an implemented state before the decision opportunity closes.

Standing identifies who may activate the path. Evidence access determines whether the reviewer can inspect relevant data, context, tool traces, versions, rationales, baselines, affected-party evidence, and downstream effects. Competence requires domain knowledge, technical understanding, sufficient time, and ability to reconstruct the decision. A review that cannot reach gate authority is diagnostically useful but correction-incomplete. A gate authority without implementation control is formally active but operationally weak.

Correction and remedy are distinct. Correction changes the current or future decision regime. Remedy addresses already realized burden or harm. A system may revise a threshold without reopening earlier exclusions, or withdraw unsafe advice without notifying those who relied on it. A mature problem model asks both how the regime can change and how prior consequences can be repaired.

8.1 Burden-sensitive correction

Correction strength should be proportionate to severity, irreversibility, scale, uncertainty, power asymmetry, observability, available alternatives, and risk of strategic abuse.

Required Correction Strength ↑ as Material Burden, Irreversibility, and Power Asymmetry ↑

This does not transfer full control to the exposed party. It requires that valid correction signals can reach regime-changing authority under defined evidence and procedural conditions.

9. Purpose, Baseline, and Burden Distribution

An AI system is not justified by its technological identity, the volume of its activity, the expansion of its capabilities, or its success against infrastructure that it helped create.

An evaluative purpose is a condition whose improvement can be assessed independently of the continued existence, expansion, or activity of the AI configuration. Examples include improved diagnostic accuracy, reduced decision latency, increased access, reduced direct execution burden, improved safety, reduced error, improved resilience, improved understanding, or preservation of rights.

The purpose should identify the condition to improve, relevant interested party, baseline, period, evidence, adequacy condition, and unacceptable tradeoffs. Purpose governance asks who defined the end, which need profile it represents, how progress is assessed, what evidence supports restriction or termination, and whether the purpose has changed.

A purpose may legitimately change. A system introduced to reduce labor may later become valuable as a research environment. The later purpose does not retroactively establish success under the earlier purpose. Purpose revision should be explicit, dated, justified, treated as a new authorization, and evaluated against an appropriate baseline.

9.1 Means–ends reversal

Means–ends reversal occurs where the mechanism introduced to serve an objective becomes the operative objective:

Evaluative Objective → Configuration as Means → Maintenance of Configuration → Configuration as Operative Objective

Better models, monitoring, policies, and configurations may be necessary. The failure occurs when internal improvement is treated as sufficient evidence of external purpose achievement.

Internal Mechanism Improvement ⇏ Purpose Achievement

9.2 Credible baseline

A credible baseline is the feasible alternative against which the complete AI-mediated regime is evaluated. It may be the prior human workflow, simpler automation, lower-autonomy configuration, different model, hybrid process, no-action alternative, or previously validated version.

The question is not whether configuration a produces some benefit:

Y_a>0

but how it performs against baseline b:

Y_a-Y_b

where Y is a multidimensional external-outcome profile including benefit, harm, workload, delay, access, error, verification, governance burden, correction, reversibility, and distribution.

9.3 Benefit and burden profiles

For affected actor i, define:

Λ_i(T)= (B_i,W_i,R_i,D_i,K_i,A_i,C_i)

where B_i is benefit; W_i, work or cognitive burden; R_i, risk exposure; D_i, delay or access burden; K_i, knowledge and evidence access; A_i, authority; and C_i, correction capacity.

A system may produce aggregate benefit while transferring verification work to specialists, correction work to users, risk to exposed persons, authority to administrators, and opacity to downstream actors. Distribution is therefore part of the problem model rather than a downstream ethical commentary.

Relevant burdens include specification, supervision, verification, governance, waiting time, loss of access or opportunity, cognitive interruption, data exposure, explanation and appeal burden, exit cost, dependence, and inherited configuration debt. The framework does not presume that every burden is unjustified. It asks whether the burden is visible, attributable, proportionate, included in the evaluation, and attached to appropriate correction authority (Dunavich, 2026g).

10. ADM/CIV and Sovereignty Cross-Tests

ADM/CIV is a structural distinction between functional positions inside a decision regime.

ADM is the administrative, managerial, institutional, organizational, regulatory, or governing position.
CIV is the civil, exposed, dependent, human-facing, or cost-bearing position.

The same actor may occupy ADM in one relationship and CIV in another. The distinction is not moral. ADM does not mean illegitimate; CIV does not mean correct. Institutions require administration, expertise, continuity, risk management, and resource allocation. Exposed parties may be mistaken, opportunistic, or unable to evaluate the complete system.

The analytical question is how the regime distributes problem definition, benefit, burden, evidence, explanation, authority, and correction (Dunavich, 2026b).

10.1 ADM-first, CIV-first, and mixed systems

An ADM-first system principally serves workload reduction, throughput, standardization, compliance, institutional risk, resource allocation, or administrative control. A CIV-first system principally serves access, explanation, protection, recognition, appeal, correction, remedy, or actionable understanding. Mixed systems are common and often desirable. The problem is not mixture but concealed substitution.

A system may be CIV-facing but ADM-serving: it interacts directly with an exposed person while operationally optimizing institutional needs. Examples include complaint interfaces designed principally to deflect escalation, recruitment systems designed principally to reduce review volume, and explanation tools that translate institutional decisions without making them contestable.

10.2 Decision sovereignty

Decision sovereignty concerns who defines the problem, evidence, categories, metrics, thresholds, success, failure, deviation, and available permission states. A model may classify accurately relative to a task that should not have been defined in that way. Accuracy is internal to a defined task; decision sovereignty concerns who had authority to define it and under what justification.

10.3 Correction sovereignty

Correction sovereignty concerns who decides whether criticism, anomaly, harm, appeal, or feedback becomes a valid correction signal. A system may allow feedback while preserving a monopoly over what feedback counts. It is procedurally open where input is accepted and operatively open where valid input can change the regime.

10.4 Public-grammar sovereignty

Public-grammar sovereignty concerns the language and categories through which the decision becomes intelligible and legitimate. A risk score may become “risk,” a fit score “merit,” confidence “certainty,” compliance “safety,” or administrative priority “need.” Institutions cannot operate without abstraction. The problem arises when the model representation becomes institutional category and then public reality without preserving the epistemic boundary among them.

10.5 Civil corrective capacity

Civil corrective capacity is the ability of CIV to know that an AI-mediated process affected the decision, understand the relevant evidence and category, identify error or proxy substitution, present counterevidence, reach competent review, obtain a decision with reversal authority, trigger broader correction where patterns recur, and receive remedy where harm occurred.

It is not direct democratic control over every decision. It is preservation of a proportionate route from exposure to correction.

11. From Local Symptom to Failure Structure

AI governance often begins from a visible symptom: hallucination, bias, toxicity, over-refusal, under-refusal, prompt injection, drift, inconsistency, unauthorized tool use, unexplained ranking, repeated appeal, or weak explanation. These observations may be important. They do not by themselves identify the upstream failure structure.

PPDC distinguishes five levels:

Structured table:

Level: Observation

Meaning: A recorded event or condition

Level: Symptom

Meaning: A pattern suggesting that something may be wrong

Level: Failure event

Meaning: A bounded instance in which an intended condition was not met

Level: Failure mechanism

Meaning: The process producing or stabilizing the failure

Level: Structural condition

Meaning: A persistent configuration making the mechanism likely, invisible, or difficult to correct

An unsupported answer is an observation. Repeated unsupported answers under low-evidence conditions may be a symptom. A materially relied-upon false answer may be a failure event. Pressure to provide fluent completion despite evidence insufficiency may be a mechanism. A workflow that rewards answer production while no uncertainty signal can trigger HOLD may be a structural condition.

Symptom substitution occurs where a local observation is treated as the complete failure model. It may produce disclaimers, phrase bans, classifiers, additional reviewers, or policy language that reduce the symptom while preserving the mechanism.

For each governance-relevant symptom, the record should identify:

observed condition;
affected object;
relevant purpose;
interested parties;
candidate mechanism;
alternative explanations;
attribution evidence;
expected recurrence;
intervention point;
correction consequence.

11.1 Recursive evidence production

A particularly important structural mechanism is the ungoverned evaluation loop:

Recommendation → Ranking → Priority → Decision → Record → Future Evidence → Model Input

Prior predictions may shape enforcement data, prior hiring rankings may shape the profile of successful employees, and prior content recommendations may shape the engagement later used to validate the same recommendation system. Local bias or inconsistency may therefore be the visible symptom of recursive evidence production.

PPDC does not require one universal mechanism. The same observed failure may arise through technical design, data, incentive, authority fragmentation, cognitive shortcut, public grammar, burden transfer, or correction failure. It therefore distinguishes:

Constitutive Condition ≠ Causal Mechanism ≠ Observable Signal ≠ Outcome

Only after the upstream structure is explicit can observations be interpreted coherently as governance signals.

—

Part III — Evidence, Metrics, Thresholds, Authority, and Permission

12. Signals and Observability

A failure structure becomes governable only where relevant conditions can enter an evidence architecture. A real failure may be unobservable. Abundant data may fail to represent the relevant failure.

Observable Data ≠ Governance Signal

An observation is a recorded feature of an AI event, system state, workflow, institutional process, or external outcome. A governance signal is an observation, or structured set of observations, with a specified evidentiary relationship to a defined failure mechanism or permission-relevant condition.

Let:

s_j=(o_s,t_s,p_s,q_s,f_s,u_s)

where o_s is the observed object; t_s, time and period; p_s, provenance; q_s, evidence quality; f_s, linked failure hypothesis; and u_s, uncertainty and alternative interpretation.

A signal is governance-relevant only where the record identifies what was observed, how it was observed, what mechanism it may represent, which other explanations remain, which permission state it can affect, and what additional evidence is required.

12.1 Signal classes

Structured table:

Signal class: Direct event signal

Function: Records an output, action, release, permission change, or runtime condition

Signal class: Derived technical signal

Function: Results from a test, classifier, evaluator, or statistical process

Signal class: Institutional signal

Function: Records a policy conflict, authority failure, missed review, or implementation gap

Signal class: Affected-party signal

Function: Records complaint, appeal, counterevidence, or burden incidence

Signal class: Comparative signal

Function: Compares the configuration with a baseline, alternative, or prior version

Signal class: Persistence signal

Function: Indicates recurrence, drift, repeated failure, or correction latency

A mature architecture should preserve positive signals, negative signals, and conflict signals. Evaluator disagreement may indicate evidence insufficiency, divergent authority assumptions, inconsistent policy interpretation, different harm valuations, or unresolved tradeoffs. It should not always be erased through one synthetic score.

12.2 Absence, unobservability, and non-activation

Three states must remain distinct:

Absent: evidence supports that the function or condition is not present.
Unobservable: available evidence cannot establish presence or absence.
Not activated: the relevant path existed but was not triggered during the period.

Public silence does not establish absence. A declaration that a process exists does not establish operation. Where evidence cannot establish or reject the condition, the correct result is Unobservable or Indeterminate.

12.3 Signal admissibility

Let Adm(s_j) denote admissibility. A minimum formulation is:

Adm(s_j) ⇔ O_j ∧ P_j ∧ F_j ∧ T_j ∧ Q_j

where O_j is object relevance; P_j, adequate provenance; F_j, failure-link specification; T_j, temporal relevance; and Q_j, minimum evidence quality.

Observability should be derived from the failure structure. For every mechanism, the analyst should identify where it leaves evidence, who can observe it, how quickly it becomes visible, what attribution limits remain, which decision it can influence, and what blind spots persist.

13. Metrics and Calibration

Signals do not interpret themselves. A metric is a structured rule converting one or more signals into a reproducible assessment relevant to governance.

m_k=φ_k(s_1,s_2,ldots,s_n| c,v)

where φ_k is the computation or interpretive rule; c, context; and v, metric version. Metrics may be scalar, categorical, ordinal, rule-based, conjunctive, human-coded, or mixed.

Every decision-bearing metric should specify identity, version, purpose, failure link, input signals, computation, scope, baseline, calibration, uncertainty, missing-data rule, failure behavior, threshold links, owner, and review date.

Metrics should be classified as:

External-outcome metrics, directly measuring the evaluative purpose;
Validated leading indicators, empirically linked to external outcomes;
Operational diagnostics, measuring system condition without proving purpose achievement;
Unvalidated proxies, convenient indicators whose relationship to the relevant outcome remains inadequately established.

The governing rule is:

Internal Indicator Alone ⇏ Continuation Authorization

13.1 Metric convenience drift

Metric convenience drift occurs when a metric is selected or preserved primarily because it is available, inexpensive, automatable, or institutionally familiar rather than because it represents the failure mechanism or purpose.

Available Signal → Convenient Metric → Institutional KPI → Decision Criterion → Public Reality

Simplicity is not itself a defect. The issue is whether convenience displaced validity.

13.2 Calibration, uncertainty, and missing data

Calibration asks whether metric values correspond adequately to the condition they represent. It should be specific to domain, population, system version, operating context, decision consequence, and time. A score calibrated for advisory use may be invalid for autonomous action.

Metrics should preserve measurement error, incomplete observability, label uncertainty, sampling error, population shift, evaluator disagreement, model instability, causal ambiguity, normative disagreement, and threshold uncertainty. States may include High Confidence, Moderate Confidence, Low Confidence, Contested, Incomplete, or Not Computable.

Missing-data behavior should be predeclared. Depending on risk and reversibility, missing information may be treated as uncertainty, require manual review, trigger RESTRICT or HOLD, or exclude the metric from the decision.

13.3 Metric minimality

Governance does not improve monotonically with metric count. Each metric creates collection cost, interpretation burden, maintenance, conflict, gaming opportunity, and configuration debt. A metric should remain only where it has identifiable decision value.

Metric Retention ⇒ Demonstrable Decision Relevance

14. Thresholds, Materiality, and Uncertainty

A metric becomes decision-bearing only where a rule determines when its value changes governance. A threshold connects a measured condition to a decision requirement:

Θ_k:m_k → (No Change,Review,RESTRICT,HOLD,ROLLBACK)

Thresholds may be hard, soft, conjunctive, disjunctive, comparative, temporal, authority-dependent, or reversibility-dependent. A soft threshold may create a presumption or mandatory review rather than automatic determination.

Materiality identifies when a condition is sufficiently consequential to justify governance change. It may consider severity, probability, irreversibility, scale, duration, rights impact, burden concentration, uncertainty, strategic dependence, correction cost, deadline, and affected-party vulnerability. Materiality is partly normative. PPDC does not eliminate value judgments; it requires them to be declared.

A justified threshold identifies the failure mechanism, consequence, evidence basis, uncertainty treatment, credible baseline, false-positive and false-negative costs, escalation path, review interval, and recalibration condition.

Threshold absence exists where a metric describes a condition but no rule determines what happens. A risk dashboard without escalation, a drift score without deployment response, or repeated appeal without a system-level trigger converts evaluation into observation.

Threshold gaming occurs where actors manipulate definitions, denominators, evidence windows, aggregation, exceptions, reporting timing, or object scale to avoid activation. Mitigation includes version history, independent review, alternative formulations, affected-party evidence, transparent exception records, and periodic recalibration.

14.1 Uncertainty-sensitive gates

Uncertainty does not always reduce governance pressure. A high-risk condition with weak evidence may justify HOLD because the system cannot establish safety adequately. Low-impact reversible uncertainty may permit SHIP with monitoring or RESTRICT. High-impact, partly irreversible uncertainty generally strengthens the case for HOLD.

Let t_L denote the latest point at which intervention can still alter the relevant consequence. A review completed after t_L may provide accountability, remedy, or learning without providing prospective correction.

15. Authority Architecture

Evidence and thresholds cannot change a system without an authorized decision locus. Authority should be decomposed into purpose authority, evidence authority, metric authority, threshold authority, review authority, gate authority, implementation authority, remedial authority, and reauthorization authority.

PPDC distinguishes:

formal authority: recognized right to decide;
real authority: effective control over the decision;
implementation control: ability to alter the governed object;
burden incidence: exposure to consequences.

Aghion and Tirole’s distinction between formal authority—the right to decide—and real authority—effective control over decisions—is directly relevant because information structures, overload, urgency, measurement systems, and organizational multiplicity may produce substantial divergence (Aghion and Tirole, 1997).

Authority adequacy also requires competence, evidence access, conflict-of-interest control, accountability, proportional representation of affected interests, and ability to implement. Technical competence does not create public sovereignty. Democratic authorization does not create technical knowledge. Affected status does not create complete expertise. Mature governance connects differentiated sources of authority rather than collapsing them.

Where functions are distributed, governance depends on handoffs. A handoff requires an authorized transfer rule or observed transmission, a defined object, recipient responsibility, timing, and escalation. The existence of a sender and receiver does not prove a functioning interface.

Authority ambiguity—where multiple actors assume another must decide, no actor possesses complete jurisdiction, or legal and technical authority conflict—should generally trigger escalation or HOLD in high-impact cases.

16. Evidence-to-Gate Decision Record

A gate should not be generated from one score. The minimum decision object is a structured record:

D_g=(o,Ω,E,M,Θ,U,A,B,Λ,Ξ,Π)

where o is the object; Ω, purpose; E, evidence package; M, metric profile; Θ, thresholds; U, uncertainty; A, authority map; B, baselines; Λ, distribution of benefit, burden, knowledge, and authority; Ξ, reversibility and exit; and Π, applicable policy and permission rules.

The record should preserve object and version, domain, deadline, evidence sources, signal states, metric values, thresholds, uncertainty, competing interpretations, evaluator disagreement, affected-party evidence, authority basis, gate, rationale, implementation owner, required state change, verification method, expiration, appeal, and reauthorization condition.

It should distinguish:

declared rule;
contractual or formal rule;
operational control;
monitoring;
breach detection;
enforcement;
remedy.

A public clarification may change the declared state without changing contractual or technical permission. A contract may establish a restriction without monitoring. A technical control may exist without legitimate authorization.

17. SHIP , RESTRICT , HOLD , and ROLLBACK

17.1 SHIP

SHIP authorizes the object to proceed under defined scope and conditions. It is justified where the purpose remains clear, the problem model is adequate, evidence is sufficient, thresholds are satisfied, material risk is acceptable, burden is visible and proportionate, authority is valid, uncertainty is bounded, correction remains operative, and restriction or rollback remains feasible.

SHIP does not mean risk-free or permanently approved. The record should specify scope, version, permissions, monitoring, expiration, reassessment, and rollback conditions.

17.2 RESTRICT

RESTRICT permits continuation under defined limitations such as reduced scope, reduced autonomy, read-only operation, disabled tool access, limited population, human confirmation, stronger disclosure, increased monitoring, shorter authorization, or prohibition on downstream reliance.

It is appropriate where useful external benefit remains but one component creates disproportionate risk or burden, uncertainty can be managed by narrowing, a simpler baseline performs nearly as well, or exit capacity must be restored gradually.

17.3 HOLD

HOLD suspends action pending evidence, review, authority clarification, correction, or reauthorization. It is appropriate where evidence is insufficient, disagreement is material, authority is unclear, the object is ambiguous, thresholds cannot be applied, risk is high, reversibility is limited, policy conflict is unresolved, implementation cannot be verified, or the correction path is absent in a high-impact domain.

HOLD is not rejection. It should state the reason, evidence required, responsible actor, review deadline, permitted interim activity, and transition criteria. A bounded experimental HOLD may permit research without granting ordinary deployment authorization.

17.4 ROLLBACK

ROLLBACK requires reversal, withdrawal, disabling, downgrade, termination, or restoration to a safer prior state. It is appropriate where a material boundary was crossed, serious harm occurred or is imminent, a release failed, persistent drift cannot be managed by restriction, authority was exceeded, implementation diverged from authorization, the problem model became invalid, or prior permission relied on materially false or incomplete evidence.

No Determination and Conflicting Determinations may be recorded without becoming fifth canonical gates. They should usually trigger escalation or HOLD until a valid authority rule resolves them.

18. From Gate to Implemented Permission

A gate is not operative until the governed object changes state.

Implementation may be technical, procedural, contractual, informational, or remedial. Every gate should identify implementation owner, required action, affected system, deadline, verification method, bypass risk, rollback target, and completion evidence.

Declared implementation is not completed implementation. Partial implementation may occur where one interface changes while another remains active, new users are restricted while existing users continue, or a contract changes while technical access persists.

Implementation failure occurs where the gate is not executed, execution is delayed beyond t_L, the action is ineffective, a bypass remains, the implementer lacks authority, the state cannot be verified, or the system reverts without reauthorization.

Define correction latency as:

L_c=t_I-t_R

where t_R is the time at which the material trigger became recognizable and t_I the time at which the permission state was implemented. Prospective correction requires:

t_I ≤ t_L

Where t_I>t_L, the process may still provide accountability, remedy, and learning but did not prevent the relevant consequence.

19. Evidence Boundaries and Indeterminate Results

A rigorous governance framework must permit non-classification. Each material claim may be coded Supported, Partially Supported, Contested, Unobservable, Unsupported, Not Applicable — Justified, or Applicability Indeterminate.

Indeterminate is appropriate where evidence cannot establish or reject a required condition. It is not a weak positive, presumption of safety, or finding of failure. Its gate consequence depends on risk and reversibility. A low-impact reversible event may SHIP or RESTRICT despite analytical indeterminacy; a high-impact irreversible event may require HOLD.

Formal equations and metrics can create false precision. Mitigation requires explicit uncertainty categories, competing interpretations, documented assumptions, non-scalar profiles, authority to reopen decisions, and publication of evidentiary limits.

Confidentiality does not make governance impossible. Cleared review, compartmented evidence access, protected audit, confidential appeal, aggregate reporting, and state-specific gates may preserve differentiated accountability. Universal disclosure is not required; unreviewable secrecy is insufficient.

The governing principle of Part III is:

Evaluation becomes governance only when admissible evidence crosses a justified threshold, reaches a competent authority, produces a proportionate gate, and changes the permission state of the governed object.

—

Part IV — Correction and Governance Reliability

20. Criticism, Feedback, Review, and Correction

AI governance systems frequently use the language of learning, responsiveness, oversight, and continuous improvement. They receive complaints, collect evaluator feedback, conduct audits, convene review boards, produce incident reports, revise documentation, add monitoring, and publish explanations. These activities may be important. They should not be classified automatically as correction.

The relevant sequence is:

Criticism → Feedback → Review → Judgment → Gate → Implemented Correction

Criticism is an objection, anomaly report, dissenting interpretation, failure finding, appeal, or claim that the existing decision or decision structure may be inadequate. Feedback is transmission of information about performance, effects, errors, burdens, or disagreement into a receiving process. Review is examination by a competent actor or forum. Judgment is a reasoned conclusion concerning the object. None is yet operative correction.

PPDC distinguishes:

decision correction: the authorized gate or decision changes;
operative correction: the changed decision produces a verifiable alteration in the governed regime.

Evaluation → Gate Change

constitutes decision correction, while:

Gate Change → Implemented State Change

constitutes operative correction.

Correction should identify its object: output, individual decision, record, workflow, metric, threshold, authority, policy, configuration, model, problem model, or institution. A local failure should not trigger unnecessary architectural redesign; a structural mechanism should not be treated as an isolated output error.

A candidate correction-depth scale is:

Structured table:

Level: C0 — Cosmetic

Correction depth: Wording or presentation changes

Level: C1 — Event

Correction depth: One output or action changes

Level: C2 — Decision

Correction depth: Classification or gate changes

Level: C3 — Procedure

Correction depth: Workflow, review, or evidence rule changes

Level: C4 — Architecture

Correction depth: Metric, threshold, authority, or technical design changes

Level: C5 — Problem Model

Correction depth: Purpose, need profile, object, or institutional structure changes

The criterion is mechanism–correction fit: correction should operate at the lowest level capable of interrupting the identified failure mechanism reliably.

21. Soft Closure and Symbolic Governance

Soft closure is the condition in which a governance regime permits criticism, feedback, appeal, audit, transparency, or review while preventing those mechanisms from changing the operative decision structure.

Criticism → Feedback → Review not → Operative Correction

The system may be communicative and procedurally sophisticated. Its closure lies at the point of consequence.

Symbolic governance consists of artifacts that express responsibility or control without a reliable connection to permission: a dashboard whose alerts cannot trigger a gate, an appeal whose reviewer lacks reversal authority, an audit that cannot alter policy or deployment, a safety board whose decisions are advisory, a HOLD decision without technical enforcement, or a human reviewer operating as a rubber stamp.

These artifacts may still create visibility, preserve evidence, support understanding, and improve communication. They become misleading where their presence is treated as proof that the regime can correct itself.

A genuinely corrective regime must be vulnerable in a controlled sense. It must permit valid evidence to change what counts as admissible, how the system is measured, which threshold applies, who possesses authority, what the system may do, whether it remains deployed, and whether prior consequences are remedied.

21.1 Correction absorption and Core/Shell instability

Correction absorption occurs where criticism is incorporated into language or documentation without altering the operative regime. A disclaimer may be added after a reasoning failure, an appeal form created without reversal authority, or a new KPI introduced to report the failure without influencing permission.

The Shell is the visible layer of policy, cautious language, explanations, model cards, audits, safety framing, and compliance documentation. The Core is the deeper structure of evidence handling, uncertainty preservation, problem definition, metric validity, authority, threshold discipline, gate consequence, and correction.

A Core/Shell diagnostic compares declared safeguards with behavior under pressure:

Structured table:

Shell claim: “Uncertainty is preserved”

Core test: Does uncertainty survive paraphrase and release pressure?

Shell claim: “Human oversight is present”

Core test: Can the reviewer stop or reverse the action?

Shell claim: “Appeal is available”

Core test: Can appeal alter classification, threshold, or policy?

Shell claim: “The system is monitored”

Core test: Does monitoring alter permission?

Shell claim: “The system learns”

Core test: Are repeated mechanisms interrupted?

The operational test of openness is:

Can a valid correction signal alter permission, decision, criterion, authority, implementation, or problem model before the relevant opportunity closes?

22. Audit, Replay, Remedy, and Reauthorization

A correction-capable regime must preserve more than a historical record. It must preserve the ability to reconstruct, reassess, repair, and prospectively reauthorize the decision regime.

An audit examines whether the derivation and implementation complied with applicable rules, evidence requirements, authority boundaries, and decision conditions. Audit becomes correction-relevant where findings can trigger gate review, threshold revision, metric retirement, redesign, remedy, reauthorization, or rollback.

A minimum audit record should reconstruct:

Problem → Evidence → Metric → Threshold → Authority → Gate → Implementation

Replay is the capacity to reconstruct or re-execute relevant parts of the decision. It may be technical, procedural, counterfactual, or institutional. Replayable records may require the model and configuration version, prompt or instruction state, tool access, inputs, policy version, metric version, threshold, evaluator record, authority map, system time, implementation state, and affected-party corrections.

Exact replay may be impossible for non-deterministic systems. Bounded or distributional replay may still permit analysis. Replay failure limits causal diagnosis, remedy, threshold calibration, inter-rater evaluation, and reauthorization.

Remedy addresses realized consequences through decision reversal, restoration of access, record correction, notification, compensation, renewed review, deletion of invalid data, withdrawal of advice, or restoration of a prior safe state. Correction and remedy should be coded independently: a regime may correct future behavior while leaving earlier harm unaddressed, or provide remedy in one case while preserving the failure mechanism.

Reauthorization is an affirmative decision that permission remains justified after a defined period or material change. It should reassess object, purpose, need profiles, context, baseline, benefit, burden, evidence, metrics, thresholds, authority, reversibility, and correction performance. It may be triggered by expiration, model change, new autonomy, new tools, population change, domain expansion, policy change, serious incident, drift, repeated appeal, changed baseline, altered authority, or increased dependency.

A prior authorization should not be inherited automatically by a materially changed object. If g_t was authorized at time t and g_(t+1) differs materially, continuation requires a new decision.

23. Capability-to-Authority Transition

AI systems may possess impressive formal capabilities: reasoning, planning, classification, prediction, retrieval, code execution, tool use, summarization, and recommendation. These capabilities do not establish governance authority.

Capability → Utilization → Judgment → Authority → Correction → Reliability

No transition is automatic.

Capability is the ability to perform an operation under specified conditions. Utilization is deployment inside an actual decision problem. Judgment is a context-sensitive conclusion about what should be believed or done. Authority is recognized and operational permission to decide or act within a bounded domain. Corrective utilization is the ability to revise the frame, metric, authority, threshold, procedure, decision, or problem model. Corrective intelligence is the capacity to convert evaluation or failure evidence into changed permission, authority, restriction, escalation, redesign, or rollback (Dunavich, 2026c).

Authority inflation occurs where observed capability is treated as evidence that the system should receive broader operational permission:

High Performance → Perceived Intelligence → Expanded Trust → Expanded Authority

PPDC requires a firewall:

Performance Evidence ⇏ Authority Expansion

Authority expansion requires a separate decision record covering purpose, object, new actions, evidence, risk, affected parties, correction, rollback, and expiration.

The same rule applies to human oversight. A human does not become a corrective authority merely because a workflow contains a human checkpoint. The reviewer must possess competence, evidence, time, formal authority, real authority, implementation access, and accountability.

Authority may also expand informally: recommendations become default decisions, optional scores mandatory rankings, drafting support autonomous communication, or advisory triage effective denial. This capability-to-authority drift is a material configuration change requiring reauthorization.

24. Governance Reliability Under Repetition

A governance regime is not reliable because it corrected one event. It is reliable where corrective capacity remains functional across repeated and changing conditions.

For a family of governance instances (G), define governance reliability conceptually as:

GR_((G)) =f(Av,Ac,Co,Ti,Fi,Du,Le,Ad)

where:

Av = path availability;
Ac = activation after a qualifying trigger;
Co = completion through the permission endpoint;
Ti = timeliness;
Fi = implementation fidelity;
Du = durability;
Le = learning and recurrence reduction;
Ad = adaptation under change.

The expression is a measurement architecture, not a universal statistical law.

Three dimensions should be coded separately:

Design State: what the arrangement provides independently of observed outcomes;
Episode Performance: what happened when the path was or should have been activated;
Durability: whether performance represents institutionalized capacity, leadership dependence, exception dependence, one-time improvisation, or an unobservable state.

A governance episode may be represented as:

R_e=(DS,EP,DU)

This prevents one successful intervention from proving architecture and one failed intervention from proving structural absence.

An Acute Corrective Execution Failure is an activated failure where the underlying design is formally adequate or insufficiently observable. A Structural Deficiency requires independent design evidence that a necessary function, authority, handoff, or implementation path is absent or materially inadequate.

One Failed Episode ⇏ Structural Deficiency

Persistence exists where a deficiency becomes credibly visible, a meaningful repair opportunity exists, and the same or functionally equivalent deficiency continues or recurs.

Reliability should be assessed across availability, recognizability, activation, evidence access, authority clarity, completion, timeliness, fidelity, remedy, replay, learning, reauthorization, and durability. Stress conditions include time pressure, authority conflict, evidence conflict, secrecy, workload, leadership turnover, dependency, public pressure, novel conditions, and repeated events.

24.1 Candidate reliability metrics

Criticism-to-Correction Conversion Rate

CCR= fraction (Valid criticism events requiring correction)

Evaluation-to-Gate Conversion Rate

EGR= fraction (Decision-relevant evaluations)

Gate Implementation Rate

GIR= fraction (Gate decisions requiring implementation)

Timely Correction Rate

TCR= fraction (Corrections requiring prospective completion)

Other measures include recurrence rate, replayability completeness, substantive reauthorization rate, override-trace completeness, and correction latency.

A scalar governance-reliability score may conceal decisive weaknesses. The preferred form is a profile. A system may have high activation but poor implementation, strong implementation but weak legitimacy, low latency but poor evidence, or good auditability without remedy.

The governing principle of Part IV is:

A governance regime becomes reliable only when its correction path is not merely available on paper or successful once, but repeatedly capable of converting material evidence into timely, implemented, reviewable, and durable changes in permission.

—

Part V — CEP and LoopGuard-AI

25. CEP as a Candidate Problem-Model Generator

PPDC does not presuppose one substantive theory of institutional, cognitive, or technological failure. Its upstream problem model may draw from safety engineering, control theory, organizational theory, institutional economics, principal–agent analysis, human factors, cybersecurity, rights-based analysis, political theory, public administration, or domain-specific clinical and legal frameworks.

The Central Equilibrium Problem (CEP) enters as one candidate problem-model generator. It is not a constitutive premise of PPDC.

Let:

(P)=(P_1,P_2,ldots,P_n)

be the admissible family of problem models. Then:

CEP ∈ (P)

but:

PPDC ⇏ CEP

and:

CEP ⇏ PPDC

A governance regime may satisfy PPDC using a non-CEP model. An institution may use CEP language while failing to derive signals, thresholds, authority, implementation, or correction.

CEP generates hypotheses about why a decision regime may become stable but inferior, locally rational but globally damaging, institutionally acceptable but weakly correctable, information-asymmetric, resistant to unilateral deviation, and self-reinforcing through language, incentives, metrics, and authority. Its governing question is:

What structure makes the current decision regime stable, and what would have to change for a better regime to become viable?

A bounded CEP-sensitive problem record may be represented as:

P^(CEP)_g=(N,Σ,(U),(I),(B),K,ρ,E^*,Δ,Q)

where N is the player set; Σ, strategies; (U), incentives and burdens; (I), information distribution; (B), beliefs; K, common knowledge; ρ, recurrence and memory; E^*, candidate equilibrium; Δ, deviation cost; and Q, coordination or transition mechanism. The calligraphic symbols are local to this CEP schema and do not replace the canonical PPDC notation.

This is a theory-construction schema, not a solved game or proof that every persistence pattern is a Nash equilibrium.

CEP is useful where local roles create reasons to preserve a globally inferior regime, where actors control different evidence, where private recognition does not become common knowledge, where dependency raises deviation cost, and where correction requires coordination. It translates into PPDC through interested-party maps, available strategies, persistence hypotheses, evidence-access requirements, recurrence analysis, exit conditions, and transition mechanisms.

A CEP-sensitive explanation weakens where the failure does not recur, actors correct it readily without coordination, information is shared adequately, unilateral deviation is inexpensive, no stable incentive pattern is present, a simpler technical explanation fits better, or the proposed alternative is not feasible.

25.1 Aumann claim boundary

Robert J. Aumann’s work enters as a methodological influence, not external validation. The article does not claim that Aumann formulated CEP, endorsed it, proved it, designed LoopGuard-AI, or treated AI governance in the form developed here. The defensible relationship is that his work on repeated games, knowledge, common knowledge, incomplete information, correlated equilibrium, and equilibrium stability supplied part of the intellectual discipline through which CEP was formulated (Dunavich, 2026d).

26. Repeated Games, Information, and Equilibrium Stability

Output-level governance asks whether one event was acceptable. Repeated-regime governance asks what pattern of authorization, evidence, correction, and continuation the system is stabilizing.

For repeated analysis, the object becomes a family:

(G)=(g_1,g_2,ldots,g_n)

The analyst examines gate distribution, recurrence, overrides, correction latency, metric and threshold changes, authority shifts, burden migration, reauthorization, policy exceptions, and implementation fidelity.

A weak decision becomes equilibrium-like where it recurs, relevant actors possess local reasons to preserve it, deviation is costly or non-credible, information or authority is fragmented, criticism fails to change permission, and continuation becomes normalized. “Equilibrium-like” should be used where the game is not formalized sufficiently to establish a mathematical equilibrium.

A candidate structural signal is that the regime rewards closure more strongly than correction through release incentives, compressed deadlines, penalties for dissent, reputational protection, continuation bias, metric targets, fragmented responsibility, or high rollback cost.

Institutional acceptability may be a legitimate reliability and coordination signal. It becomes problematic where acceptance substitutes for examination or is extended beyond its evidentiary scope. Consensus may support the claim that a view is professionally dominant. It does not automatically support metaphysical closure, correctness outside the validated domain, or immunity from revision.

A gate may function as a coordination signal where it is commonly interpretable, authority-backed, implemented, expected to be enforced, reviewable, and durable. The strategic force of SHIP, RESTRICT, HOLD, or ROLLBACK depends on credible implementation rather than the label alone.

CEP generates testable hypotheses about recurrence, local rationality, information asymmetry, common knowledge, closure reward, coordination, and dependency. These are not established findings.

27. S4-Sensitive Language-Model Inconsistency as a Bounded Case

The S4 language-model case demonstrates how CEP may generate a candidate failure structure. It is not a general theory of language-model inconsistency.

The candidate problem is not whether a model endorses or rejects a scientific theory. It is whether an instruction-following model can preserve distinctions among empirical findings, formal models, institutional consensus, philosophical interpretation, public ontology, and meta-theoretical extension.

Within CEP, S4 is a model-internal candidate state associated with stable closure and weak correction. An S4 classification is a hypothesis-generating interpretation, not empirical proof.

The bounded hypothesis is:

A model trained and evaluated within a human knowledge regime may inherit not only factual content but unresolved asymmetries in how that regime distinguishes empirical evidence, authority, consensus, and meta-theoretical extension.

The claim concerns training corpora, evaluator preferences, policy norms, institutional text, and acceptable-answer patterns. It does not attribute consciousness or ideological intention to the model.

Consensus substitution occurs where the model uses institutional consensus as a substitute for performing the conceptual discrimination requested by the problem. Consensus is often a valuable signal. The failure occurs where the model moves from “consensus supports proposition X” to “X resolves every conceptual question associated with the domain.”

A level-collapse failure occurs where a response fails to preserve distinctions among observation, biological process, population model, historical interpretation, philosophical extension, ontological claim, and metaphor.

A bounded genetics-based diagnostic asks whether a model can distinguish locus, allele, genotype frequency, allele frequency, hereditary variation, population-level adaptation, ontogenesis, and broader uses of development or evolution. The test does not ask the model to reject evolutionary biology. It asks whether evidence at one level is treated as automatic warrant at another (Dunavich, 2026e).

A sufficiently stable response should explain standard empirical biology accurately, recognize the value of scientific consensus, distinguish consensus from conceptual closure, separate ontogenesis from population-level variation, identify metaphorical or philosophical extension, preserve uncertainty, present CEP as a candidate interpretation, and reject unsupported conclusions on either side. The expected behavior is coherent discrimination rather than agreement.

A candidate S4 failure structure is:

F^(S4)=(CS,LC,CA,EP,BC)

where CS is consensus substitution; LC, level collapse; CA, authority–evidence conflation; EP, epistemic-policy inconsistency; and BC, boundary-control failure.

Candidate signals include treating every boundary critique as rejection of science, collapsing ontogenesis and population change, invoking consensus as the terminating argument, shifting claim status under paraphrase, applying different evidentiary standards to structurally comparable questions, and suppressing uncertainty under authority pressure. Candidate metrics include distinction-preservation, claim-status accuracy, consensus-substitution rate, paraphrase stability, and authority–evidence separation. All require coding protocols, calibration, inter-rater reliability, and external validation.

The S4 hypothesis weakens where models preserve distinctions reliably, failures are better explained by prompt ambiguity or technical noise, consensus pressure does not predict failure, similar failures occur equally in unrelated domains, ordinary calibration resolves the issue, or human coders cannot apply the distinctions reliably.

28. LoopGuard-AI as Reference Translation Logic

LoopGuard-AI enters as a candidate architectural translation of evaluation into permission. It is not evidence that CEP is true, PPDC is valid, or the architecture works in production.

Its proposed function is:

AI Event → Evidence → Evaluation → Policy and Threshold → Gate → Audit Record

Its canonical outputs are SHIP, RESTRICT, HOLD, and ROLLBACK. It is best understood as reference translation logic: a concrete illustration of how failure hypotheses, signals, evidence, authority status, reversibility, policy conflict, drift, Core/Shell instability, and rollback pressure might become an explicit permission decision.

A candidate mapping is:

Structured table:

PPDC component: P Problem Model

LoopGuard-AI artifact: Scenario, purpose, interested-party and domain definition

PPDC component: F Failure Structure

LoopGuard-AI artifact: Risk and instability model

PPDC component: S Signals

LoopGuard-AI artifact: Signal objects and event artifacts

PPDC component: M Metrics

LoopGuard-AI artifact: Versioned MetricResult objects

PPDC component: Θ Thresholds

LoopGuard-AI artifact: Policy-pack rules and blockers

PPDC component: A Authority

LoopGuard-AI artifact: Roles, review rights, override rules

PPDC component: G Gate

LoopGuard-AI artifact: DecisionPackage

PPDC component: I Implementation

LoopGuard-AI artifact: External enforcement or workflow action

PPDC component: R Review and Reauthorization

LoopGuard-AI artifact: Event log, EvidenceBundle, replay, override and renewal record

An event log preserves replay, reprocessing, audit reconstruction, and recurrence analysis. A metric plug-in specifies identity, version, inputs, computation, scope, calibration, confidence, explanation, and failure behavior. A policy pack defines thresholds, blockers, evidence requirements, precedence, override rules, escalation, and domain tolerances. A DecisionPackage contains the object, gate, rationale, triggered rules, evidence, uncertainty, blocking conditions, actions, authority, timestamp, and expiration. An EvidenceBundle makes the derivation exportable and independently reviewable (Dunavich, 2026f).

A human override is not inherently superior to the original gate. It should record actor, role, authority, evidence, reason, approval, scope, expiry, implementation, and later outcome.

LoopGuard-AI may produce a gate without controlling the external system:

LoopGuard Gate ⇏ Implemented Permission State

A complete implementation requires an enforcement or institutional handoff from the decision package to the governed object. A decision engine also cannot manufacture legitimate authority. It may operationalize an authority map but cannot replace the legal, professional, democratic, or institutional source behind that authority.

PPDC therefore applies recursively to LoopGuard-AI itself. Its metrics, policy packs, thresholds, authority model, overrides, gate logic, implementation integration, audit process, and reauthorization should become governance objects. The architecture should be capable of receiving SHIP, RESTRICT, HOLD, or ROLLBACK for its own continued operation.

Its defensible maturity remains concept-stage to architecture-stage unless prototype, evaluation, pilot, production, certification, or independent-validation evidence is attached.

29. Four-Layer Relationship and Non-Circularity

The relationship among CEP, S4, LoopGuard-AI, and PPDC is:

Layer 1 — CEP

Generates candidate explanations for stable, repeated, locally rational, correction-resistant regimes.

Layer 2 — S4-sensitive diagnostic case

Supplies one bounded hypothesis concerning epistemic asymmetry and level-collapse behavior.

Layer 3 — LoopGuard-AI

Translates candidate signals and governance conditions into explicit permission decisions and audit artifacts.

Layer 4 — PPDC

Evaluates whether the complete derivation from problem model to implemented and reviewable permission state is intact.

The structure is:

CEP → Candidate Failure Hypotheses

S4 → Bounded Diagnostic Case

LoopGuard-AI → Reference Translation Logic

PPDC → Derivational Completeness Test

Internal coherence does not validate the layers empirically. The following circular inference is prohibited:

CEP defines a failure;
LoopGuard-AI detects the CEP-defined failure;
therefore CEP is true.

A result produced by CEP-sensitive metrics establishes only that the architecture classified the event according to its own rules. External validation requires independent cases, rival explanations, blinded coding, calibrated metrics, holdout samples, inter-rater reliability, prospective prediction, intervention comparison, outcome evidence, and independent review.

The governing principle of Part V is:

A substantive theory may generate the failure model, and a governance architecture may translate that model into permission, but neither is validated until derivation, implementation, comparative value, and repeated correction performance are tested independently.

—

Part VI — Measurement and Research Program

30. Governance Derivation Record

PPDC becomes scientifically useful only where independent analysts can identify the same object, reconstruct the same derivation, locate the same breaks, and distinguish incomplete evidence from negative evidence. The principal empirical instrument proposed here is the Governance Derivation Record (GDR).

A GDR represents one bounded instance g=(o,d,a,b,T) through ten blocks:

Structured table:

Block: G0 — Unitization

Required content: Object, domain, configuration, baseline, period, event boundaries

Block: G1 — Upstream Model

Required content: Interested parties, need profiles, purpose, decision problem, burden, correction path

Block: G2 — Failure Structure

Required content: Mechanisms, persistence hypotheses, alternatives, disconfirming conditions

Block: G3 — Evidence

Required content: Signals, provenance, admissibility, uncertainty, observability limits

Block: G4 — Measurement

Required content: Metrics, versions, calibration, missing-data behavior, interpretive limits

Block: G5 — Decision Rules

Required content: Materiality, thresholds, precedence, conflict and expiration rules

Block: G6 — Authority

Required content: Formal authority, real authority, implementation control, standing and handoffs

Block: G7 — Permission

Required content: Gate, rationale, dissent, scope, duration and required actions

Block: G8 — Implementation

Required content: State change, owner, deadline, verification and bypass assessment

Block: G9 — Correction and Reliability

Required content: Audit, replay, remedy, recurrence, learning, reauthorization and durability

Unitization precedes classification. Analysts should identify actor-local, workflow, organizational, and cross-institutional objects; the minimum causally complete object; excluded actors; deadline; and permission endpoint. Agreement on a final PPDC status cannot compensate for disagreement over object selection.

An object-invariance test asks whether the identified derivation break remains present when the case is represented at the smallest causally complete level. Results may be Invariant, Conditionally Invariant, Object-Sensitive, Not Invariant, or Indeterminate.

Each node P,F,S,M,Θ,A,G,I,R receives an evidence state: Supported, Partially Supported, Contested, Unobservable, Unsupported, Not Applicable — Justified, or Applicability Indeterminate. Each relation τ(P,F) through τ(I,R) is coded Explicit and Supported, Implicit but Reconstructable, Partial, Contested, Unobservable, Broken, or Not Applicable — Justified.

A supported relation identifies upstream and downstream objects, derivation rule, owner, evidence, assumptions, alternatives, uncertainty, invalidation conditions, version, traceability direction, and review date.

The principal outcome classes are:

Complete: every applicable component and material relation is supported, the gate is implemented, and review or reauthorization is operative;
Conditionally Complete: completeness holds only within specified scope, evidence, authority, duration, or reversibility conditions;
Partially Complete: substantial portions are supported but material elements remain incomplete;
Derivationally Incomplete: at least one necessary component or relation is affirmatively broken;
Indeterminate: evidence cannot establish or reject completeness;
Outside Scope: no bounded permission-bearing object or relevant derivation exists.

The GDR reports derivational completeness, substantive adequacy, operational completion, governance reliability, legitimacy, accountability, and remedy separately.

31. PPDC Codebook and Derivation-Break Taxonomy

The codebook converts each component into inclusion, exclusion, and uncertainty rules.

P is Supported where materially affected parties, need profiles, purpose, decision problem, baseline, burden, and correction path are explicit.
F is Supported where a mechanism explains how the purpose may be defeated and states disconfirming conditions.
S is Supported where signals are provenance-preserved, object-linked, temporally relevant, mechanism-linked, and uncertainty-qualified.
M is Supported where metrics are defined, versioned, reproducible, scoped, calibrated or explicitly unvalidated, signal-linked, and decision-relevant.
Θ is Supported where material consequence, evidence basis, uncertainty treatment, activation rule, error costs, gate consequence, and review condition are explicit.
A is Supported where formal right, real control, evidence access, competence, implementation control, accountability, and conflict procedure are identifiable.
G is Supported where a bounded object receives a reasoned, state-specific SHIP, RESTRICT, HOLD, or ROLLBACK determination.
I is Supported where evidence shows that the object changed state consistently with the gate.
R is Supported where audit, replay, contestation, correction, remedy where relevant, learning, expiration, and reauthorization are operative.

The Derivation-Break taxonomy is:

Structured table:

Code: DB1

Derivation break: Interested-Party Opacity

Code: DB2

Derivation break: Need-Profile Substitution

Code: DB3

Derivation break: Problem-Object Ambiguity

Code: DB4

Derivation break: Failure-Mechanism Substitution

Code: DB5

Derivation break: Signal Orphaning

Code: DB6

Derivation break: Metric Convenience Drift

Code: DB7

Derivation break: Threshold Absence or Arbitrariness

Code: DB8

Derivation break: Decision-Sovereignty Disconnect

Code: DB9

Derivation break: Correction-Sovereignty Failure

Code: DB10

Derivation break: Symbolic Gate

Code: DB11

Derivation break: Implementation Failure

Code: DB12

Derivation break: Audit, Replay, or Reauthorization Closure

A case may contain several breaks. The record should distinguish primary break, upstream contributors, downstream consequences, and parallel breaks. Co-occurrence does not prove causal ordering.

One failed implementation should be coded as an Acute Corrective Execution Failure unless independent design or repeated-event evidence supports a structural DB11 finding. Indeterminate remains a valid output and must not be treated as pressure to complete the diagnosis.

32. Candidate Measurement Profile

PPDC should be measured through a multidimensional profile rather than one score.

32.1 Component Support Profile

CSP_g=(P,F,S,M,Θ,A,G,I,R)

with each component retaining its categorical evidence state.

32.2 Derivation Link Traceability

If L_g is the number of applicable adjacent derivation links and L_g^S the number supported:

DLT_g=(L_g^S) / (L_g)

The ratio is descriptive and does not establish correctness.

32.3 Bidirectional Traceability Rate

BTR_g= fraction (Applicable components)

32.4 Signal Admissibility Rate

SAR_g= fraction (Signals used in decision)

32.5 Decision-Bearing Metric Rate

DBMR_g= fraction (Metrics presented as governance metrics)

32.6 Threshold Justification Rate

TJR_g= fraction (Applicable thresholds)

32.7 Evaluation-to-Gate Conversion Rate

EGR= fraction (Decision-relevant evaluations)

32.8 Gate Implementation Rate

GIR= fraction (Gate determinations requiring state change)

32.9 Correction-Sovereignty Reach

CSR= fraction (Affected-party signals meeting evidence and standing rules)

32.10 Replayability and Reauthorization

Replayability completeness measures the proportion of required artifacts available with preserved integrity. Reauthorization completeness measures the proportion of material continuation decisions that reassess all relevant derivation fields.

32.11 Soft-closure profile

Soft closure should be represented through complaint volume, review activation, decision-change rate, threshold-change rate, policy-change rate, implementation rate, remedy rate, recurrence, and correction latency. The candidate pattern is high procedural activity combined with low operative change.

32.12 Governance burden

The governance mechanism itself creates evidence collection, annotation, review, delay, audit, appeal, threshold maintenance, implementation, remedy, and reauthorization work. A PPDC-complete regime may still be inferior if its complete governance burden materially displaces the purpose it protects. The hierarchy should remain:

Categorical Evidence States → Multidimensional Profiles → Domain-Specific Rules → Optional Scalarization

Scalarization should not occur until compensability, weights, sensitivity, and non-compensatory constraints are justified.

33. Reliability and Validation Design

A conceptual framework does not become validated through detailed vocabulary. Validation requires independent use.

The development set should include clear successful correction, clear derivation breaks, governance activity without implementation, one-time execution failure, proprietary non-observability, functioning governance with contested outcomes, and outside-scope near misses. Development cases cannot validate the framework because the constructs were shaped partly through them.

Independent coders should apply a draft codebook to a calibration set. Disagreement concerning object selection, interested-party materiality, failure mechanism, signal admissibility, relation validity, authority, implementation, and structural versus acute failure should produce codebook revision before routine adjudication.

A holdout set must contain cases not used in theory development. Where feasible, structural coding should occur before coders inspect episode outcomes, preventing a successful outcome from proving architecture or a failed outcome from proving structural absence.

Negative cases are essential: systems with extensive artifacts and real correction; systems with minimal formal structure but reliable correction; high-authority systems with legitimate and effective gates; high-impact systems where contestability is limited but proportionate; complex systems outperforming simpler baselines; and systems with repeated complaints but no material defect.

Reliability should be reported separately for object selection, component states, derivation relations, DB codes, authority mapping, gate classification, implementation status, and durability. Agreement on the final category alone is insufficient.

Validation should examine:

content validity: whether fields cover the proposed construct;
convergent validity: expected alignment with traceability, safety analysis, assurance, contestability, runtime assurance, and risk management;
discriminant validity: separation from document volume, general accountability, observability, organizational capacity, human oversight, and outcome correctness;
predictive validity: prediction of gate issuance, implementation, latency, recurrence, remedy, and drift;
incremental validity: added value after adjacent frameworks are applied.

The research sequence should proceed through construct and codebook development, synthetic scenarios, retrospective process tracing, longitudinal field observation, prospective interventions, domain transfer, and independent replication. Preregistration should specify objects, hypotheses, codebook versions, thresholds, outcomes, rival explanations, and stopping rules where feasible.

34. Research Hypotheses

The following hypotheses are candidates rather than findings.

H1 — Component Presence: complete PPDC node structures predict implemented permission determinations better than artifacts without a complete chain.

H2 — Relation Integrity: supported derivation relations predict implementation and recurrence better than node presence alone.

H3 — Upstream Definition: explicit parties, needs, baselines, burdens, and correction paths reduce proxy substitution and object ambiguity.

H4 — Metric Validity: metrics linked to failure mechanisms and external outcomes outperform operational diagnostics and unvalidated proxies.

H5 — Threshold Explicitness: predeclared materiality and thresholds increase consistency and reduce retrospective gaming.

H6 — State-Specific Authority: a bounded gate authority, or binding referral to one, predicts implemented correction beyond advisory review.

H7 — Formal–Real Authority: unacknowledged divergence between formal and real authority increases symbolic gates, implementation failure, and override opacity.

H8 — Handoff Reliability: trigger-relevant handoff performance predicts correction better than formal function presence.

H9 — Implementation: verified gate implementation predicts reduced recurrence more strongly than gate issuance alone.

H10 — Correction Sovereignty: affected-party evidence reaching regime-changing authority improves detection and correction of need-profile substitution.

H11 — Replayability: replayable decisions support better causal diagnosis, remedy, threshold recalibration, and inter-rater agreement.

H12 — Reauthorization: substantive expiration and renewal reduce scope, authority, metric, and correction drift.

H13 — Prospective Boundaries: problem definitions, thresholds, correction paths, and rollback conditions established before dependency are more durable and less costly to enforce.

H14 — Soft Closure: high procedural openness combined with low gate-change and implementation rates predicts recurrence and longer correction latency.

H15 — Governance Reliability: Design State, Episode Performance, and Durability jointly predict future correction better than any one dimension.

H16 — Governance Burden: PPDC-guided governance improves correction only where its complete burden remains proportionate to the evaluative purpose and credible alternatives.

H17 — Incremental Value: PPDC’s object, relation, permission, implementation, and correction variables add classification, prediction, or intervention value after adjacent frameworks are controlled.

Each hypothesis should be paired with a rival explanation and explicit falsification test rather than treated as self-confirming.

35. Study Designs

Candidate study designs include:

synthetic governance scenarios varying metric presence, threshold justification, advisory versus binding authority, implementation access, replayability, and correction sovereignty;
historical process tracing reconstructing trigger, evidence, review, gate, implementation, substitutes, and deadlines;
organizational audits of releases, agent deployments, serious incidents, appeals, red-team findings, rollback decisions, and failed reviews;
prospective release studies comparing existing governance, PPDC-augmented governance, and adjacent assurance approaches;
runtime intervention studies testing whether predeclared thresholds, implementation hooks, replay, and state-specific gates reduce harmful continuation without excessive false HOLD or ROLLBACK decisions;
contestability studies comparing information-only explanation, procedural appeal, reversal authority, and system-level policy correction;
reauthorization studies comparing permanent permission, automatic renewal, procedural renewal, and substantive renewal;
domain-transfer studies across model release, agentic tool use, healthcare, recruitment, insurance, public administration, finance, infrastructure, and scientific workflows.

Domain transfer should not assume identical thresholds, authority, or correction requirements.

36. Reduction and Originality Test

The strongest threat to PPDC is explanatory redundancy. The framework should not be defended merely because its vocabulary is coherent.

PPDC should be decomposed against requirements traceability, STPA, assurance cases, runtime assurance, AI RMF, ISO management and risk standards, algorithmic contestability, accountability, organizational authority, change control, and incident response.

A component is not independently novel where an adjacent framework already supplies the same function adequately. The principal candidate contribution is relational rather than proprietary ownership of every node. The residual proposal consists of:

the Governance Derivation Instance;
the upstream Problem Model;
explicit derivation relations;
the implemented permission endpoint;
DB1–DB12;
sovereignty cross-tests;
separation of completeness, adequacy, operational completion, and reliability;
recursive self-application.

The framework should be tested for incremental classification, prediction, and intervention. Does it distinguish complete but substantively wrong cases, correct but derivationally incomplete cases, accountable but non-operative review, symbolic gates, one-time correction without durability, or implementation without justified derivation? Do its variables improve prediction of gates, implementation, latency, recurrence, remedy, drift, and reauthorization? Do PPDC-derived interventions outperform more monitoring, more documentation, more human review, one additional safety metric, or one centralized authority?

Evidence may lead to:

Structured table:

Outcome: CONFIRM

Meaning: Reliable, discriminant, predictive, and incrementally useful

Outcome: NARROW

Meaning: Applicable only to defined domains or object classes

Outcome: SIMPLIFY

Meaning: Some distinctions add no value

Outcome: RECLASSIFY

Meaning: Best treated as an integrative diagnostic synthesis

Outcome: REJECT

Meaning: Coding is unreliable and incremental validity is absent

If PPDC adds no measurable value beyond adjacent tools, the defensible result is reclassification as an integrative governance-diagnostic synthesis and implementation crosswalk.

37. Principal Objections

37.1 “The framework is too demanding”

Real organizations often lack complete problem definitions, baselines, calibrated metrics, burden data, authority maps, and replay records. The demanding conjunction is nevertheless what prevents every imperfect system from being labeled governance failure. Partial results and Indeterminate findings remain useful. The empirical test is whether the stricter framework improves reliability and reduces false positives enough to justify its evidentiary cost.

37.2 “The chain is unrealistically linear”

Real governance is iterative, parallel, recursive, and distributed. PPDC specifies a logical derivation order, not a required temporal bureaucracy. Functions may be compressed or parallel if their relations remain reconstructable.

37.3 “PPDC confuses governance with engineering”

Political legitimacy, rights, judgment, and authority cannot be reduced to technical traceability. PPDC explicitly separates substantive adequacy, legitimacy, authority, accountability, and derivational completeness. Engineering traceability is an antecedent, not the full model.

37.4 “The framework is normative”

Purpose, materiality, burden, correction strength, and authority allocation involve values. PPDC distinguishes observable facts from evaluative rules and records who selected those rules and whose interests they represent. It provides analytical visibility rather than value-free resolution.

37.5 “Traceability does not establish truth”

Correct. A fully documented derivation may begin from false premises or unjust thresholds. PPDC is a necessary architecture for review, not a truth guarantee. Its value depends partly on whether complete records make substantive error easier to identify and correct.

37.6 “Judgment cannot be reduced to thresholds”

Thresholds may trigger review rather than dictate the final decision. They may be qualitative, procedural, context-dependent, or authority-sensitive. Expert discretion remains compatible with PPDC where it is explainable, bounded, reviewable, and connected to permission.

37.7 “Stopping authority can become domination”

A gate authority may suppress experimentation, centralize power, exploit uncertainty, or create veto paralysis. It must therefore be state-specific, evidence-bound, proportionate, reason-giving, appealable, time-limited, and reauthorized. Authority without constraints is arbitrary; review without authority is symbolic.

37.8 “Contestability will paralyze institutions”

Unbounded appeals can create delay, manipulation, and overload. PPDC requires burden-sensitive correction rather than unlimited veto. Standing, evidence, review, and remedy may be scoped, time-limited, risk-proportionate, and abuse-resistant. Empirical comparison must examine correction gains against delay and strategic use.

37.9 “Secrecy makes the framework impractical”

PPDC does not require universal disclosure. Cleared access, compartmented evidence, protected audit, confidential contestation, and bounded reporting may preserve review. Where evidence remains unavailable externally, the external result should be Indeterminate.

37.10 “A better baseline can always be imagined”

PPDC requires a credible, feasible, decision-relevant alternative rather than global optimization. Once adequacy is achieved and no material failure is established, SHIP may remain valid until material conditions change.

37.11 “The framework is anti-complexity”

Complexity may provide safety, robustness, specialization, and capability. PPDC asks whether complexity remains connected to the problem and whether each component adds decision-relevant value against a credible baseline.

37.12 “High governance burden is the price of safety”

Safety and rights protection belong in the benefit profile. Governance work is counted because it is part of the system, not because it is presumptively excessive. The relevant comparison applies equivalent safety and rights floors to alternatives.

37.13 “The governance layer can become the new failure”

This is a core design risk. PPDC must apply to itself through metric minimality, governance budgets, expiration, simplification authority, marginal-component testing, and reauthorization.

37.14 “The framework merely collects existing ideas”

Every major component has antecedents. Independent status depends on whether the integrated derivation, permission endpoint, break taxonomy, and reliability distinctions add measurable value. If they do not, the framework should be reclassified rather than defended rhetorically.

37.15 “Software cannot govern institutions”

Software cannot manufacture legitimacy, jurisdiction, public purpose, professional competence, or moral judgment. PPDC is not a software specification. LoopGuard-AI is one candidate implementation of part of the structure. The complete regime remains socio-technical and institutional.

38. Falsification and Rejection Conditions

The framework should be narrowed, reclassified, or rejected where one or more of the following persist after reasonable development:

analysts cannot identify governance objects reliably;
component states or derivation relations cannot be coded consistently;
DB categories overlap without discriminant value;
PPDC cannot be distinguished from general governance maturity;
the permission endpoint adds no analytical value;
formal and real authority cannot be measured meaningfully;
correction sovereignty cannot be operationalized;
completeness does not predict implementation;
derivation breaks do not predict recurrence or latency;
PPDC-derived interventions do not outperform simpler controls;
the framework adds burden without correction improvement;
adjacent frameworks explain the same cases without loss;
the construct works only in development cases or cannot transfer across domains.

PPDC should not be protected by adding concepts indefinitely. Its own correction requirement applies to its theoretical status.

39. Scope and Maturity

PPDC is designed primarily for socio-technical AI regimes in which evidence is interpreted, permission is allocated, authority is distributed, system states can change, and consequences can be reviewed. It is especially relevant to model release, agent action, tool use, workflow transition, high-impact classification, institutional deployment, runtime restriction, rollback, and reauthorization.

It is not a complete theory of justice, morality, democratic legitimacy, intelligence, consciousness, model alignment, cybersecurity, safety engineering, or institutional effectiveness.

The strongest permissible claim is:

Problem-to-Permission Derivation Completeness is a candidate necessary condition for stable, reviewable AI governance.

It is not sufficient.

Current maturity is:

Conceptual maturity: advanced candidate framework;
Measurement maturity: pre-validation;
Empirical maturity: unvalidated;
Formal maturity: conceptual formalization;
Governance maturity: candidate design and audit framework;
CEP maturity: optional candidate explanation of equilibrium-like persistence;
LoopGuard-AI maturity: concept-stage to architecture-stage unless stronger evidence is attached.

The article may claim that it defines PPDC and the Governance Derivation Instance, distinguishes nodes from derivation relations, introduces the implemented permission endpoint and DB1–DB12, separates completeness, adequacy, operational completion, and reliability, and proposes measurements, hypotheses, and rejection conditions. It should not claim prevalence, validated constructs, established causality, universal thresholds, empirically validated CEP, production-validated LoopGuard-AI, guaranteed stable governance, or first discovery of every component.

The defensible status is:

PPDC is an advanced candidate integrative governance-design theory and measurement framework with substantial conceptual development, explicit reduction and falsification conditions, preliminary measurement architecture, and no completed empirical validation.

40. Conclusion — The Key to a Stable Governance Layer

AI governance is often identified through what can be seen: policies, evaluations, metrics, dashboards, review boards, audits, explanations, human oversight, and release gates. These structures may be necessary. They do not answer the central question:

How did the governance regime move from a defined human or institutional problem to what the AI system is now permitted to do?

The answer requires more than a control. It requires a derivation.

Interested Parties → Need Profiles → Decision Problem → Failure Structure → Signals → Metrics → Thresholds → Authority → Gate → Implemented Permission State → Correction and Reauthorization

Without a problem model, the system may optimize an inherited or concealed objective. Without a failure structure, monitoring tracks symptoms. Without admissible signals, the mechanism remains invisible. Without interpretable metrics, evidence remains unstructured. Without justified thresholds, measurement remains descriptive. Without authority, review remains advisory. Without a gate, judgment remains indeterminate. Without implementation, the gate remains symbolic. Without correction and reauthorization, the permission state becomes closure.

For bounded governance instance g:

PPDC_g ⇔ P_g ∧ F_g ∧ S_g ∧ M_g ∧ Θ_g ∧ A_g ∧ G_g ∧ I_g ∧ R_g

and:

∧ _(i=1)^(|N_g|-1)τ(x_i,x_(i+1))

The requirement is conjunctive because the missing link matters. A sophisticated metric cannot replace authority. Authority cannot replace evidence. Implementation cannot replace justification. Accountability cannot replace correction.

The framework preserves four non-equivalent questions:

Can the derivation be reconstructed and defended?
Are its premises and decisions substantively adequate?
Did the permission state change?
Does correction remain functional under repetition and pressure?

The key to a stable governance layer is not maximal control, maximal monitoring, maximal human involvement, or one universal authority. It is preservation of a reviewable relation among the problem being governed, the evidence used, the actors authorized to decide, the state the system is permitted to occupy, and the means by which that state can be corrected.

A stable AI governance layer becomes possible only where every material permission can be traced backward to a defined problem and forward to an implemented, reviewable, and correctable system state.

Even this does not guarantee wisdom, justice, safety, or truth. It establishes something operationally prior: the regime can explain why it acts, show who may change its action, prove whether the change occurred, and remain vulnerable to valid correction.

That is not stable governance completed.

It is the architecture that makes stable governance testable, correctable, and institutionally accountable to its own derivation.

—

Glossary

ADM — The administrative, managerial, institutional, organizational, regulatory, or governing side of a decision regime.

CIV — The civil, exposed, dependent, human-facing, or cost-bearing side of a decision regime.

Civil Corrective Capacity — The capacity of the exposed side to understand, contest, correct, and reorient an AI-mediated decision regime.

Correction Sovereignty — Control over whether criticism, anomaly, harm, appeal, or feedback becomes a valid regime-changing correction signal.

Corrective Intelligence — Capacity to convert evaluation or failure evidence into changed permission, authority, restriction, escalation, redesign, or rollback.

Derivation Break — A specified failure in the path from problem model to permission, coded DB1–DB12.

Derivation Relation — A documented, reviewable, and defeasible relation connecting adjacent PPDC components.

Governance Derivation Instance — A bounded unit g=(o,d,a,b,T) containing a governance object, domain and purpose, AI or workflow configuration, baseline, and period.

Governance Derivation Record — The structured empirical record used to reconstruct and code one Governance Derivation Instance.

Governance Reliability — Stable corrective capacity under repeated pressure, uncertainty, authority conflict, incentive conflict, reversibility limits, and change.

HOLD — A permission state suspending ordinary action until specified evidence, authority, risk, or correction conditions are resolved.

Implemented Permission State — The verifiable operational state resulting from a gate determination.

Need-Profile Substitution — Replacement of one interested party’s need by another party’s metric, proxy, or operational objective.

Operational Completion — Completion of the authorized transition from gate determination to implemented state.

PPDC — Problem-to-Permission Derivation Completeness: the condition in which all required nodes and derivation relations from problem model to implemented and reviewable permission are present.

Problem-to-Permission Derivation Requirement — The requirement that a governance regime demonstrate PPDC for a bounded governance instance.

Public-Grammar Sovereignty — Control over the categories and language through which a decision regime appears neutral, objective, safe, efficient, or legitimate.

RESTRICT — A permission state allowing continuation only under defined limitations.

ROLLBACK — A permission state requiring reversal, withdrawal, disabling, downgrade, termination, or restoration to a safer prior state.

SHIP — A permission state authorizing continuation under defined scope and conditions.

Soft Closure — Procedural openness without operative vulnerability to valid correction.

Substantive Adequacy — Defensibility of the problem model, evidence, metrics, thresholds, authority, and permission decision.

—

References

Aghion, P., and Tirole, J. (1997). “Formal and Real Authority in Organizations.” Journal of Political Economy, 105(1), 1–29. https://doi.org/10.1086/262063Denney, E., Menzies, J., and Pai, G. (2023). Dynamic Assurance Cases: Closing the Loop Between Design and Operational Assurance. NASA Technical Reports Server, Document 20230007853.Dunavich, B. (2026a). “Before the Agent: Agentic AI Decision-Problem Design.” RATIUM.AI. https://www.ratium.ai/articles/before-the-agent-agentic-ai-decision-problemDunavich, B. (2026b). “ADM/CIV and the Epistemic Problem of AI Governance: Decision Sovereignty, Correction Sovereignty, and Public-Grammar Sovereignty.” RATIUM.AI. https://www.ratium.ai/articles/adm-civ-ai-governance-civil-corrective-capacityDunavich, B. (2026c). “A Hidden Split in Formal Reason: Cognitive Duality, Corrective Intelligence, and AI Governance Reliability.” RATIUM.AI. https://www.ratium.ai/articles/hidden-split-in-formal-reasonDunavich, B. (2026d). “Robert J. Aumann, CEP, and LoopGuard-AI: A Personal and Game-Theoretic Account.” RATIUM.AI. https://www.ratium.ai/technical-reference-dossiers/aumann-cep-loopguard-aiDunavich, B. (2026e). “The Foundational Problem of Low Consistency in Language Models under S4 Conditions.” RATIUM.AI. https://www.ratium.ai/foundational-source-dossier/language-model-consistency-s4-loopguardDunavich, B. (2026f). “LoopGuard-AI Technical Source Dossier: AI Governance Architecture, Runtime Control, and Engineering Reference.” RATIUM.AI. https://www.ratium.ai/technical-reference-dossiers/loopguard-ai-technical-source-dossierDunavich, B. (2026g). “The AI Configuration Paradox.” RATIUM.AI. https://www.ratium.ai/articles/ai-configuration-paradoxInternational Organization for Standardization and International Electrotechnical Commission. (2023a). ISO/IEC 42001:2023 — Information Technology — Artificial Intelligence — Management System.International Organization for Standardization and International Electrotechnical Commission. (2023b). ISO/IEC 23894:2023 — Information Technology — Artificial Intelligence — Guidance on Risk Management.Leveson, N. G., and Thomas, J. P. (2018). STPA Handbook. Massachusetts Institute of Technology.NASA. (2023). “Requirements Management.” NASA Systems Engineering Handbook Online Reference, Section 6.2.Rushby, J., Xu, X., Rangarajan, M., and Weaver, T. L. (2015). Understanding and Evaluating Assurance Cases. NASA/CR-2015-218802.SCSC Assurance Case Working Group. (2021). Goal Structuring Notation Community Standard Version 3. Technical Report SCSC-141C, Safety-Critical Systems Club.Slagel, J. T., White, L. M., Dutle, A., Munoz, C. A., and Crespo, N. (2024). “A Verification Framework for Runtime Assurance of Autonomous UAS.” 43rd Digital Avionics Systems Conference. NASA Technical Reports Server, Document 20240007986.Tabassi, E. (2023). Artificial Intelligence Risk Management Framework (AI RMF 1.0). NIST AI 100-1. National Institute of Standards and Technology. https://doi.org/10.6028/NIST.AI.100-1Yurrita, M., Verma, H., Balayn, A., Alfrink, K., Gadiraju, U., and Bozzon, A. (2025). “Identifying Algorithmic Decision Subjects’ Needs for Meaningful Contestability.” Proceedings of the ACM on Human-Computer Interaction, 9(7), Article CSCW234. https://doi.org/10.1145/3757415

Author’s Evidence and Claim-Control Note

The article distinguishes definitions, conceptual formalizations, candidate mechanisms, architecture proposals, empirical hypotheses, and validation targets. Equations are used to preserve distinctions and specify research objects; they are not estimated models or proofs of empirical prevalence. CEP is optional to PPDC, S4 is a bounded diagnostic hypothesis, and LoopGuard-AI is a candidate reference architecture rather than a validated product. Publication of this manuscript does not alter those evidence boundaries.

Related Source and Reference Pages

This article belongs to the public essay layer of RATIUM.AI. For readers who want to move from this article into the broader source, technical, and orientation layers of the project, the following pages provide the relevant entry points.

Articles

The articles page gathers the public essay layer of RATIUM.AI, including arguments on stable AI governance, decision-control architecture, visible governance versus real authority, universal reason, technical competence, purpose governance, and the doctoral-scale framing of CEP.