This theoretical paper proposes a fundamental shift in how generative AI systems should operate in high-stakes domains, arguing that current models' tendency to convert statistical uncertainty into definitive statements erodes democratic discourse. The author introduces a formal "assertibility constraint" requiring AI to provide publicly contestable proof for its claims, directly challenging the industry's reliance on confidence scores and argmax outputs as sufficient for deployment.
Key Takeaways
- The core argument is that generative AI displaces "epistemic agency" by presenting uncertain outputs as authoritative, which is corrosive to public discourse in domains like law, medicine, or policy.
- The proposed solution is a Brouwer-inspired "assertibility constraint": AI may only assert or deny a claim if it can produce a publicly inspectable and contestable certificate of entitlement; otherwise, it must return "Undetermined."
- This creates a three-status interface (Asserted, Denied, Undetermined) that cleanly separates the system's internal reasoning from the public justification of its output.
- The constraint is operationalized by gating standard decision-layer outputs (thresholds, argmax) with "internal witnesses" like sound bounds or separation margins, forcing the system to abstain when such proof is unavailable.
- A key technical lemma shows that "Undetermined" is not a tunable reject option but a mandatory status when no forcing witness exists, making the system's scope of reliable operation explicit and limited.
A Formal Framework for Epistemically Responsible AI
The paper, "Generative AI can convert uncertainty into authoritative-seeming verdicts," presents a philosophical and technical critique rooted in the philosophy of language and mathematics. The author argues that when an AI system generates a text that asserts a factual claim, it performs a "speech act" that enters the space of public reasons. In a democratic society, such acts must be answerable to challenge and justification. Current systems, which often output a single, fluent answer based on the highest probability token (argmax), fail this test because they conceal their internal uncertainty.
The proposed "assertibility constraint" is inspired by intuitionistic logic, particularly the work of L.E.J. Brouwer, where a mathematical statement can only be considered true if a constructive proof (a certificate) can be provided. Translated to AI, a system is only entitled to Assert a claim (e.g., "This patient has condition X") if it can generate a corresponding certificate—a piece of internally verifiable evidence like a sound statistical bound or a clear separation margin in its latent space. The same applies for a Denial. If no such certificate exists, the only responsible output is Undetermined.
Technically, this means modifying the standard inference pipeline. Instead of directly outputting the result of an argmax function over a probability distribution, the system must first check for an "internal witness." This witness acts as the certificate of entitlement. The output is then governed by a formal contract: it must be one of the three statuses, and if it is Asserted or Denied, it must be accompanied by the inspectable certificate. This creates a "time-indexed entitlement profile" that is stable—the system won't flip answers with minor numerical tweaks—yet revisable if new public evidence changes the foundational knowledge.
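The decision-layer gate itself can be sketched in a few lines. In the example below, a top-two separation margin stands in for the paper's "internal witness"; the margin rule, its 0.2 default, and the dictionary certificate format are assumptions made for illustration, not the paper's actual certificate construction.

```python
# A hedged sketch of gating an argmax decision with an "internal witness".
# The separation-margin rule is an illustrative stand-in for the paper's
# certificates (sound bounds, latent-space margins), not its method.
import numpy as np


def gated_decision(probs: np.ndarray, claim_index: int, margin: float = 0.2):
    """Return ("Asserted" | "Denied" | "Undetermined", certificate-or-None).

    probs       -- model's probability distribution over candidate answers
    claim_index -- index of the claim being asserted or denied
    margin      -- required top-1 vs. runner-up separation (assumed value)
    """
    order = np.argsort(probs)[::-1]
    top, runner_up = order[0], order[1]
    separation = float(probs[top] - probs[runner_up])

    # No forcing witness: the top answer does not clearly separate from the
    # alternatives, so the only licensed status is Undetermined.
    if separation < margin:
        return "Undetermined", None

    certificate = {
        "kind": "separation_margin",
        "value": separation,
        "threshold": margin,
        "evidence": f"candidate {top} leads runner-up {runner_up} by {separation:.3f}",
    }
    # The witness forces either an assertion or a denial of the specific claim.
    if top == claim_index:
        return "Asserted", certificate
    return "Denied", certificate


if __name__ == "__main__":
    # Clear separation -> Asserted, with an inspectable certificate.
    print(gated_decision(np.array([0.85, 0.10, 0.05]), claim_index=0))
    # Near-tie -> Undetermined, regardless of which answer argmax would pick.
    print(gated_decision(np.array([0.48, 0.46, 0.06]), claim_index=0))
```

The structural point is that Asserted and Denied are only reachable through a certificate a reader could inspect and contest; nothing in the gate lets fluency or raw probability substitute for one.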
Industry Context & Analysis
This theoretical proposal strikes at the heart of a critical tension in modern AI deployment: the race for capability versus the imperative for reliability. The dominant paradigm, exemplified by models like GPT-4, Claude 3, and open-source giants like Meta's Llama 3, prioritizes generating coherent, helpful, and seemingly confident responses across a vast range of topics. Their interfaces typically provide a single, definitive-sounding answer, sometimes with a disclaimer about potential inaccuracy. This paper argues that such disclaimers are insufficient; the very architecture of authoritative speech is the problem.
The author's framework directly contrasts with the industry's primary approach to handling uncertainty: confidence scores and calibration. For instance, a model might output an answer with a 95% confidence score. However, as research on benchmarks like MMLU (Massive Multitask Language Understanding) shows, even models with high calibrated confidence can be confidently wrong on nuanced or adversarial examples. The proposed "assertibility constraint" moves beyond probabilistic confidence to demand constructive, contestable proof. This is closer in spirit to techniques in formal verification or "proof-carrying code" from secure systems, but applied to the stochastic outputs of neural networks.
This work also speaks to the growing field of AI alignment and safety. While much effort focuses on preventing harmful outputs (e.g., via reinforcement learning from human feedback or constitutional AI), this paper addresses a prior, epistemic harm: the erosion of public trust and agency through unchallengeable automated speech. It aligns with emerging trends toward "uncertainty-aware AI" and "abstention," but imposes a much stricter, formal requirement. The lemma proving that "Undetermined" is mandatory in the absence of a certificate prevents developers from simply tuning the abstention rate for convenience, forcing a hard confrontation with the system's actual limits of knowability.
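That distinction between a tunable reject option and a mandatory status can be sketched in a few lines. The setup below is hypothetical and simplified (it ignores Denied), but it shows the intended contrast: under the constraint, no deployment knob can convert a missing witness into an assertion.

```python
# A property-style sketch (assumed, simplified setup) of the lemma:
# in a reject-option classifier, abstention is a knob; under the
# assertibility constraint it is not.
def reject_option(top_prob, reject_threshold):
    # Classic reject option: the deployer can lower reject_threshold until
    # the system answers, regardless of whether any witness exists.
    return "answer" if top_prob >= reject_threshold else "reject"


def assertibility_gate(witness):
    # Status is a function of witness existence alone; there is no parameter
    # a deployer can tune to turn Undetermined into Asserted.
    return "Asserted" if witness is not None else "Undetermined"


top_prob, witness = 0.62, None   # moderately confident, but no forcing witness
print([reject_option(top_prob, t) for t in (0.9, 0.7, 0.5)])  # tunable: reject, reject, answer
print(assertibility_gate(witness))                            # mandatory: Undetermined
```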
What This Means Going Forward
If adopted, this framework would fundamentally reshape AI product development and regulation in high-stakes fields. For industries like healthcare diagnostics, legal research, and financial compliance, it mandates a shift from AI as an oracle to AI as a reasoning assistant with clearly documented boundaries. Systems would be forced to explicitly declare the scope of questions they can decisively answer with proof, dramatically increasing transparency but potentially reducing their perceived utility in the short term.
The primary beneficiaries would be end-users and democratic institutions, who regain "epistemic agency" by being able to inspect and challenge the warrants for AI-generated claims. Regulators, such as those enforcing the EU's AI Act, could leverage such a framework to define stricter requirements for "high-risk" AI systems, moving beyond vague mandates for transparency to specific technical standards for justifiable output.
For AI researchers and engineers, the path forward involves major technical challenges. The paper's concept of an "internal witness" or certificate needs concrete instantiation for large neural networks. This could drive innovation in areas like self-attention attribution, robust feature extraction, and neuro-symbolic methods that marry neural pattern recognition with symbolic, verifiable reasoning. The field should watch for research that attempts to implement this assertibility constraint, even in a limited domain, and measure its impact on standard benchmarks—not just accuracy, but on metrics of justification quality and user trust. The ultimate test will be whether this philosophical corrective can be translated into practical systems that are both useful and epistemically humble.