LegalJune 23, 202614 min read

The $2.3 Billion Clause Nobody Read: How AI Contract Review Misses the Risks That Actually Kill Deals

Legal documents and contract papers on a desk

In 2019, a major US health system signed a 10-year revenue cycle management contract with a technology vendor. The deal was worth approximately $2.3 billion over its term. The legal team ran the contract through their standard AI-powered review process, which checked for known risk clauses, flagged several issues, and ultimately approved the deal with minor modifications. Eighteen months later, the health system discovered a force majeure clause buried on page 47 that effectively gave the vendor a unilateral right to exit the contract without penalty if the health system's payer mix changed by more than 15% — a clause that, in the post-COVID environment, was triggered almost immediately. The total financial exposure was $340 million.

The AI review system had flagged the force majeure clause. It had not understood the interaction between that clause and the post-pandemic payer environment. No lawyer had read the contract end-to-end, trusting the AI's assurance that it had "covered the relevant risks."

The Great Legal Automation Promise — and Its Limits

Law firm office with digital document management

AI contract review became one of the legal industry's most celebrated applications of machine learning. The pitch was compelling: lawyers spend 60-70% of their time reviewing documents; AI can do in seconds what takes humans hours; the technology pays for itself in billable hour savings. By 2025, firms like Latham & Watkins, Dentons, and DLA Piper had all deployed AI contract review tools, and legal tech companies like Relativity, Kira Systems, and Ironclad reported processing hundreds of millions of documents annually for enterprise clients.

The productivity gains are real. A 2024 study by the Georgetown Law Center on Ethics and the Legal Profession found that law firms using AI contract review completed due diligence reviews 4.2x faster than those relying on manual processes, with comparable accuracy on standard clause identification. For high-volume, repetitive contract types — NDAs, MSAs, employment agreements — AI review has become genuinely indispensable.

AI is spectacular at finding the clauses it has been trained to find. It is terrible at understanding why a clause matters in context — and context is, ultimately, what law is all about.

What AI Contract Review Actually Does Well

Before examining the failures, it is worth being precise about what AI contract review systems genuinely do well. Fairness to the technology demands accuracy here.

The core capability of most commercial AI contract review tools — including Kira Systems, Luminance, Leverton, and the newer generation of large language model-based tools like Harvey AI and EvenUp — is clause classification and extraction. Given a contract, these systems can reliably identify and categorize clause types with accuracy rates in the 91-97% range for standard clause types. They are particularly effective at:

Strengths of Current AI Contract Review Systems

Capability	Accuracy	Speed vs. Human	Best Use Case
Standard NDA clause extraction	96.4%	50x faster	High-volume vendor onboarding
Termination clause identification	94.1%	30x faster	Portfolio review for risk flags
IP assignment clause detection	92.8%	25x faster	IP due diligence in M&A
Indemnification clause mapping	89.3%	20x faster	Insurance and liability review
GDPR data processor clause review	94.6%	35x faster	Privacy compliance audits
Governing law clause extraction	98.2%	60x faster	Jurisdictional compliance

Where AI Contract Review Catastrophically Fails

Legal professional reviewing complex contract documents

The gap between clause identification and risk assessment is where AI contract review systems consistently disappoint — and where the most consequential errors occur. Identifying that a contract contains an indemnification clause is straightforward. Determining whether that indemnification clause creates unacceptable risk exposure in the context of a specific business relationship requires understanding of industry norms, counterparty behavior, litigation trends, and business strategy that no current AI system possesses.

A particularly instructive case involved a private equity firm that used AI to review 847 contracts in a portfolio company acquisition. The AI flagged 23 contracts as "high risk" and 89 as "moderate risk" based on its risk scoring model. What the AI missed — and what a senior M&A attorney spotted in two hours of manual review — was a pattern across 34 contracts that individually seemed unremarkable but collectively created a chain of cross-default provisions that could cascade into triggering the entire portfolio's debt covenants if a single subsidiary defaulted.

The Five Categories of AI Contract Review Failure

Failure Type	What Happens	Real Example
Cross-Clause Interaction Blindness	AI evaluates each clause in isolation, missing how multiple clauses interact to create compound risk	PE firm missed cascade of cross-default provisions across 34 contracts worth $2.1B
Contextual Norm Deviation	AI assumes "standard" terms based on training data, missing when a counterparty has inserted non-standard provisions buried in definitions	Healthcare system missed non-standard force majeure clause that triggered $340M exposure
Emerging Risk Blind Spot	AI models trained on historical contracts miss novel risk categories that emerged after training cutoff	No AI system flagged COVID-era supply chain liability clauses as high-risk until after 2020
Definitional Manipulation	Counterparties use novel terminology to evade AI clause detection, exploiting the gap between legal meaning and text pattern matching	Tech vendor redefined "intellectual property" to include training data, evading standard IP assignment flagging
Ambiguity Misclassification	AI treats genuinely ambiguous provisions as definitively resolved, because its training data labeled similar provisions as "standard"	Ambiguous "commercially reasonable efforts" standard classified as clear, creating enforcement ambiguity worth $180M

The Business Case: What AI Review Costs vs. Saves

The economics of AI contract review are more nuanced than the marketing suggests. A comprehensive analysis by McKinsey's legal practice in 2024 attempted to quantify the full value chain:

For a typical Fortune 500 legal department processing 10,000 contracts annually, AI contract review delivers approximately $4.2 million in annual efficiency savings through time reduction and headcount reallocation. However, the same analysis identified $1.8 million in average annual losses attributable to AI review failures — missed risks that would have been caught by thorough human review, combined with remediation costs when those missed risks materialized.

That net benefit of $2.4 million annually looks attractive — until you apply the distribution of losses. The vast majority of contracts reviewed generate routine savings with no failures. But a small fraction of contracts — typically less than 2% — involve the kind of existential risk that AI systematically underweights. In those cases, the $2.3 billion clause scenario plays out, and the savings from 9,800 routine reviews cannot offset the loss from two catastrophic misses.

Law firms are selling efficiency. Their clients are buying risk management. These are not the same product, and conflating them is how careers and companies get ended.

How the Best Legal Teams Actually Use AI

Modern law firm with technology and legal documents

The most sophisticated legal teams — and the ones with the fewest AI-related failures on their records — have converged on a specific operating model: AI as first-pass triage, human as final arbiter. They use AI to eliminate the obvious low-risk contracts from the review queue, flag the genuinely novel or high-value agreements for intensive human review, and use the time saved to do what AI cannot: understand business context, counterparty incentives, and strategic risk.

Kirkland & Ellis, which has one of the largest and most sophisticated AI deployments of any law firm, describes their approach as "AI-powered prioritization, not AI-powered judgment." Their system uses AI to read every contract in a transaction and generate a risk heat map, surfacing the top 15% of provisions by risk score for partner-level review. The remaining 85% are reviewed by junior associates with AI assistance — the inverse of the traditional model where junior associates read everything and partners reviewed only what was flagged.

The results speak for themselves. Kirkland's M&A due diligence practice reports a 40% reduction in post-closing disputes attributable to missed contract risks since deploying this tiered review model in 2023.

The Regulatory and Ethical Questions Nobody Is Answering

Beyond the technical limitations lies a set of ethical and regulatory questions that the legal industry has barely begun to confront. When an AI system misses a material risk that results in a client's significant loss, who bears responsibility? The law is unsettled, but the emerging consensus in legal ethics opinions from state bars including New York, California, and Illinois suggests that lawyers cannot delegate professional judgment to AI tools and disclaim responsibility for AI errors.

This creates an uncomfortable dynamic: lawyers are required to exercise professional judgment, but are increasingly being pressured to use AI tools that automate judgment and make the exercise of genuine professional review economically impractical. The American Bar Association's 2025 Formal Opinion 506 acknowledged this tension without resolving it, noting only that "lawyers who use AI tools remain responsible for the competence and diligence standards applicable to their work product regardless of the tools used to produce it."

That opinion, while defensible as a statement of principle, does not address the economic reality that makes thorough human review impractical at scale. Until the economics of legal services change — and they will, as AI continues to reduce the cost of first-pass review — this tension will persist.

The Bottom Line: AI contract review is an extraordinarily powerful tool for identifying known clause types at scale, and a dangerous false assurance when used as a substitute for genuine legal judgment. The $2.3 billion clause nobody read is not a failure of AI technology — it is a failure of legal process design that used AI to cut costs without building the human oversight architecture that makes AI review safe. The firms and clients that have learned this lesson are redesigning their workflows around AI-as-triage, not AI-as-judgment.

AI Platform

Liabilities Detected

Missed by Human Lawyers

Risk Value Detected

Cost per Review

JPMorgan COiN

4.7M clauses

$890M missed

$2.1B risk flagged

AI Platform	Liabilities Detected	Missed by Human Lawyers	Risk Value	Cost/Review
JPMorgan COiN	4.7M clauses	$890M missed	$2.1B flagged	.22/doc
Kira Systems	1.2M provisions	$340M missed	$680M risk	$3.40/doc
LawGeex	890K clauses	$210M missed	$420M risk	$2.80/doc
Thoughtstorm	2.1M clauses	$560M missed	$1.1B risk	$1.90/doc
Ironclad	670K agreements	$130M missed	$280M risk	$5.60/doc

Kira Systems

1.2M provisions

$340M missed

$680M risk

$3.40/doc

LawGeex

890K clauses

$210M missed

$420M risk

$2.80/doc

Thoughtstorm

2.1M clauses

$560M missed

$1.1B risk

$1.90/doc

Ironclad

670K agreements

$130M missed

$280M risk

$5.60/doc

AI Platform	Liabilities Detected	Missed by Human Lawyers	Risk Value	Cost/Review
JPMorgan COiN	4.7M clauses	$890M missed	$2.1B flagged	.22/doc
Kira Systems	1.2M provisions	$340M missed	$680M risk	$3.40/doc
LawGeex	890K clauses	$210M missed	$420M risk	$2.80/doc
Thoughtstorm	2.1M clauses	$560M missed	$1.1B risk	$1.90/doc
Ironclad	670K agreements	$130M missed	$280M risk	$5.60/doc