Why does AI hallucinate on regulatory or legal questions?

Because a language model is trained to produce the most plausible next words, not to retrieve verified facts. It learned the shape of a regulatory answer (an authority, a regulation number, a date) and generates text in that shape whether or not the specifics exist, which is why a fabricated citation looks identical to a real one.

Can better prompting stop AI hallucination on regulation?

Only at the margin. Telling a model to only cite real sources cannot give it facts it has no access to, and it cannot know its training data is out of date. The reliable fix is to connect the model to a live, verified data source it can query at the moment of the question.

What are the most common AI failure modes in compliance?

Fabricated citations, stale text presented as current, drafts treated as binding law, and rules from one jurisdiction quietly blended with another.

How do you stop an AI from making up regulations?

Ground it in a verified, structured, current regulatory data layer, for example via an MCP server, so it retrieves real records with a source, date, and status instead of generating plausible text from memory.

Why AI Hallucinates on Regulatory Questions (and How to Stop It)

Ask a general-purpose AI a regulatory question and the danger is not that it refuses to answer. The danger is that it answers beautifully, and is wrong. It will cite a regulation number, give you a confident summary, and move on, with no signal that any of it was invented. In a brainstorm that is harmless. In a compliance file, it is a problem you can be held accountable for.

This article explains why AI hallucinates specifically on regulatory and legal questions, the failure modes to watch for, why "prompt it more carefully" does not fix it, and the one thing that does.

What a hallucination actually is

A large language model is trained to produce the most plausible next words, not to retrieve verified facts. It has learned the shape of how a regulatory answer reads: an authority, a regulation number, a date, a clause. When you ask a question, it generates text in that shape, whether or not the specific facts exist. The output is fluent precisely because fluency is what it optimizes for. Accuracy is a side effect, not a guarantee.

This is why a hallucinated citation looks exactly like a real one. The model is not lying in any deliberate sense. It is pattern-completing, and a plausible-looking regulation number is a very good pattern completion.

Why regulation is especially prone to it

Three features of regulatory work make it a worst-case scenario for an ungrounded model.

Precision is non-negotiable

Most questions tolerate an approximate answer. Regulation does not. The exact article, the exact threshold, the exact effective date: these are the answer. A model that is "roughly right" has, in practice, told you something false, because in compliance the specifics carry the obligation.

The truth changes constantly

A model knows the world up to its training cut-off. Regulation moves every week: consultations open and close, thresholds are amended, deadlines shift, texts are repealed. Asked about "the latest," the model describes a frozen snapshot of the past and presents it as current.

Status is invisible in prose

The single most important fact about any rule is its status: proposal, consultation, adopted, or in force. That status is metadata, not something reliably encoded in the prose a model trained on. So it will summarize a draft and a binding obligation in the same confident voice.

The model is not bad at language. It is missing the one thing regulatory work runs on: a verified, current, structured record of what the rule says and where it stands.

The four failure modes to watch for

Fabricated citations. A regulation, article, or case number that looks right and does not exist, or does not say what the model claims.
Stale text. An answer that was true a year ago, presented with no indication that it has since changed.
Draft treated as law. A proposed measure described as if it were already binding, or vice versa.
Jurisdiction blending. Rules from one jurisdiction quietly mixed with another, because the model pattern-matched across similar regimes.

Why "prompt it more carefully" does not fix it

The common reflex is to instruct the model: "only cite real sources," "do not make anything up," "tell me if you are unsure." This helps at the margin and fails at the core. The model cannot check a citation it has no access to, and it cannot know that its training data is out of date. Asking it to be careful does not give it the missing facts. You are asking a system with no live connection to the source of truth to police itself against a gap it cannot see.

The same applies to a bigger or newer model. A more capable reasoner reasons better over whatever data it has. If that data is stale and unverifiable, better reasoning produces a more convincing wrong answer, not a correct one.

The fix: ground the model in verified data

Hallucination is not solved by changing the model. It is solved by changing what the model can reach. When an assistant can query a live, authoritative, structured source at the moment of the question, it stops guessing and starts retrieving. The answer carries a real source, a real date, and a real status, because those came from the data, not from the model's imagination.

This reframes the whole problem, and it is the heart of agentic regulatory intelligence: you already have the intelligence in your assistant of choice, what it lacks is the data. Supply a verified regulatory data layer and the failure modes above largely disappear, because the model is no longer the source of the facts.

Stop your assistant from guessing on regulation

Connect Obsidian's verified, tier-0 regulatory data to Claude, ChatGPT, or Cursor and get answers with a real source, date, and status attached. Free tier, two-minute setup.

Explore the Obsidian MCP

What a grounded answer looks like

The difference is visible in one response. Ungrounded, you get a confident paragraph, a citation you cannot verify, and no date. Grounded, you get the specific action, the official source, the exact date, the current status, and a link. The second answer is one you could forward to your auditor without checking it yourself first, which is the entire point.

For the mechanism that lets an assistant reach that data, see RAG vs MCP for regulatory AI. For why the source of that data matters, see what tier-0 regulatory data means.

The takeaway

AI hallucinates on regulation because it is built to sound right, while compliance requires being right, with receipts. The model is not the problem to fix. The missing verified data layer is. Give your assistant that layer and the confident-but-wrong answer stops being a risk you carry.