Mathematically Governed AI

Ship the AI that's
stuck in pilot.

It works in every demo. But risk and compliance won't sign off, and your own people won't trust it, because the best anyone can say is it's right 99% of the time, and everyone hears the other 1%. We make AI prove what it will and won't do, in math, before it runs, with a signed record anyone can re-verify.

Built for the controls the regulator is asking forEU AI ActColorado SB 26-189NYDFSNAICNIST AI RMF
Decision ReceiptLive
Enterprise · Customer Operations
actionissue_refund($2,400)
calleragent.support.v3
invariant · proven in Lean 4refund_policy.rev9
same answer tomorrow?yes · byte-identical
verdictADMITTED
ADMITTEDsigned · key we don't hold
Where this goes

One architecture. Rising stakes.

We start where being wrong costs money. The same code governs the machine that moves, and the system that runs on its own, because the proof is built into how the software is constructed, not bolted onto a category. The higher the stakes climb, the more the world needs proof instead of a guess.

Nowwrong costs · money

AI in applications

Rules, decisions, governance and audit for regulated enterprises moving AI from demo to production. A wrong answer is a denied claim that ends up in court, or a pilot that never ships.

issue_refund($2,400)Admitted
Shipping today
Arrivingwrong costs · bodies

AI in machines that move

The same code that governs an agent inside an application governs an agent inside a machine that moves. Once AI has a body there is no after to inspect, you can't filter a kick once it lands.

actuator.move(θ=42°)Refused · pre-action
Where this goes
The destinationwrong costs · lives

Autonomous, mission-critical systems

Industrial and medical systems that act on their own. Same architecture, same proof, because the guarantees are built into how the software is constructed, not bolted onto a category.

dose.administer(2.5mg)Admitted
Where this goes
Same gate. Same signed receipt. Only the cost of being wrong changes.

The warning shots are already public: a Unitree humanoid roundhouse-kicked a child at a June 2026 demo; another flailed beside a factory worker in 2025. Goldman puts the humanoid market at $38B by 2035, every unit running the same unprovable software the enterprise is wrestling with today. The same guess that costs a refund today drives a motor tomorrow.

How it works

We prove the math before we write the code.

It is the inversion of how software is built today. Instead of writing code and then testing and hoping, you start with what must always be true, prove it, and compile the proven rules into the app as constraints it cannot break. We call the discipline the Mathematical Autopsy.

01

Define the intent as math

Write down what must always be true, the invariants the system can never violate, as formal statements, not prose policy.

02

Draft and check the proof

An AI proof-drafting assistant proposes the math and the proof; the Lean 4 kernel, a public proof system, checks it. The AI does the flexible part; the math is what we trust.

03

Compile the rules in

The proven rules become constraints compiled into the code, not a guardrail bolted on top, but part of how the software is built.

04

Gate the action, sign the decision

The gate runs before the action fires, enforced at every step. Each decision writes a signed Decision Receipt your auditor can replay on a clean machine.

The old waywrite code → test → hopeOursprove the math → compile → enforce
The Decision Receipt

Two receipts. One unbroken chain.

Proof comes in two parts. Once, when the software is built, a rule is proven in math and sealed into the runtime, the Receipt of Truth. Then, every time a decision fires, the gate stamps a signed Decision Receipt that points straight back to it. One proves the rule is sound; the other proves this decision followed it.

Receipt of TruthBuilt once
Rule · a borrower over their proven debt limit is never auto-approved
lemma → proofLean 4 · kernel-checked ✓
invariantfair_lending.reg_b
notebook + scorecardhashed · SHA-256
chainnotebook ⊕ scorecard ⊕ invariant
sealed intoruntime header · 0x9f2c…a1
Sealedchange one byte, the hash breaks
Decision Receipt#LN-2231
Runtime · Banking · Consumer Lending
actionextend_credit($45,000)
caller · momentagent.lending.v3 · 14:22 UTC
governing rulefair_lending.reg_b
verdictADMITTED · pre-action
reproducible?same inputs → same decision
signaturecustomer's key
Admittedevery decision · signed · re-verifiable without us
Walk it backward — anyone can, without us
Lemmathe rule, written as plain math
Lean 4 proofkernel-checked — or it doesn't compile
Invariantthe runtime is required to enforce it
SHA-256 chainnotebook + scorecard + invariant, hashed
Runtime headerthe chain is sealed into the binary
Decision Receiptat runtime, points back to that invariant
01

Verify the signature

Check it against the customer's public key. This decision really happened, and was signed at that moment, not backdated after a complaint.

02

Follow the rule to the math

The named invariant traces back through the hash chain to the Lean 4 proof. Re-check the proof yourself, with the public kernel.

03

Replay the inputs

Feed the recorded inputs through the sealed runtime on a clean machine. Same inputs, same decision, every time.

AICP · mission control
A Decision Receipt in AICP — a denied action, signed and reproducible

The same artifact, live in the product, here AICP refuses a push to a protected branch, signed sha256, deterministic and verifiable offline.

A log

“Here is what we say happened.” Written after the action, by the vendor. You trust whoever wrote it.

A Decision Receipt

“Here is what happened — re-derive it yourself.” Produced as the decision is made, signed with your key. You trust no one.

A log is only as trustworthy as the system that wrote it. A receipt is only possible because the system is deterministic, the same inputs return the same decision, forever. Determinism is what turns a record into proof.

SMARTHAUS never holds the receipts and never holds the signing key, we cannot produce, alter, or backdate one. A receipt covers the action that passed through the gate; anything routed around the gate leaves no receipt, and that absence is itself the tell.

The beachhead — not the ceiling

Where it bites first.

We start where the pain is sharpest and the proof requirement is hardest: regulated enterprises trying to move AI from demo to production. Banking, insurance, health plans, where a wrong answer costs money or lives, and no one will sign off on "right 99% of the time."

Take the sharpest version, an AI wealth advisor. Watch the math catch a bad recommendation before it ships:

What's actually happening
Client

“I'm 34. I want to retire by 60, but I panic when the market drops 20%.”

profile → risk_tolerance: LOW · horizon 26y
The AI proposes · the flexible part

85 / 15, growth-tilted

The gate · the math

invariant suitability.risk_tolerance, proven in Lean 4: low tolerance ⇒ equity ≤ 70%

85% equity exceeds the proven cap.

Refused · before the client saw it
The AI re-proposes

65 / 35, inside the proven bound

Delivered

The client gets a suitable portfolio.

Admitted · 38ms · signed

An unsuitable recommendation wasn't filtered out after the fact. It was impossible to deliver.

What gets risk & compliance to sign off
Decision Receipt#WA-0472
01Captured the profile.risk_tolerance: low · horizon 26ycomplete
02AI proposed 85 / 15.checked against suitability.risk_tolerance, proven in Lean 4checked
03Refused 85 / 15.equity 85% > proven cap 70%, blocked before deliveryrefused
04Admitted 65 / 35.inside the proven constraint, so it firedadmitted
# same profile in → same decision out run 2026-04-30 → refuse 85/15 · admit 65/35 run 2026-06-18 → refuse 85/15 · admit 65/35 ✓ byte-identical # deterministic — no drift, no one favored
# your auditor, their machine, without us $ smarthaus verify WA-0472 signature ✓ valid (customer-held key) invariant ✓ re-proven (Lean 4) VERIFIED
Admitteddecided in 38ms · signed
The model does the flexible part. The proven math decides what's allowed to ship.
The same shape, every vertical — experience on top, proof underneath
AI loan officerfig 01

Instant, personalized credit.

A real number in seconds, tailored to the full file, not a thin score.

The catchProve fair-lending: identical treatment, a real reason for every decline.
fair_lending.reg_bAdmitted
AI account conciergefig 02

“Move $200 when I'm over $5k.”

Plain-language banking that just acts, transfers, sweeps, alerts.

The catchProve every action stayed inside the customer's authority, before it fires.
daily_external < 5,000Admitted
AI dispute resolverfig 03

Disputes settled same-day.

Adjudicated on the spot instead of a ten-day wait.

The catchEvery decision replayable for the regulator, identical inputs to outcome.
dispute.eligibilityAdmitted
AI wealth advisorfig 01

Forty-question depth, for everyone.

Top-of-market advice, delivered at scale and personalized per client.

The catchProve it was suitable, the same tomorrow, and no one favored.
suitability.risk_toleranceAdmitted
AI retirement plannerfig 02

A plan that adapts as life does.

Same engine, longer horizon, contributions, drawdown, tax in view.

The catchProve projections stayed inside assumptions the client agreed to.
assumption.boundsAdmitted
AI portfolio rebalancerfig 03

Always on-mandate.

Autonomous rebalancing the moment the portfolio drifts off target.

The catchProve it physically cannot breach the mandate, not that it usually won't.
mandate.invariantAdmitted
AI claims adjudicatorfig 01

Claims decided in minutes.

An answer, and a clear reason, while the loss is still fresh.

The catchProve no unfair denial, every decision replays for the regulator.
claim.eligibility.provenHeld · review
AI underwriterfig 02

Priced to the risk, instantly.

Bind-ready pricing in one sitting instead of a referral queue.

The catchProve no proxy discrimination, same inputs, same price, every time.
no_proxy_discriminationAdmitted
AI FNOL triagefig 03

Routed on first contact.

Fast-track, investigate, or escalate, decided at the first call.

The catchProve eligibility and routing followed policy, with a record for each.
routing.policy.provenAdmitted
The operators

Four products. One layer ties them together.

A control plane and an inference engine you run, a custom runtime we build, and the engine you build it all with, all inside your environment. Your compute, your keys. Above them, Operation Center turns every receipt into evidence a regulator will accept.

Operation Center · Mission Control
SMARTHAUS Operation Center — Mission Control, every receipt fleet-wide

Operation Center, every receipt, every runtime, fleet-wide. Live governed activity, fleet convergence, and a board-ready evidence trail.

Start free

Run governed AI on your own machine today.

Studio Loge is free, forever. Download it, point it at your flow, and watch every action get a signed Decision Receipt, no sales call required. Enterprise adds Operation Center and fleet-wide evidence.

Why us

Everyone else watches the output. We prove the action before it fires.

The rest of the market observes outputs after they exist, a probabilistic classifier with its own error rate, a dashboard that alerts after the trade, the denial, the email has already fired. You can't catch a wrong action by watching for it; by the time you've seen it, it has already happened.

The rest of the market
  • Watches the output after it has been produced.
  • The control is itself AI, its own error rate, its own failure modes.
  • Works only inside one platform; reconstruction depends on the vendor.
  • By the time it flags a bad output, the action has already committed.
SMARTHAUS
  • Admits or refuses the action before it fires, deterministic, the same answer every time.
  • Checks invariants proven in Lean 4, math, not another model's judgment.
  • The proof is portable, signed with a key we don't hold, re-verifiable off-platform with no SMARTHAUS in the loop.
  • If a rule would break, the action never fires. Built in by construction.
Why it holds

Six ways enterprise AI breaks. One cause.

Every failure that kills a pilot traces back to one fact: the model guesses. You cannot govern a guess with another guess, so we prove the math before we write the code.

F1 · Prove
Compounding error

Six chained steps at 95% each land at 73%. The pilot doesn't survive the math.

F2 · Replay
Production drift

The action fires before the dashboard alerts. Correct in pilot, decayed in production.

F3 · Bind
Eval-set rot

Tests pass while production fails. The scores stopped tracking reality.

F4 · Prevent
Governance after the fact

The control logs the destructive action after it ran. Too late to matter.

F5 · Specify
No proof object

You cannot prove what a single decision did. Courts now enforce that gap.

F6 · Lead
Policy isn't evidence

Regulators no longer accept a document. The control has to be in production.

Seven properties — by construction, not by promise
ReproducibleTraceableExplainableAuditableReplayableFalsifiableVerifiable
120+ theorems machine-proven500+ invariant rules1,000+ verification testsThree patents filed
Built for the controls — as of 8 June 2026
EU AI Act · Reg. 2024/1689Colorado AI Act · SB 26-189NYDFS · CL 2024-7NAIC · Model BulletinNIST · AI RMF 1.0ISO/IEC · 42001FDA · AI/ML SaMDHIPAA · 45 CFR 164

Ship it with proof.

“You cannot govern a guess with another guess.”

The clock is running — Colorado SB 26-189 · Jan 2027 · EU AI Act high-risk · 2027