Skip to content

Intent-Based Inference Primitive

Sermo Arbiter Inferentiae Determinata (SAID)

The determined arbiter of inference discourse.

What it does

Governed inference, not just generated output.

SAID selects and executes LLM inference with mathematical guarantees. It doesn't just run a model — it ranks, routes, enforces, and receipts every inference decision.

What other inference apps do

You pick a model manually. You run it. You hope the output is right. No selection rationale. No audit trail. No governance. If it fails, you try a different model.

What SAID does

Ranks models across 6 dimensions using multi-objective optimization. Routes requests deterministically. Validates output against formal constraints. Retries on violation at zero cost on local GPU. Produces HMAC-signed decision receipts for every inference. Fails closed on contract violations. Same inputs, same model selection, every time.

What no other inference tool does

Nine capabilities. Zero of them exist anywhere else.

Every capability below is something no other local or cloud inference tool provides. Each one starts with what exists today — and what SAID does instead.

Others: you pick a model manually

Pareto-optimal model selection

No other inference tool mathematically selects models. SAID uses NSGA-II evolutionary optimization across cost, speed, quality, privacy, capacity, and regulatory compliance — computing a Pareto frontier and selecting via weighted Chebyshev scalarization. The selection is deterministic, reproducible, and auditable.

Others: inference is single-shot

Generate-validate-retry

No other inference tool validates output before returning it. SAID passes every inference through a gate engine with formal constraint checking. Contract violations trigger structured retry with feedback enrichment. On local GPU, retries are zero-cost — enabling aggressive constraint stacking that would be prohibitively expensive through cloud APIs.

Others: no constraint enforcement

Formal constraint system

No other inference tool enforces output contracts. SAID supports set membership, regex, range, cross-reference, and composite constraints — defined in YAML, enforced deterministically. Every attempt, verdict, violation, and remediation path is logged. The gate function is pure: same output always produces the same verdict.

Others: no record of decisions

HMAC-signed decision receipts

No other inference tool cryptographically signs its decisions. Every SAID inference produces an HMAC-signed receipt: the selected model, the full score breakdown across all dimensions, the applied rules, and the hyperparameter snapshot. Replay any decision after an incident. Prove why that model was chosen.

Others: local and remote are separate

Unified selection across local and remote

No other inference tool evaluates local and remote models in the same mathematical framework. SAID runs one selection calculus across llama.cpp, MLX, PyTorch, Metal, and CUDA locally — and OpenAI, Anthropic, Groq, Mistral, and Cohere remotely. A local MLX model competes against a cloud API on the same Pareto frontier.

Others: errors are undefined

Deterministic failure doctrine

No other inference tool distinguishes contract failure from operational failure. SAID defines two explicit failure states: contract violations fail closed (TERMINATED), operational faults degrade gracefully (DEGRADED). Deterministic state transitions. No partial states. No silent failures. Security boundaries are never silently crossed.

Others: every request runs unconditionally

Runtime governance predicates

No other inference tool gates requests before execution. SAID enforces rule-based filtering at inference time: task type, token count, time-of-day, provider scope. Weight overrides, forced model assignment, and block-with-reason on constraint violation. Frozen request context prevents mid-flight mutation.

Others: no reproducibility guarantee

Replay equivalence

No other inference tool guarantees reproducible model selection. SAID logs every decision with full reasoning and score components. Seeded execution with determinism fingerprinting. Same inputs, same model selection, every time. This is what makes incident forensics possible.

Others: governance bolted on top

OpenAI-compatible API

Drop-in replacement interface — your existing code works unchanged. But unlike proxy layers that add governance after the fact, SAID builds selection, validation, constraint enforcement, and audit into the inference path itself. Governance is the engine, not a wrapper around it.

How it ships

Desktop app. Embeddable runtime. Same engine.

SAID Studio (Loge / Forge)

Standalone desktop app. One user, one device, no cloud dependency. Loge is the free community tier — a full, usable product floor. Forge is paid with advanced local capabilities.

SAID Enterprise Client

Managed local endpoint syncing governance from SMARTHAUS Operation Center. Policies decided centrally, inference enforced locally on the device.

SAID Runtime

Sealed runtime embeddable into enterprise applications via Python SDK or HTTP API. Same engine, same guarantees — inside your system.

See SAID running governed inference.