althor
Production infrastructure for AI agent systems

The guardrails, orchestration, and audit between a prompt and a production deployment.

Most AI work stops at the prompt. I build what goes underneath — credential brokers, scoped tool access, policy-gated approvals, and the audit layer that makes agents safe to deploy. From regulated-enterprise rollouts to small businesses that just need the automation to actually work.

Multi-model extraction pipeline

agent framework · consensus voting · learning loops

Multi-stage agent pipeline with field-level consensus voting across extraction models. Deterministic validators, Raw → Suggested → Final audit layering, human-in-the-loop correction feeding structured learning loops. Throughput went from 4 to 120 entries an hour — about a 30× lift, and it retired an ongoing contractor spend.

TypeScript Next.js Azure Functions Durable Orchestration Azure SQL Entra ID Bicep IaC Playwright E2E

Enterprise AI governance platform

use-case registry · RBAC · audit trail · Teams + SharePoint embed

Adoption governance for a global workforce — submission, review, tool approval, policy enforcement. Four-tier RBAC, audit trail for council review, native Teams + SharePoint embedding so the tool meets users where they already work.

React 18 TypeScript Vite Azure Functions Static Web Apps Cosmos DB Entra ID Bicep IaC
Scale Global enterprise Volume ~100 submissions / month

Spire

AI-native infrastructure control plane · Rust · single-binary deploy

Operator surface for running agents against real infrastructure without handing them master keys. Policy-gated approvals, per-object audit log, MCP tools with read-only guards and per-server auth boundaries. My own project, in active development.

Rust Axum sqlx SQLite bollard tokio React Model Context Protocol
All essays →
2026-05-17 · Pattern

Entra ID workload identities for agent systems

Four identity options — app-only, on-behalf-of, federated credentials, managed identity — mapped to the audit-trail decision an agent designer is making whether they realize it or not. Plus the five gotchas the wizard doesn't surface.

2026-05-17 · Free PDF

Agent Security Review Checklist

A pre-flight checklist for shipping AI agents into regulated environments. Five layers, yes/no items, decision tree — the same review I run during a Discovery engagement, distilled to a self-serve PDF.

Architecture before code

Auth boundaries, tool scopes, audit surface, and failure modes get modeled before the first function is written. Skip this and you'll rewrite it later, under worse conditions.

Scoped tools, never master keys

An agent that reads invoices doesn't get a credential that can write them. One scope, one tool, one expiry — credentials live in a broker, tools expose narrow surfaces with explicit allowlists.

Audit everything that matters

Every decision, suggestion, and override produces a structured event. Compliance review, debugging, and post-incident work all run against the same surface — not reconstructed from logs after the fact.

Humans own the risky calls

Reversible, low-risk actions auto-run. Anything touching production state, payments, or customer-facing data queues for human approval. The default is "ask."

Documentation that survives me

Every engagement ships with IaC, runbooks, and architecture docs your team can operate without me. Nothing locked behind tribal knowledge. Handoff is a milestone, not an afterthought.

I'm Samuel S. Production AI systems on the Microsoft stack — Azure, Entra ID, M365, Power Platform — with a Rust, TypeScript, and Python sideline for the parts Microsoft doesn't reach. The work above is mine: governance, identity, multi-stage data pipelines, agent infrastructure. Althor is the consulting practice that takes that experience to teams shipping their own production systems — specialized agent-infrastructure work that complements in-house engineering rather than replacing it.

D.C. metro · remote-first · regulated environments preferred.

What is the Model Context Protocol (MCP)?

The Model Context Protocol is an open transport for connecting AI agents to external tools, data sources, and services. An MCP server exposes a set of tools (typed function calls) and resources (read-only context) to a client — the agent — over stdio or HTTP. The protocol itself is small and well-specified; the interesting decisions are upstream of it: what each server's trust boundary is, which tools it exposes, how outputs are bounded, and how auth is resolved. In production, treat each MCP server as a security boundary, not a convenience layer — one server, one credential scope, read-only by default, writes tagged and policy-gated.

How do I add an MCP server to Microsoft Copilot Studio in a regulated tenant?

Five decisions sit upstream of the onboarding wizard. First, pick OAuth on-behalf-of for any tool that touches user-scoped data — API key auth attributes everything to the MCP server's own identity, which fails compliance review. Second, classify the connector into the right Power Platform DLP zone before publishing. Third, enable generative orchestration on the agent — classic orchestration ignores MCP. Fourth, use Streamable HTTP transport; SSE was deprecated in August 2025. Fifth, scope the connector to your tenant unless cross-tenant publishing is the explicit business goal.

What is the right Entra ID identity for an AI agent?

Four shapes cover the realistic option space. App-only (service principal) suits agents acting on shared, non-user-scoped data — the audit trail reads "the agent did it." On-behalf-of (OBO) is the only correct answer when the agent reads or writes user-scoped data — the trail reads "the user's agent did it." Federated identity credentials (FIC) replace stored client secrets for agent runtimes outside Azure (GitHub Actions, Kubernetes, Vercel). Managed identity is the platform-managed machine identity when the agent runs on Azure compute. The choice is downstream-attribution-driven, not preference-driven.

What does AI agent infrastructure consulting actually deliver?

The work is the layer between a working prompt and a production deployment that passes security review. Concretely: a credential broker so agents never hold long-lived secrets, scoped tool surfaces so the blast radius of any single tool call is bounded, policy-gated approvals so irreversible actions queue for human review, and a structured audit layer so compliance, incident response, and post-mortem all read from the same source. Deliverables include architecture docs, IaC, runbooks, and the operator dashboards governance teams actually use — handed off to the client's engineers, not locked behind the consultant.

How long is a Discovery engagement?

One to two weeks, fixed-fee. The output is a written report covering the five layers — identity, credential broker, scoped tool access, policy gating, audit — mapped to the specific compliance controls the client's InfoSec team will ask about. The report includes a threat model, a prioritized remediation list, and a one-page summary suitable for handing to security leadership. Discovery is the right engagement when an agent project either hasn't passed security review yet or hasn't been put in front of one, and the team needs a clear remediation path before committing to a build.

Discovery
Architecture & security review
1–2 weeks · fixed-fee

A focused look at an existing or planned agent deployment. Output is a written report covering identity, credential surface, tool scope, approval flow, audit, and the specific compliance gaps you'll need to close.

  • Threat model + control mapping
  • Concrete remediation list, prioritized
  • One-page summary for InfoSec / leadership
Build
Production agent infrastructure
8–12 weeks · fixed-fee

End-to-end build of an agent system on your stack. Identity, credential broker, scoped tools, policy gating, audit layer — wired into your existing auth and observability. Ships behind a real approval flow, not a demo.

  • Architecture, IaC, CI/CD
  • Tool surface + MCP server design
  • Audit + dashboard for governance review
  • Handoff with documentation and runbooks
Advisory
Ongoing retainer
Monthly · flexible hours

For teams that already have engineers but need an experienced hand on agent deployments. Architecture review, design partnership, security posture checks, escalation on tricky calls.

  • Weekly review on active work
  • Async review of PRs / designs
  • On-demand for production incidents
All packages →
$2,500 · 1 week

AI Governance Foundation Audit

NIST AI RMF gap analysis + Microsoft Foundry control-plane recommendations. The report you hand to InfoSec before the procurement conversation starts.

$1,500 · 3 days

MCP Server Security Review

Tool-by-tool written review against the OWASP Agentic AI Top 10, with a remediation list ordered by exploitability.

$5,000 · 2 weeks

Copilot Studio Agent Quickstart

Working agent + Functions middleware + connector wiring to one enterprise data source. Security-review-ready by handoff.

Book
Schedule a 30-min architecture review
Email
contact@althor.dev
Currently
Selective engagements considered for Q3 2026
Based
Washington, D.C. metro — remote-first