Consulting
19 years building reliable, scalable systems
I help engineering teams ship reliable platforms and put AI into production — safely, measurably, and without runaway costs.
Available for advisory, hands-on build, and fractional engagements.
Three Pillars
SRE & Reliability
Designing systems for 99.99%+ uptime with proper SLOs, observability, and incident response.
- SLOs, error budgets & reliability strategy
- Observability with Prometheus & Grafana (incl. custom exporters)
- Incident response & on-call design
- Zero-downtime deployments & hot config reload
- Kubernetes & cloud-native architecture reviews
Four-nines uptime targets; zero-downtime config reload that eliminated 20-minute rolling restarts
Platform Engineering & DevEx
Internal developer platforms, golden paths, GitOps and shared libraries that eliminate toil.
- Internal developer platforms & golden-path scaffolders
- GitOps (ArgoCD) and reusable Helm chart libraries
- CI/CD pipelines & GitHub automation tooling
- Cloud cost optimization / FinOps (KEDA scale-to-zero, smart autoscaling)
- Shared SDKs & libraries that standardize delivery
Award-winning autoscaling; ~18% compute cost reductions; scaffolders that bootstrap new services in minutes
AI Engineering
Putting LLMs and agents into production safely and cost-effectively — with banking-grade guardrails.
- LLM integration architecture (AWS Bedrock, Claude, Spring AI)
- MCP & agentic tooling — servers, clients, tool-calling orchestration
- AI cost optimization — prompt caching, model fallback, extended context
- AI developer tooling — automated code review, doc generation, scaffolding
- AI readiness & guardrails — security, governance, RBAC/SSO
20–35% AI cost reductions via prompt caching; 90%+ cache hit rates; multi-provider fallback
Engagement Models
Advisory
Architecture reviews, audits, and strategy. Short, fixed-scope engagements to unblock decisions and set direction.
Hands-on Build
Embedded delivery of a specific system or tooling — from reliability work to a production AI integration.
Fractional Architect
Ongoing fractional SRE / Platform / AI architect support to level up a team over time.
Let's talk
Tell me what you're trying to build or fix. I'll tell you how I'd approach it.