Founder & Lead Architect @ SMLabs AI Β· Building AI systems that run anywhere β cloud, edge, or offline
I'm an AI Systems Architect focused on the orchestration primitives, cognitive state machines, and inter-agent protocols that make agentic systems reliable at runtime.
smagent / Cognitive Runtime (private) Agentic OS with real orchestration primitives, cognitive state management, and inter-agent communication protocols. Built from scratch for full production control β no framework lock-in.
anthropic-proxy (public) Multi-provider LLM routing layer with protocol translation β routes Claude Code CLI through Gemini, Groq, DeepSeek, and OpenRouter. Direct provider integration β minimal abstraction overhead.
MedGemma FORENSIC (public) Offline multimodal AI diagnostics on a $200 Android device. Gemma 3 1B + MedSigLIP + MedASR at 417MB peak RAM. Submitted to Google's Health AI Developer Foundations program.
SMExpense (private) AI-powered expense analyzer with receipt scanning, voice input, and smart budgeting.
Serper Search MCP Server 8-tool MCP server (web, images, videos, news, shopping, places, deep research, RAG context) used with Claude, Cursor, Windsurf, n8n, and KiloCode. Published on npm and Docker. TypeScript, full type safety, zero telemetry.
Running a Multimodal Diagnostic Suite (LLM + Vision + Audio + Anomaly Detection) on a consumer Android device under 500MB peak RAM.
A "Traffic Cop" sequential AI kernel (ModelLifecycleManager.kt) orchestrates ~4.3GB of model weights serially β keeping peak native heap at ~417MB on a 4GB RAM device.
Worker Node (Field Medic) Anchor Node (Command Center)
βββ MedSigLIP β visual capture βββ Receives CaseFile.proto streams
βββ MedASR β verbal autopsy βββ Full diagnostic reasoning
βββ Packages β CaseFile.proto βββ Gemma 3 1B β ForensicReport
β P2P Mesh / WiFi Direct β βββ AnomalyEngine β Epidemic Clusters
πΊ Live Gradio Demo
AI / ML & Agents
LLM Providers & APIs
Full Stack SaaS
Edge & Mobile
Backend & APIs
Infrastructure & DevOps
Self-hosted on bare-metal Linux β no managed platform abstraction layer.
netcup VPS 1000 ARM G11 Β· Manassas, USA
βββ π² 6-core ARM64 Β· 8GB RAM Β· 256GB NVMe SSD
βββ π§ Ubuntu 24.04 LTS
βββ π³ Docker Β· Nginx Β· Traefik reverse proxy Β· SSL
βββ π Coolify β self-hosted deployment orchestration
βββ π§ Hosts: smagent Cognitive Runtime Β· future SMLabs AI services
Provisioning, securing, and operating Linux servers without a managed platform layer.
| System | What It Is | Stack |
|---|---|---|
| medgemma-forensic (public) | Edge AI pathologist β multimodal diagnostics on Android | Kotlin, LiteRT, ONNX, Protobuf, P2P |
| serper-search-mcp-server (public) | Enterprise MCP server for agent web search | TypeScript, MCP, Node.js, Serper API |
| smagent (private) | Cognitive Runtime β agentic OS, orchestration primitives | Python, LangGraph, Redis, MCP |
| anthropic-proxy (public) | LLM gateway routing Claude Code through Gemini, Groq, DeepSeek | Python, Claude API, Gemini, Groq, DeepSeek |
| smexpense (private) | Live AI expense SaaS β receipt OCR, voice input, smart budgeting | Next.js, Supabase, Prisma, Tailwind, PostgreSQL, Redis |
"The difference between an AI enthusiast and an AI architect is orchestration."
