Skip to content
View smjahid012's full-sized avatar
  • Dhaka

Block or report smjahid012

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
smjahid012/README.md

Typing SVG

Profile views

Founder & Lead Architect @ SMLabs AI Β· Building AI systems that run anywhere β€” cloud, edge, or offline


🧠 Systems I Have Built

I'm an AI Systems Architect focused on the orchestration primitives, cognitive state machines, and inter-agent protocols that make agentic systems reliable at runtime.

🧠 Agent Orchestration System

smagent / Cognitive Runtime (private) Agentic OS with real orchestration primitives, cognitive state management, and inter-agent communication protocols. Built from scratch for full production control β€” no framework lock-in.

🌐 LLM Gateway System

anthropic-proxy (public) Multi-provider LLM routing layer with protocol translation β€” routes Claude Code CLI through Gemini, Groq, DeepSeek, and OpenRouter. Direct provider integration β€” minimal abstraction overhead.

🧬 Edge AI System

MedGemma FORENSIC (public) Offline multimodal AI diagnostics on a $200 Android device. Gemma 3 1B + MedSigLIP + MedASR at 417MB peak RAM. Submitted to Google's Health AI Developer Foundations program.

πŸ’° SaaS System

SMExpense (private) AI-powered expense analyzer with receipt scanning, voice input, and smart budgeting.

πŸ”Œ AI Tooling System

Serper Search MCP Server 8-tool MCP server (web, images, videos, news, shopping, places, deep research, RAG context) used with Claude, Cursor, Windsurf, n8n, and KiloCode. Published on npm and Docker. TypeScript, full type safety, zero telemetry.


πŸ”¬ The MedGemma FORENSIC Architecture

Running a Multimodal Diagnostic Suite (LLM + Vision + Audio + Anomaly Detection) on a consumer Android device under 500MB peak RAM.

A "Traffic Cop" sequential AI kernel (ModelLifecycleManager.kt) orchestrates ~4.3GB of model weights serially β€” keeping peak native heap at ~417MB on a 4GB RAM device.

Worker Node (Field Medic)          Anchor Node (Command Center)
β”œβ”€β”€ MedSigLIP β†’ visual capture     β”œβ”€β”€ Receives CaseFile.proto streams
β”œβ”€β”€ MedASR   β†’ verbal autopsy      β”œβ”€β”€ Full diagnostic reasoning
└── Packages β†’ CaseFile.proto      β”œβ”€β”€ Gemma 3 1B β†’ ForensicReport
     ↓ P2P Mesh / WiFi Direct ↓    └── AnomalyEngine β†’ Epidemic Clusters

πŸ“Ί Live Gradio Demo


πŸ› οΈ Tech Stack

AI / ML & Agents

Python PyTorch HuggingFace LiteRT ONNX LangChain LangGraph MCP

LLM Providers & APIs

Claude Gemini API Groq DeepSeek OpenRouter OpenAI

Full Stack SaaS

Next.js Node.js TypeScript React Supabase Prisma Tailwind PostgreSQL Redis Better Auth

Edge & Mobile

Kotlin Coroutines Android Protobuf WiFi Direct ARM

Backend & APIs

FastAPI Node.js REST GraphQL Upstash

Infrastructure & DevOps

Docker Docker Compose Nginx Traefik Ubuntu Coolify GitHub Actions Vercel


πŸ–₯️ Self-Hosted Infrastructure

Self-hosted on bare-metal Linux β€” no managed platform abstraction layer.

netcup VPS 1000 ARM G11  Β·  Manassas, USA
β”œβ”€β”€ πŸ”² 6-core ARM64  Β·  8GB RAM  Β·  256GB NVMe SSD
β”œβ”€β”€ 🐧 Ubuntu 24.04 LTS
β”œβ”€β”€ 🐳 Docker  Β·  Nginx  Β·  Traefik reverse proxy  Β·  SSL
β”œβ”€β”€ πŸš€ Coolify β€” self-hosted deployment orchestration
└── 🧠 Hosts: smagent Cognitive Runtime Β· future SMLabs AI services

Provisioning, securing, and operating Linux servers without a managed platform layer.


πŸ“Œ Pinned Systems

System What It Is Stack
medgemma-forensic (public) Edge AI pathologist β€” multimodal diagnostics on Android Kotlin, LiteRT, ONNX, Protobuf, P2P
serper-search-mcp-server (public) Enterprise MCP server for agent web search TypeScript, MCP, Node.js, Serper API
smagent (private) Cognitive Runtime β€” agentic OS, orchestration primitives Python, LangGraph, Redis, MCP
anthropic-proxy (public) LLM gateway routing Claude Code through Gemini, Groq, DeepSeek Python, Claude API, Gemini, Groq, DeepSeek
smexpense (private) Live AI expense SaaS β€” receipt OCR, voice input, smart budgeting Next.js, Supabase, Prisma, Tailwind, PostgreSQL, Redis

πŸ“« Connect

SMLabs AI SMExpense LinkedIn GitHub HuggingFace

"The difference between an AI enthusiast and an AI architect is orchestration."

Pinned Loading

  1. serper-search-mcp-server serper-search-mcp-server Public

    Serper Search - Enterprise Edition

    TypeScript

  2. medgemma-forensic medgemma-forensic Public

    Offline AI Pathologist

    Kotlin

  3. anthropic-proxy anthropic-proxy Public

    Not just a proxy β€” makes any model behave like Claude Code. Format translation + behavior injection + tool sanitizer + multi-provider fallback. One pip install replaces LiteLLM for solo devs.

    Python

  4. smjahid012 smjahid012 Public