Shivanathsai/README.md
Shiva Mittapalli · Full-Stack, Data, and AI/ML Engineer

LinkedIn Email GitHub

Open to 2026 New Grad SWE / SDE 2 · Full-Stack · Data · AI/ML Engineering roles in the US
F-1 STEM OPT · 3-yr work authorization · Tempe, AZ


👋 About

I'm a software engineer with 4.5 years of production experience, now finishing my MS in Software Engineering at Arizona State (4.0 GPA, graduating Dec 2026).

I work across three lanes: shipping full-stack products, building data platforms, and integrating AI/ML into production. I've delivered systems serving 500K+ users at 99.9% uptime, built ML pipelines that prevented $2M+/year in fraud, and cut batch processing time by 87% by re-architecting partitioning. Comfortable with the whole loop: Spring Boot APIs, React UIs, Spark jobs, RAG pipelines, and the Kubernetes manifests holding it all together.

What I'm into right now: real-time data infrastructure, LLM-backed services that actually work in production, and making distributed systems boring (in the good way).


🛠️ Tech Stack

🔵 Full-Stack

Java 21 · Spring Boot · React.js · Next.js · TypeScript · Node.js · PostgreSQL · MySQL · MongoDB · REST · GraphQL · gRPC

🟣 Data Engineering

Python · Spark · Kafka · Airflow · Delta Lake · dbt · Snowflake · AWS EMR · AWS Glue · Redis · Pandas · NumPy

🟢 AI / ML

PyTorch · LangChain · LLM / RAG · pgvector · Pinecone · OpenAI API · Anthropic API · MLflow · XGBoost · scikit-learn

Cloud & Infrastructure: AWS (Lambda, S3, EMR, Glue, ECS, Cognito) · Docker · Kubernetes · Terraform · GitHub Actions

Observability: Prometheus · Grafana · OpenTelemetry · Datadog


📌 Featured Projects

🔵 Task-Flow (Full-Stack)

Collaborative task manager: JWT auth via Spring Security, drag-and-drop Kanban, real-time stats dashboard, Swagger-documented API. Containerized end-to-end with Docker Compose and ready for ECS deployment.

Java 21 Spring Boot React.js PostgreSQL Docker

🟢 ML Inference Optimizer (AI / ML)

PyTorch inference framework: 3.2× throughput, 68% latency reduction vs FP32. INT8 quantization (75% smaller, <1% accuracy loss), custom C++/CUDA fused kernels (ReLU + LayerNorm), DDP distributed training, FastAPI async server.

Python PyTorch CUDA / C++ DDP FastAPI
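
For context on the "75% smaller" figure: it matches what plain dtype arithmetic predicts when 4-byte FP32 weights are stored as 1-byte INT8 values. The sketch below is only that arithmetic, not code from the repository; `size_reduction` and the parameter count are hypothetical, and it ignores the small overhead of per-tensor scales and zero-points.

```python
# Standard storage widths, in bytes per parameter.
BYTES_FP32 = 4
BYTES_INT8 = 1


def size_reduction(num_params: int) -> float:
    """Fraction of weight storage saved by moving FP32 weights to INT8.

    Assumption: every parameter is quantized and the scale/zero-point
    metadata is negligible.
    """
    fp32_bytes = num_params * BYTES_FP32
    int8_bytes = num_params * BYTES_INT8
    return 1 - int8_bytes / fp32_bytes


# Hypothetical model size, chosen only for illustration.
reduction = size_reduction(10_000_000)
print(f"weight storage saved: {reduction:.0%}")
```

In practice the saving lands slightly below this bound whenever some layers stay in FP32.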

🟣 Realtime Streaming Platform (Systems / Python)

Event-driven Kafka platform: 500K+ events/hr, exactly-once semantics, 5 CEP patterns (velocity, anomaly, impossible-travel, burst, merchant-diversity), Welford's online algorithm, Redis state store, K8s with HPA on consumer lag.

Python Kafka Redis PostgreSQL Kubernetes
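
Welford's online algorithm keeps a running mean and variance in constant memory, which is why it suits per-key streaming state. Below is a minimal sketch of the technique, not the platform's actual code: `RunningStats`, `zscore`, and the sample amounts are hypothetical, and using the result for z-score anomaly checks is an assumption about how the statistics might feed the anomaly pattern.

```python
import math
from dataclasses import dataclass


@dataclass
class RunningStats:
    """Welford's online mean/variance for one tracked key (hypothetical helper)."""

    count: int = 0
    mean: float = 0.0
    m2: float = 0.0  # running sum of squared deviations from the mean

    def update(self, x: float) -> None:
        # Single-pass update: no need to store past observations.
        self.count += 1
        delta = x - self.mean
        self.mean += delta / self.count
        self.m2 += delta * (x - self.mean)

    @property
    def variance(self) -> float:
        # Sample variance; reported as 0.0 until two observations exist.
        return self.m2 / (self.count - 1) if self.count > 1 else 0.0

    def zscore(self, x: float) -> float:
        # Distance of x from the running mean, in standard deviations.
        std = math.sqrt(self.variance)
        return (x - self.mean) / std if std > 0 else 0.0


stats = RunningStats()
for amount in [20.0, 22.0, 19.0, 21.0, 20.0]:  # made-up transaction amounts
    stats.update(amount)

print(stats.mean, stats.variance, stats.zscore(500.0))
```

Only the three fields `count`, `mean`, and `m2` need to persist between events, so the state for each key stays small enough to serialize into a key-value store.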

🟣 Distributed ETL · Delta Lake (Data Engineering)

Medallion-architecture pipeline (Bronze → Silver → Gold) processing 500GB+/day. Delta Lake ACID, partition pruning + Z-Ordering (60–67% faster queries), SCD Type 2, Great Expectations validation, Terraform-provisioned AWS EMR.

Python Spark Delta Lake Airflow AWS EMR Terraform

More projects worth a look →

🟠 Event Processing Platform: Java microservices on Kafka Streams + AWS EKS. Multi-module Maven project (producer / processor / consumer), exactly-once semantics, tumbling-window aggregation, Prometheus + Grafana monitoring, Alertmanager integration.

Java Spring Boot Kafka Streams AWS EKS Grafana

🟢 LLM Code Assistant: RAG-based code assistant over 100K+ lines of documentation. ChromaDB vector store, Groq LLM (llama-3.1-8b-instant), 92% accuracy on a 25-QA benchmark suite, sub-2s p99 latency, accuracy gates enforced in CI.

Python RAG ChromaDB Groq FastAPI


📈 Track Record

🚀  Shipped full-stack systems (React + Spring Boot/Node) for 500K+ users at 99.9% uptime
⚡  Cut API p99 latency by 45% through query-plan and indexing rewrites
📦  Reduced batch processing time by 87% (6 hrs → 45 min) by re-architecting partitioning
💰  Prevented $2M+/year in fraud with real-time ML detection pipelines
🤖  Integrated OpenAI APIs and RAG pipelines into production user-facing features
👥  Mentored 5 engineers on distributed systems design reviews

🎓 Currently

  • 🎓 Wrapping up MS in Software Engineering at Arizona State University (Dec 2026)
  • 🛠️ Shipping Task-Flow and the LLM RAG project as portfolio capstones
  • 📚 Going deep on streaming systems, vector search, and production ML infra
  • 🌎 Recruiting season: open to 2026 New Grad SWE roles (full-stack, data, AI/ML)

📫 Let's Talk

If you're hiring, collaborating, or just want to swap notes on distributed systems, the easiest way to say hi is the email above.

Pinned

  1. Distributed-ETL-Delta-Lake · Public · Python
  2. event-processing-platform · Public · Java
  3. llm-code-assistant · Public · Python
  4. ML-Inference-Optimizer · Public · Python
  5. realtime-streaming-platform · Public · Python
  6. Task-Flow · Public · Java