Skip to content
View bzubs's full-sized avatar

Block or report bzubs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
bzubs/README.md

Aditya Veerkar (bzubs)

┌─────────────────────────────────────────────────────────────┐
│  AI/ML Engineer • Systems Architect • Backend Developer     │
│  Building high-performance systems from first principles    │
└─────────────────────────────────────────────────────────────┘

Core Stack

Languages

Python      ██████████████████░░  90%    C++  ████████████████░░░░  80%     Typescript   ██████████░░░░░░░░░░  40%
JavaScript  ██████████████░░░░░░  70%    Go   ██████░░░░░░░░░░░░░░  30%

Systems & Infrastructure

┌─ Backend ──────────────────────────────────────────┐
│ FastAPI • gRPC • Node.js • NGINX • WebSockets      │
├─ Orchestration ────────────────────────────────────┤
│ Docker • Kubernetes • Microservices                │
├─ Databases ────────────────────────────────────────┤
│ PostgreSQL • MongoDB • Redis • Vector DBs          │
└────────────────────────────────────────────────────┘

AI/ML Pipeline

┌─ Frameworks ───────────────────────────────────────┐
│ PyTorch • TensorFlow • vLLM Inferencing            │
├─ Specialized ──────────────────────────────────────┤
│ CUDA • Distributed Inferencing • Benchmarking      │
├─ Applications ─────────────────────────────────────┤
│ LLMs • RAG Systems • Fine-tuning                   │
└────────────────────────────────────────────────────┘

Notable Projects

Systems Engineering

LLM Benchmarking tool

A GPU-aware LLM benchmarking orchestration platform built on top of vLLM, designed for real-world system behavior, concurrent execution, and clean architecture design.

  • Remote benchmark job submission
  • Thread based Task Scheduler for parallel execution
  • Cluster management for available GPU cluster
  • Real-time monitoring & logs
  • Structured metrics extraction
  • Per-user history tracking

FERMION

A large-scale production-grade, high-performance reverse proxy designed to operate under configurable latency and throughput constraints. This system provides a pluggable architecture for sophisticated traffic distribution and real-time observability.

  • Tech: Node.js, http API, zod, vanilla JS

DELTON

Scalable 2 server-based architecture for production-based secure wiping.

  • Custom token based auth between proxy and microservices.
  • Remote Wiping tool functionality for Linux Systems.
  • Professional Logging system and Dashboards
  • Tech: Node.js, FastAPI, Nest.js, MongoDB

Machine Learning

attention-from-scratch

Built encoder-decoder mdoel Transformer and MiniGPT implementation in pure PyTorch

  • Custom attention mechanisms, positional encoding, layer normalization
  • Training pipeline for dialogue datasets
  • Tech: PyTorch, NumPy

Chat With Video

Production RAG systems for PDF processing and real-time chat

  • Vector similarity search with FAISS
  • YouTube transcript extraction and semantic search
  • Tech: LangChain, ChromaDB, FastAPI

mlzero

Custom machine learning library with fundamental algorithms implemented from scratch

  • Linear/Logistic Regression, Decision Trees, Random Forests, SVM
  • Neural Networks with backpropagation, K-Means, PCA
  • Pure Python/NumPy implementations for research clarity
  • Tech: Python, NumPy, Mathematical optimization

📊 GitHub Analytics


🛠️ Tech Arsenal

Primary Languages

Python C++ JavaScript Go TypeScript

ML/AI Stack

PyTorch TensorFlow HuggingFace scikit-learn

Backend & Infrastructure

FastAPI Node.js gRPC Docker Kubernetes NGINX

Databases & Caching

PostgreSQL MongoDB Redis Pinecone

Tools & Platforms

Git Linux VS Code Jupyter


Connect

GitHub LinkedIn Email


Pinned Loading

  1. MLzero MLzero Public

    Implementation of widely used ML algorithms in vanilla python

    Python

  2. attentionX attentionX Public

    From Scratch Implementations of the "Transformer Architecture" in Numpy. Educational Module for clear visualization of the famous architecture.

    Python

  3. x86emu x86emu Public

    Emulating Low Level x86 instructions

    Python 1

  4. loadbalancer loadbalancer Public

    Reverse Proxy Implementation using custom algorithms for efficient throughput and latency management in native Node HTTP API

    JavaScript