- Tel Aviv, Israel
- 03:14 (UTC +03:00)
- in/omer-aplatony
Starred repositories
Tool for deleting all photos from Google Photos
Contains the latest company-wise LeetCode questions as of Feb 2026.
Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs
Tensors and Dynamic neural networks in Python with strong GPU acceleration
A toolkit to run Ray applications on Kubernetes
Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
Supercharge Your LLM with the Fastest KV Cache Layer
SGLang is a high-performance serving framework for large language models and multimodal models.
Cost-efficient and pluggable Infrastructure components for GenAI inference
A flexible distributed key-value database that is optimized for caching and other realtime workloads.
Gateway API Inference Extension
Manages Envoy Proxy as a Standalone or Kubernetes-based Application Gateway
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
A high-throughput and memory-efficient inference and serving engine for LLMs
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
Gin is a high-performance HTTP web framework written in Go. It provides a Martini-like API but with significantly better performance—up to 40 times faster—thanks to httprouter. Gin is designed for …
A CLI tool and go library which recommends instance types based on resource criteria like vcpus and memory
📙 Amazon Web Services — a practical guide
Karpenter is a Kubernetes Node Autoscaler built for flexibility, performance, and simplicity.
AWS version of Kelsey's kubernetes-the-hard-way
Open and extensible continuous delivery solution for Kubernetes. Powered by GitOps Toolkit.
Networking plugin repository for pod networking in Kubernetes using Elastic Network Interfaces on AWS
Distributed reliable key-value store for the most critical data of a distributed system
KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes



