Stars
Kubebuilder - SDK for building Kubernetes APIs using CRDs
SGLang is a high-performance serving framework for large language models and multimodal models.
AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary
Delivers efficient, stable, and secure data distribution and acceleration powered by P2P technology, with an optional content‑addressable filesystem that accelerates OCI container launch.
Mechanical keyboard sound simulator for macOS — pixel-art UI, psychoacoustic audio, 13 switch profiles
NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
Claude Code 泄露源码 - 本地可运行版本,新增跨平台桌面端软件补齐Computer Use(附带核心模块解析)
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Open-source, secure environment with real-world tools for enterprise-grade agents.
Automated management of large-scale applications on Kubernetes (incubating project under CNCF)
Rapid and cost-effective operator and best practice for agent sandbox lifecycle management.
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
This repository contains tutorials and examples for Triton Inference Server
UltiSpike / tinker
Forked from thinking-machines-lab/tinkerTraining API
A Q&A platform software for teams at any scales. Whether it's a community forum, help center, or knowledge management platform, you can always count on Apache Answer.
A QoS-based scheduling system brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, etc.
The Moby Project - a collaborative project for the container ecosystem to assemble container-based systems
A library for training and deploying machine learning models on Amazon SageMaker
Shim logger repository for streaming container logs when using Containerd
One stop shop for running AI/ML on AWS.
AI-powered content analysis platform for social media creators (Xiaohongshu, Douyin)

