Stars
A cross-platform desktop All-in-One assistant for Claude Code, Codex, OpenCode, OpenClaw, Gemini CLI & Hermes Agent. Only official website: ccswitch.io
Tile-Based Runtime for Ultra-Low-Latency LLM Inference
A powerful AI coding agent. Built for the terminal.
Official, Anthropic-managed directory of high quality Claude Code Plugins.
Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors
swiss-ai / Megatron-LM
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models
Bash is all you need - A nano claude code–like 「agent harness」, built from 0 to 1
AI generates a real, editable PowerPoint from any document — native shapes & animations, speaker notes voiced as audio narration, and the option to follow your own .pptx template, not slide images …
CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies
Persistent Context Across Sessions for Every Agent – Captures everything your agent does during sessions, compresses it with AI, and injects relevant context back into future sessions. Works with C…
分享AI Infra知识&代码练习:PyTorch、vLLM/SGLang、slime/vime框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等
🚀 Efficient implementations for emerging model architectures
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
High-performance distributed data shuffling (all-to-all) library for MoE training and inference
Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"
[TMLR 2024] Efficient Large Language Models: A Survey
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI.
An Open-source RL System from ByteDance Seed and Tsinghua AIR
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)
