Stars
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Example models using DeepSpeed
A high-throughput and memory-efficient inference and serving engine for LLMs
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
verl: Volcano Engine Reinforcement Learning for LLMs
A toolkit to run Ray applications on Kubernetes
This is a place for various problem detectors running on the Kubernetes nodes.
The official Typescript SDK for Model Context Protocol servers and clients
mcp-auth / inspector
Forked from modelcontextprotocol/inspectorVisual testing tool for MCP servers
A GitOps OpenTofu and Terraform controller for Flux
Open and extensible continuous delivery solution for Kubernetes. Powered by GitOps Toolkit.
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
A kubernetes operator for creating and managing a cache of container images directly on the cluster worker nodes, so application pods start almost instantly
a Docker + Kubernetes network trouble-shooting swiss-army container
Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster
A Kubernetes mutating webhook server that implements sidecar injection
SRIOV network device plugin for Kubernetes
Service Fabric Emulator, run your stateful service fabric app without service fabric cluster
Export Kubernetes events to multiple destinations with routing and filtering
Add-on agent to generate and expose cluster-level metrics.


