View chaos-ad's full-sized avatar

A construction kit for reinforcement learning environment management.

Python 437 62 Updated May 16, 2026

This is a course on Deep Learning-based Recommender Systems taught at HSE University, academic year 2025/26.

Jupyter Notebook 66 6 Updated May 13, 2026

The batteries-included agent harness.

Python 22,856 3,231 Updated May 16, 2026

Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, datasets, and full end-to-end reference examples to build with Nemotron models

Jupyter Notebook 1,098 235 Updated May 15, 2026

Material for gpu-mode lectures

Jupyter Notebook 6,079 611 Updated May 9, 2026

🚀 Efficient implementations for emerging model architectures

Python 5,105 531 Updated May 14, 2026
Python 3,120 646 Updated May 13, 2026

DeeperGEMM: crazy optimized version

Cuda 86 Updated May 5, 2025

FlashInfer: Kernel Library for LLM Serving

Python 5,620 976 Updated May 16, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 80,204 16,860 Updated May 16, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 6,224 570 Updated May 12, 2026

Code for "Learning to summarize from human feedback"

Python 1,064 153 Updated Sep 5, 2023

CLI interfaces & config objects, from types

Python 1,049 47 Updated May 9, 2026

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.6, GPT-OSS, Llama, and more!

Python 9,459 832 Updated May 16, 2026

Parallelized search for matrix multiplication schemes using flip graphs on PyTorch

Jupyter Notebook 4 1 Updated Sep 20, 2025

A collection of notebooks/recipes showcasing use cases of open-source models with Together AI.

Jupyter Notebook 1,135 206 Updated May 2, 2026

DeepEP: an efficient expert-parallel communication library

Cuda 9,630 1,244 Updated May 13, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,798 1,113 Updated May 17, 2026

Official inference framework for 1-bit LLMs

Python 39,015 3,553 Updated Mar 10, 2026

Optimized primitives for collective multi-GPU communication

C++ 4,717 1,257 Updated May 16, 2026

Collective communications library with various primitives for multi-machine training.

C++ 1,427 355 Updated Apr 21, 2026

ContextualAI's text-to-SQL pipeline for BIRD benchmark

Python 75 12 Updated Apr 2, 2026

Efficient Triton Kernels for LLM Training

Python 6,357 528 Updated May 16, 2026

Everything about the SmolLM and SmolVLM family of models

Python 3,776 292 Updated Apr 2, 2026

Complete solutions to the Programming Massively Parallel Processors Edition 4

Jupyter Notebook 756 103 Updated Jun 18, 2025

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,688 1,353 Updated May 7, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,096 2,077 Updated Mar 27, 2026

Code and training scripts for FlexOlmo

Python 150 24 Updated Apr 20, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,706 794 Updated May 14, 2026

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python 1,270 116 Updated Aug 16, 2025