Lists (3)
Sort Name ascending (A-Z)
Stars
An agentic skills framework & software development methodology that works.
CLI for common Playwright actions. Record and generate Playwright code, inspect selectors and take screenshots.
A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows
Claude Code Skills and 200+ agent skills from official dev teams and the community, compatible with Codex, Antigravity, Gemini CLI, Cursor and others.
τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment
ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.
An interface library for RL post training with environments.
《动手学大模型Dive into LLMs》系列编程实践教程
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
A character-level language diffusion model trained on Tiny Shakespeare
PhD/MBA-level human-annotated rubrics dataset across Physics, Chemistry, Finance and Consulting
OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.
shizhediao / nanochat
Forked from karpathy/nanochatThe best ChatGPT that $100 can buy.
Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
An Open-Source Large-Scale Reinforcement Learning Project for Search Agents
High accuracy RAG for answering questions from scientific documents with citations





