bychen7

bychen7

Achievements

Stars

RL-Align / RL-Kernel

Modern RL Post-training Infrastructure: Optimized for NVIDIA/AMD GPUs with a focus on vLLM and DeepSpeed integration, CUDA/ROCm/Triton kernels, and transparent hardware-aware scaling.

Python 122 20 Updated Jun 13, 2026

OpenBMB / ForgeTrain

Python 230 21 Updated May 26, 2026

kangarooking / buffett-letters-skill

An AI skill pack for value investing, capital allocation, and behavioral discipline, distilled from Warren Buffett's 60+ years of shareholder letters.

38 8 Updated Apr 17, 2026

speedyapply / JobSpy

Jobs scraper library for LinkedIn, Indeed, Glassdoor, Google, ZipRecruiter & more

Python 3,655 718 Updated Feb 18, 2026

xbtlin / ai-berkshire

AI 时代的伯克希尔：基于 Claude Code 的价值投资研究框架。巴菲特·芒格·段永平·李录四大师方法论 + 多Agent并行研究。

Python 33 6 Updated Jun 14, 2026

nick7nlp / Awesome-LLM-On-Policy-Distillation

A curated collection of papers and resources on On-Policy Distillation for Large Language Models.

Python 301 6 Updated Jun 6, 2026

alchaincyf / nuwa-skill

你想蒸馏的下一个员工，何必是同事。蒸馏任何人的思维方式——心智模型、决策启发式、表达DNA。Distill how anyone thinks.

Python 24,202 3,555 Updated Jun 6, 2026

Gen-Verse / Open-AgentRL

RLAnything (ICML 2026) & AutoTool (ICML 2026), DemyAgent: Open-Source RL for LLMs and Agentic Scenarios

Python 546 56 Updated Jun 12, 2026

Gen-Verse / OpenClaw-RL

OpenClaw-RL: Train any agent simply by talking

Python 5,494 595 Updated May 23, 2026

WooooDyy / AgentGym-RL

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python 774 74 Updated Feb 15, 2026

karpathy / autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 86,581 12,541 Updated Mar 26, 2026

MineDojo / NitroGen

A Foundation Model for Generalist Gaming Agents

Python 2,076 233 Updated Jan 25, 2026

FoundationAgents / MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 68,774 8,786 Updated Jan 21, 2026

Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

1,421 62 Updated May 11, 2026

guyulongcs / Awesome-Deep-Learning-Papers-for-Search-Recommendation-Advertising

Awesome Deep Learning papers for industrial Search, Recommendation and Advertisement. They focus on Embedding, Matching, Pre-Ranking, Ranking, Post Ranking, Relevance, LLM and RL. Please cite our p…

Python 2,525 288 Updated Apr 25, 2026

Osilly / Vision-R1

[ICLR2026] This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incen…

Python 1,357 27 Updated Mar 20, 2026