maksimstw

Taiwei Shi maksimstw

Achievements

verl-project/verl verl-project/verl Public

verl: Volcano Engine Reinforcement Learning for LLMs

Python 19k 3.2k
hiyouga/LlamaFactory hiyouga/LlamaFactory Public

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 67k 8.1k
BytedTsinghua-SIA/MemAgent BytedTsinghua-SIA/MemAgent Public

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 882 58
lmarena/arena-hard-auto lmarena/arena-hard-auto Public

Arena-Hard-Auto: An automatic LLM benchmark.

Python 994 144
zeno-ml/zeno-build zeno-ml/zeno-build Public

Build, evaluate, understand, and fix LLM-based apps

Jupyter Notebook 492 32
limenlp/safer-instruct limenlp/safer-instruct Public

This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"

17 1