Popular repositories Loading
-
-
ms-swift
ms-swift PublicForked from modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Lla…
Python
-
-
Muon
Muon PublicForked from KellerJordan/Muon
Muon is an optimizer for hidden layers in neural networks
Python
-
-
verl-agent
verl-agent PublicForked from langfengQ/verl-agent
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
Python
If the problem persists, check the GitHub status page or contact support.