Stars
Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning
Paper: “MEMRL: SELF-EVOLVING AGENTS VIA RUNTIME REINFORCEMENT LEARNING ON EPISODIC MEMORY” Open-Source Code
Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
AgentEvolver: Towards Efficient Self-Evolving Agent System
Agent0 Series: Self-Evolving Agents from Zero Data
A Benchmark for Evaluating Safety and Trustworthiness in Web Agents for Enterprise Scenarios
Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedback"
[ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
[NeurIPS 2025] ALMGuard: Safety Shortcuts and Where to Find Them as Guardrails for Audio–Language Models
Training VLM agents with multi-turn reinforcement learning
🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents
Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
Learning materials of Transformer, including my code, XMind, PDF and so on