dematsunaga

50 stars written in Python

PyTorch implementations of deep reinforcement learning algorithms and environments

Python 5,929 1,211 Updated Jul 25, 2024

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,887 842 Updated May 29, 2022

Check out the new game server:

Python 3,570 1,350 Updated Jun 17, 2025

Monte Carlo tree search in JAX

Python 2,603 207 Updated Sep 2, 2025

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Python 1,957 526 Updated Apr 1, 2024

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,941 374 Updated Jul 18, 2024

Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

Python 1,787 322 Updated Jul 30, 2024

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,556 188 Updated Mar 28, 2026

SMAC: The StarCraft Multi-Agent Challenge

Python 1,334 235 Updated Feb 18, 2024

Scalable Multi-Agent RL Training School for Autonomous Driving

Python 1,116 217 Updated Jan 31, 2025

Really Fast End-to-End Jax RL Implementations

Python 1,035 84 Updated Sep 9, 2024

A suite of test scenarios for multi-agent reinforcement learning.

Python 808 155 Updated Mar 28, 2026

An extension of the PyMARL codebase that includes additional algorithms and environment support

Python 702 190 Updated Sep 24, 2024

Official code repo for the MARL book (www.marl-book.com)

Python 621 104 Updated Mar 30, 2025

BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows users to quickly compare different MARL algorithms, tasks, and models while being systematically ground…

Python 594 120 Updated Feb 7, 2026

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

Python 470 48 Updated Feb 19, 2026

The AMLSim project is intended to provide a multi-agent based simulator that generates synthetic banking transaction data together with a set of known money laundering patterns - mainly for the pur…

Python 348 107 Updated Sep 17, 2025

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Python 344 48 Updated Aug 22, 2024

PyTorch solutions for UC Berkeley's cs285 assignments

Python 155 24 Updated Jan 21, 2022

Python 112 16 Updated Aug 6, 2024

Python 111 26 Updated Oct 25, 2021

Code accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS 2021 Spotlight, https://arxiv.org/abs/2106.03400)

Python 74 9 Updated Oct 18, 2022

A Multi-agent Learning Framework

Python 62 18 Updated May 10, 2021

Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning

Python 58 10 Updated Jun 13, 2022

Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method

Python 47 8 Updated Sep 11, 2024

ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"

Python 33 16 Updated Nov 30, 2023

A collection of environments and reference agents for planning and reinforcement learning research in partially observable, multi-agent environments.

Python 30 7 Updated Jun 2, 2025

Official PyTorch implementation of AlberDICE

Python 23 14 Updated Dec 8, 2023

Code for GFlowNet-DPO (Direct Preference Optimization) EMNLP 2024 Main

Python 19 8 Updated Feb 22, 2026