dematsunaga

Daiki Matsunaga dematsunaga

3 followers · 2 following

https://sites.google.com/view/daikieddymatsunaga

Stars

50 stars written in Python

p-christ / Deep-Reinforcement-Learning-Algorithms-with-PyTorch

PyTorch implementations of deep reinforcement learning algorithms and environments

Python · 5,929 stars · 1,211 forks · Updated Jul 25, 2024

ikostrikov / pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python · 3,887 stars · 842 forks · Updated May 29, 2022

google-research / football

Check out the new game server:

Python · 3,570 stars · 1,350 forks · Updated Jun 17, 2025

google-deepmind / mctx

Monte Carlo tree search in JAX

Python · 2,603 stars · 207 forks · Updated Sep 2, 2025

openai / maddpg

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Python · 1,957 stars · 526 forks · Updated Apr 1, 2024

marlbenchmark / on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Python · 1,941 stars · 374 forks · Updated Jul 18, 2024

openai / multi-agent-emergence-environments

Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

Python · 1,787 stars · 322 forks · Updated Jul 30, 2024

opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python · 1,556 stars · 188 forks · Updated Mar 28, 2026

oxwhirl / smac

SMAC: The StarCraft Multi-Agent Challenge

Python · 1,334 stars · 235 forks · Updated Feb 18, 2024

huawei-noah / SMARTS

Scalable Multi-Agent RL Training School for Autonomous Driving

Python · 1,116 stars · 217 forks · Updated Jan 31, 2025

luchris429 / purejaxrl

Really Fast End-to-End Jax RL Implementations

Python · 1,035 stars · 84 forks · Updated Sep 9, 2024

google-deepmind / meltingpot

A suite of test scenarios for multi-agent reinforcement learning.

Python · 808 stars · 155 forks · Updated Mar 28, 2026

carlosferrazza / humanoid-bench

Python · 738 stars · 118 forks · Updated Sep 18, 2025

uoe-agents / epymarl

An extension of the PyMARL codebase that includes additional algorithms and environment support

Python · 702 stars · 190 forks · Updated Sep 24, 2024

marl-book / codebase

Official code repo for the MARL book (www.marl-book.com)

Python · 621 stars · 104 forks · Updated Mar 30, 2025

facebookresearch / BenchMARL

BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows to quickly compare different MARL algorithms, tasks, and models while being systematically ground…

Python · 594 stars · 120 forks · Updated Feb 7, 2026

TsinghuaC3I / MARTI

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

Python · 470 stars · 48 forks · Updated Feb 19, 2026

IBM / AMLSim

The AMLSim project is intended to provide a multi-agent based simulator that generates synthetic banking transaction data together with a set of known money laundering patterns - mainly for the pur…

Python · 348 stars · 107 forks · Updated Sep 17, 2025

twni2016 / pomdp-baselines

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Python · 344 stars · 48 forks · Updated Aug 22, 2024

mdeib / berkeley-deep-RL-pytorch-solutions

Pytorch solutions for UC Berkeley's cs285 assignments

Python · 155 stars · 24 forks · Updated Jan 21, 2022

google-research / dice_rl

Python · 112 stars · 16 forks · Updated Aug 6, 2024

oxwhirl / facmac

Python · 111 stars · 26 forks · Updated Oct 25, 2021

YiqinYang / ICQ

Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS 2021 Spotlight https://arxiv.org/abs/2106.03400)

Python · 74 stars · 9 forks · Updated Oct 18, 2022

ying-wen / malib_deprecated

A Multi-agent Learning Framework

Python · 62 stars · 18 forks · Updated May 10, 2021

011235813 / cm3

Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning

Python · 58 stars · 10 forks · Updated Jun 13, 2022

bic4907 / Overcooked-AI

Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method

Python · 47 stars · 8 forks · Updated Sep 11, 2024

jys5609 / MC-LAVE-RL

ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"

Python · 33 stars · 16 forks · Updated Nov 30, 2023

RDLLab / posggym

A collection of environments and reference agents for planning and reinforcement learning research in partially observable, multi-agent environments.

Python · 30 stars · 7 forks · Updated Jun 2, 2025

dematsunaga / alberdice

Official PyTorch implementation of AlberDICE

Python · 23 stars · 14 forks · Updated Dec 8, 2023

ggoggam / gdpo

Code for GFlowNet-DPO (Direct Preference Optimization) EMNLP 2024 Main

Python · 19 stars · 8 forks · Updated Feb 22, 2026
