xiaomengy

Xiaomeng Yang xiaomengy

Artificial Intelligence, Reinforcement Learning

115 followers · 86 following

Google DeepMind

Stars

google-deepmind / simply

Minimal and scalable research codebase in JAX, designed for rapid iteration on frontier research in LLM and other autoregressive models.

Python 53 16 Updated Jan 29, 2026

google-deepmind / gemini_icpc2025

Gemini 2025 ICPC World Finals Code Submissions

C++ 169 13 Updated Sep 17, 2025

lmgame-org / GamingAgent

[ICLR 2026] LLM/VLM gaming agents and model evaluation through games.

Python 858 92 Updated Nov 16, 2025

electronicarts / CnC_Red_Alert

Command and Conquer: Red Alert

C++ 6,575 1,311 Updated Feb 27, 2025

facebookresearch / iGSM

The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Process" (arxiv 2407.20311) and "Physics of Language Models Part 2…

Python 84 8 Updated Jan 12, 2025

open-thought / reasoning-gym

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,330 112 Updated Jan 16, 2026

google-deepmind / physics-IQ-benchmark

Benchmarking physical understanding in generative video models

Python 238 24 Updated Feb 2, 2026

PolymathicAI / the_well

A 15TB Collection of Physics Simulation Datasets

Jupyter Notebook 1,881 175 Updated Dec 10, 2025

AIMO-CMU-MATH / CMU_MATH-AIMO

Python 82 11 Updated Jul 10, 2024

princeton-nlp / USACO

Can Language Models Solve Olympiad Programming?

Python 123 13 Updated Jan 14, 2025

BricksRL / bricksrl

BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO

Python 66 5 Updated Oct 8, 2024

meta-pytorch / LeanRL

LeanRL is a fork of CleanRL, where selected PyTorch scripts are optimized for performance using compile and cudagraphs.

Python 670 28 Updated Aug 22, 2025

OmniLRS / OmniLRS

Omniverse Lunar Robotics Simulator

Python 168 33 Updated Jan 29, 2026

XuezheMax / megalodon

Reference implementation of Megalodon 7B model

Cuda 529 53 Updated May 17, 2025

MichaelTMatthews / Craftax

(Crafter + NetHack) in JAX. ICML 2024 Spotlight.

Python 369 44 Updated Jul 7, 2025

google-deepmind / alphageometry

Python 4,766 566 Updated Jan 13, 2026

adamkarvonen / chess_gpt_eval

A repo to evaluate various LLMs' chess-playing abilities.

Python 87 20 Updated Apr 12, 2024

facebookresearch / nle

The NetHack Learning Environment

C 977 124 Updated May 6, 2024

google-deepmind / reverb

Reverb is an efficient and easy-to-use data storage and transport system designed for machine learning research

C++ 763 108 Updated Feb 5, 2026

facebookresearch / jps

Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"

C++ 52 9 Updated Nov 14, 2023

electronicarts / CnC_Remastered_Collection

Command & Conquer: Remastered Collection

C++ 21,287 5,437 Updated Jan 16, 2025

NVlabs / curobo

CUDA Accelerated Robot Library

Python 1,349 227 Updated Oct 23, 2025

sotopia-lab / sotopia

Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)

Python 278 41 Updated Jan 23, 2026

google / coding-competitions-archive

Google Coding Competitions problem archive

HTML 1,294 340 Updated Jul 12, 2023

google-deepmind / alphastar

Python 537 72 Updated Sep 8, 2022

facebookresearch / nocturne

A data-driven, fast driving simulator for multi-agent coordination under partial observability.

Python 294 32 Updated Jun 18, 2024

pytorch / rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 3,292 435 Updated Feb 5, 2026

google-deepmind / mctx

Monte Carlo tree search in JAX

Python 2,587 207 Updated Sep 2, 2025

facebookresearch / mae

PyTorch implementation of MAE https://arxiv.org/abs/2111.06377

Python 8,217 1,343 Updated Jul 23, 2024

facebookresearch / moolib

A library for distributed ML training with PyTorch

C++ 366 22 Updated Dec 12, 2022
