Skip to content
View xiaomengy's full-sized avatar
  • Google DeepMind

Block or report xiaomengy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Minimal and scalable research codebase in JAX, designed for rapid iteration on frontier research in LLM and other autoregressive models.

Python 53 16 Updated Jan 29, 2026

Gemini 2025 ICPC World Finals Code Submissions

C++ 169 13 Updated Sep 17, 2025

[ICLR 2026] LLM/VLM gaming agents and model evaluation through games.

Python 858 92 Updated Nov 16, 2025

Command and Conquer: Red Alert

C++ 6,575 1,311 Updated Feb 27, 2025

The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Process" (arxiv 2407.20311) and "Physics of Language Models Part 2…

Python 84 8 Updated Jan 12, 2025

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,330 112 Updated Jan 16, 2026

Benchmarking physical understanding in generative video models

Python 238 24 Updated Feb 2, 2026

A 15TB Collection of Physics Simulation Datasets

Jupyter Notebook 1,881 175 Updated Dec 10, 2025
Python 82 11 Updated Jul 10, 2024

Can Language Models Solve Olympiad Programming?

Python 123 13 Updated Jan 14, 2025

BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO

Python 66 5 Updated Oct 8, 2024

LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.

Python 670 28 Updated Aug 22, 2025

Omniverse Lunar Robotics Simulator

Python 168 33 Updated Jan 29, 2026

Reference implementation of Megalodon 7B model

Cuda 529 53 Updated May 17, 2025

(Crafter + NetHack) in JAX. ICML 2024 Spotlight.

Python 369 44 Updated Jul 7, 2025

A repo to evaluate various LLM's chess playing abilities.

Python 87 20 Updated Apr 12, 2024

The NetHack Learning Environment

C 977 124 Updated May 6, 2024

Reverb is an efficient and easy-to-use data storage and transport system designed for machine learning research

C++ 763 108 Updated Feb 5, 2026

Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"

C++ 52 9 Updated Nov 14, 2023

Command & Conquer: Remastered Collection

C++ 21,287 5,437 Updated Jan 16, 2025

CUDA Accelerated Robot Library

Python 1,349 227 Updated Oct 23, 2025

Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)

Python 278 41 Updated Jan 23, 2026

Google Coding Competitions problem archive

HTML 1,294 340 Updated Jul 12, 2023

A data-driven, fast driving simulator for multi-agent coordination under partial observability.

Python 294 32 Updated Jun 18, 2024

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 3,292 435 Updated Feb 5, 2026

Monte Carlo tree search in JAX

Python 2,587 207 Updated Sep 2, 2025

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 8,217 1,343 Updated Jul 23, 2024

A library for distributed ML training with PyTorch

C++ 366 22 Updated Dec 12, 2022
Next