Skip to content
View pseudo-rnd-thoughts's full-sized avatar

Organizations

@ray-project @anyscale @Farama-Foundation

Block or report pseudo-rnd-thoughts

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

aqt: Another (unofficial) Qt CLI Installer on multi-platforms

Python 1,150 105 Updated Dec 8, 2025
Python 602 49 Updated Dec 10, 2025

[NeurIPS 2025] The official implementation of "Towards Robust Zero-Shot Reinforcement Learning"

Python 9 4 Updated Dec 2, 2025
Python 724 60 Updated Dec 9, 2025

LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.

Python 657 27 Updated Aug 22, 2025

GPU-Accelerated Multi-Agent Reinforcement Learning for High-Frequency Trading

Python 28 6 Updated Oct 22, 2025

Procgen2: A community maintained fork of procgen

C++ 12 7 Updated Aug 25, 2022

A framework for creating rich, 3D, Minecraft-like single and multi-agent environments for AI research. (Accepted at ICML 2025).

C++ 147 11 Updated Nov 28, 2025
Jupyter Notebook 6 Updated Oct 31, 2021

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

1,414 55 Updated Dec 10, 2025

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 30,820 3,763 Updated Dec 10, 2025

Causal Responsibility EXplanations for Image Classifiers and Tabular Data

Python 40 8 Updated Oct 31, 2025
Jupyter Notebook 3 Updated Jul 20, 2025

Reinforcement learning library from Embark Studios

Python 25 Updated Sep 25, 2024

Gym-TORAX: A software for integrating reinforcement learning with plasma control simulators

Python 11 1 Updated Oct 9, 2025

RL gym for vision language models written from scratch

Python 130 13 Updated Oct 30, 2025

ray & RLlib tools for unified code across different repositories. Experiments with dynamic hyperparameters

Python 6 Updated Dec 10, 2025

Ongoing Lean formalisation of the proof of Fermat's Last Theorem

Lean 766 102 Updated Dec 9, 2025

Isaac Lab API, powered by MuJoCo-Warp, for RL and robotics research.

Python 1,145 135 Updated Dec 9, 2025

[ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data

Python 99 17 Updated Aug 9, 2024

Extension of Gymnasium for active perception tasks.

Python 5 Updated Nov 27, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,377 1,962 Updated Nov 1, 2025

Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX, and NVIDIA Warp) with support for Gymnasium/Gym, NVIDIA Isaac Lab, Brax and other environments

Python 932 117 Updated Dec 8, 2025

[ :suspect:👾 ] ➡️ 💾 ➡️ { 🎮🕹️ } Extra Stable-Baselines3 buffer classes. Reducing RL memory usage drastically with minimal overhead.

Python 22 Updated Dec 9, 2025

A benchmark for offline goal-conditioned RL and offline RL

Python 292 63 Updated Oct 21, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,245 6,987 Updated Dec 10, 2025

Settlers of Catan Bot Simulator and Strong AI Player

Python 370 95 Updated Dec 2, 2025

Unified framework for robot learning built on NVIDIA Isaac Sim

Python 5,699 2,766 Updated Dec 10, 2025
Next