Skip to content
View Zirui00's full-sized avatar

Block or report Zirui00

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official Codebase for "Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights"

Python 357 31 Updated Mar 20, 2026

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…

Python 4,105 329 Updated Mar 25, 2026

[ICML 2025] The Diffusion Duality

Python 209 27 Updated Mar 25, 2026

Scaling Beyond Masked Diffusion Language Models

Python 23 1 Updated Feb 18, 2026

The best ChatGPT that $100 can buy.

Python 50,326 6,611 Updated Mar 26, 2026
Python 46 3 Updated Mar 12, 2026
Python 154 6 Updated Feb 25, 2026
Python 1 Updated Oct 11, 2023

LLaDA2.0 is the diffusion language model series developed by InclusionAI team, Ant Group.

379 21 Updated Feb 12, 2026

SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model(1.7B, 4B, 8B, 30B)

Python 342 17 Updated Mar 16, 2026

[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.

Python 472 39 Updated Jan 28, 2026

dLLM: Simple Diffusion Language Modeling

Python 2,272 215 Updated Feb 27, 2026

GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts

Python 40 3 Updated Sep 30, 2025

A nix configuration for chameleon

Nix 1 Updated Sep 25, 2025

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,385 1,325 Updated Jul 9, 2025

Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"

Python 426 51 Updated Jan 26, 2026

[ICLR 2025] Diffusion Feedback Helps CLIP See Better

Python 301 15 Updated Jan 23, 2025

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1,440 244 Updated May 21, 2023

Concept Bottleneck Models, ICML 2020

Python 247 44 Updated Feb 24, 2023
Python 13 2 Updated Oct 7, 2024

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,588 81 Updated Mar 16, 2025

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 8,450 770 Updated May 31, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 20,242 2,302 Updated Mar 16, 2026

Foundational model for human-like, expressive TTS

Python 4,205 690 Updated Jul 30, 2024

从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)

Python 544 60 Updated Mar 23, 2025

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,933 381 Updated Mar 14, 2024

deep learning for image processing including classification and object-detection etc.

Python 26,167 8,255 Updated Jan 1, 2026

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 9,500 1,616 Updated Aug 9, 2024