Stars
Official Codebase for "Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights"
ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…
Scaling Beyond Masked Diffusion Language Models
LLaDA2.0 is the diffusion language model series developed by the InclusionAI team at Ant Group.
SDAR (Synergy of Diffusion and AutoRegression), a family of large diffusion language models (1.7B, 4B, 8B, 30B)
[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.
GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts
kleinercubs / nix-config
Forked from jiezhuzzz/cc-config. A nix configuration for chameleon.
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
[ICLR 2025] Diffusion Feedback Helps CLIP See Better
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Concept Bottleneck Models, ICML 2020
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
Foundational model for human-like, expressive TTS
Building a MiniLLM from scratch (pretrain + SFT + DPO, work in progress)
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Deep learning for image processing, including classification, object detection, etc.
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information