wz0919

🍊

Stay hungry!

Zun Wang wz0919

🍊

Stay hungry!

PhD Student

36 followers · 21 following

UNC, Chapel Hill

Achievements

Stars

GengzeZhou / memex

A portable identity for Claude Code. Clone it anywhere, and Claude knows you.

Python 4 Updated Mar 24, 2026

Yui010206 / Ego2Web

[CVPR 2026] Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos

Python 8 1 Updated Mar 25, 2026

HL-hanlin / V-Co

Official implementation of V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising

Python 16 2 Updated Mar 18, 2026

daeunni / VisionCoach

Reinforcing Grounded Video Reasoning via Visual-Perception Prompting

Python 7 Updated Mar 17, 2026

ZichengDuan / LiveWorld

12 Updated Mar 9, 2026

cwchenwang / tttLRM

[CVPR 2026] tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction

Python 380 29 Updated Mar 2, 2026

zhangyuejoslin / Deer-3D

Python 7 1 Updated Nov 19, 2025

MrZihan / D3D-VLP

15 1 Updated Dec 16, 2025

Yui010206 / Adaptive-Visual-Imagination-Control

When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning

Python 13 1 Updated Feb 10, 2026

daeunni / StreamGaze

Code for "StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos [CVPR 2026]"

Python 20 1 Updated Feb 21, 2026

GengzeZhou / SAR

Official implementation of Rethinking Training Dynamics in Scale-wise Autoregressive Generation

Jupyter Notebook 4 Updated Dec 17, 2025

OpenGVLab / SID-VLN

Official implementation of: Learning Goal-Oriented Language-Guided Navigation with Self-Improving Demonstrations at Scale

Python 12 2 Updated Nov 29, 2025

HL-hanlin / Bifrost-1

Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025)

Python 46 3 Updated Nov 24, 2025

NIRVANALAN / STream3R

Dynamic 3D Foundation Model using Causal Transformer. [ICLR 2026]

Python 318 18 Updated Mar 9, 2026

InternRobotics / StreamVLN

[ICRA 2026] Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"

Python 435 31 Updated Nov 2, 2025

Ziyang412 / Video-RTS

Code for EMNLP25 paper "Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning"

Python 23 1 Updated Feb 18, 2026

Mars-tin / fast-spatial-mem

Self-reimplemented version of 4D-LRM.

65 Updated May 30, 2025

Yui010206 / MEXA

[EMNLP 2025 Findings] MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation

Python 16 2 Updated Aug 22, 2025

guandeh17 / Self-Forcing

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,248 253 Updated Sep 12, 2025

liyz15 / Aligning-Latent-Spaces-with-Flow-Priors

Python 42 3 Updated Jun 6, 2025

TianHongZXY / RLVR-Decomposed

[NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"

Python 163 10 Updated Mar 2, 2026

a1600012888 / LaCT

Code release for paper "Test-Time Training Done Right"

Python 422 25 Updated Jan 5, 2026

snap-research / ac3d

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

Python 156 11 Updated Sep 16, 2025

CHELSEA234 / M2F2_Det

🔥Deepfake + LLM (CVPR25 Oral)

Python 106 7 Updated Jul 11, 2025

CHELSEA234 / HiFi_IFDL

🔥Hierarchical Fine-Grained Image Forgery Detection and Localization (CVPR23 + IJCV24)

Python 295 25 Updated Jul 1, 2025

wz0919 / EPiC

Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance

Python 48 1 Updated Jun 2, 2025

Qinyu-Allen-Zhao / DiSA

Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation

Jupyter Notebook 143 1 Updated May 27, 2025

alibaba-damo-academy / Uni3C

Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation [Siggraph Asian 2025]

Python 528 28 Updated Sep 21, 2025

KlingAIResearch / ReCamMaster

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,768 89 Updated Nov 28, 2025

daeunni / Video-Skill-CoT

Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Finding]"

Python 16 Updated Aug 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zun Wang wz0919

Achievements

Achievements

Block or report wz0919

Stars

GengzeZhou / memex

Yui010206 / Ego2Web

HL-hanlin / V-Co

daeunni / VisionCoach

ZichengDuan / LiveWorld

cwchenwang / tttLRM

zhangyuejoslin / Deer-3D

MrZihan / D3D-VLP

Yui010206 / Adaptive-Visual-Imagination-Control

daeunni / StreamGaze

GengzeZhou / SAR

OpenGVLab / SID-VLN

HL-hanlin / Bifrost-1

NIRVANALAN / STream3R

InternRobotics / StreamVLN

Ziyang412 / Video-RTS

Mars-tin / fast-spatial-mem

Yui010206 / MEXA

guandeh17 / Self-Forcing

liyz15 / Aligning-Latent-Spaces-with-Flow-Priors

TianHongZXY / RLVR-Decomposed

a1600012888 / LaCT

snap-research / ac3d

CHELSEA234 / M2F2_Det

CHELSEA234 / HiFi_IFDL

wz0919 / EPiC

Qinyu-Allen-Zhao / DiSA

alibaba-damo-academy / Uni3C

KlingAIResearch / ReCamMaster

daeunni / Video-Skill-CoT