Skip to content
View wz0919's full-sized avatar
🍊
Stay hungry!
🍊
Stay hungry!
  • UNC, Chapel Hill

Block or report wz0919

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A portable identity for Claude Code. Clone it anywhere, and Claude knows you.

Python 4 Updated Mar 24, 2026

[CVPR 2026] Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos

Python 8 1 Updated Mar 25, 2026

Official implementation of V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising

Python 16 2 Updated Mar 18, 2026

Reinforcing Grounded Video Reasoning via Visual-Perception Prompting

Python 7 Updated Mar 17, 2026

[CVPR 2026] tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction

Python 380 29 Updated Mar 2, 2026
Python 7 1 Updated Nov 19, 2025
15 1 Updated Dec 16, 2025

When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning

Python 13 1 Updated Feb 10, 2026

Code for "StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos [CVPR 2026]"

Python 20 1 Updated Feb 21, 2026

Official implementation of Rethinking Training Dynamics in Scale-wise Autoregressive Generation

Jupyter Notebook 4 Updated Dec 17, 2025

Official implementation of: Learning Goal-Oriented Language-Guided Navigation with Self-Improving Demonstrations at Scale

Python 12 2 Updated Nov 29, 2025

Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025)

Python 46 3 Updated Nov 24, 2025

Dynamic 3D Foundation Model using Causal Transformer. [ICLR 2026]

Python 318 18 Updated Mar 9, 2026

[ICRA 2026] Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"

Python 435 31 Updated Nov 2, 2025

Code for EMNLP25 paper "Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning"

Python 23 1 Updated Feb 18, 2026

Self-reimplemented version of 4D-LRM.

65 Updated May 30, 2025

[EMNLP 2025 Findings] MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation

Python 16 2 Updated Aug 22, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,248 253 Updated Sep 12, 2025

[NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"

Python 163 10 Updated Mar 2, 2026

Code release for paper "Test-Time Training Done Right"

Python 422 25 Updated Jan 5, 2026

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

Python 156 11 Updated Sep 16, 2025

🔥Deepfake + LLM (CVPR25 Oral)

Python 106 7 Updated Jul 11, 2025

🔥Hierarchical Fine-Grained Image Forgery Detection and Localization (CVPR23 + IJCV24)

Python 295 25 Updated Jul 1, 2025

Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance

Python 48 1 Updated Jun 2, 2025

Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation

Jupyter Notebook 143 1 Updated May 27, 2025

Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation [Siggraph Asian 2025]

Python 528 28 Updated Sep 21, 2025

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,768 89 Updated Nov 28, 2025

Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Finding]"

Python 16 Updated Aug 27, 2025
Next