[CVPR 2025 Highlight] Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving

Python 1,382 130 Updated Dec 8, 2025

Devkit and documentation for the NVIDIA Physical AI Autonomous Vehicles Dataset

Python 320 37 Updated Mar 25, 2026

[CVPR 2026 Highlight] Implementation of "IntrinsicWeather: Controllable Weather Editing in Intrinsic Space".

Python 10 2 Updated Apr 16, 2026

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 17,551 1,590 Updated Sep 5, 2024

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 9,389 1,411 Updated May 3, 2026

More relighting!

Python 8,418 524 Updated Feb 20, 2025

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 3,325 251 Updated Mar 23, 2026
Python 110 9 Updated Mar 30, 2026

[CVPR 2026] Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models.

Python 98 1 Updated Apr 9, 2026

[ICLR 2026] WorldSplat: Gaussian-Centric Feed-Forward 4D Scene Generation for Autonomous Driving

141 7 Updated Mar 31, 2026

PyTorch Implementation of "PICS: Pairwise Image Compositing with Spatial Interactions", ICLR 2026

Python 15 Updated Mar 2, 2026

DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images

Python 499 54 Updated Jan 15, 2026

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 13,024 1,445 Updated Mar 3, 2026

The official implementation of the paper “VGGT4D: Mining Motion Cues in Visual Geometry Transformers for 4D Scene Reconstruction.”

Python 248 13 Updated Dec 2, 2025

[CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception

Python 161 6 Updated Jan 10, 2026

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 12,800 1,220 Updated Apr 8, 2026

[CVPR 2023] LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion

Python 283 16 Updated Jun 4, 2023

[NeurIPS 2025] Official code of Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting

Python 149 5 Updated Dec 3, 2025

An official implementation of the Anchor DETR.

Python 362 43 Updated Jul 29, 2022

⭐⭐⭐FightingCV Paper Reading, which helps you understand the most advanced research work in an easier way 🍀 🍀 🍀

Shell 821 89 Updated Apr 20, 2023

[CVPR 2022 Oral] Official implementation of DN-DETR

Python 605 72 Updated Dec 20, 2023

[ECCV`24&ICLR`25] CityGaussian Series for High-quality Large-Scale Scene Reconstruction with Gaussians

Jupyter Notebook 1,156 99 Updated Feb 7, 2026

Large Driving Models

293 12 Updated Jan 27, 2025

[AAAI 2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection

Jupyter Notebook 196 19 Updated Dec 13, 2023
Jupyter Notebook 639 81 Updated Jun 25, 2024

An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community to help implement this model!

Python 59 2 Updated Apr 13, 2026

Object tracking metrics in JavaScript (MOTA, IDF1, ...)

JavaScript 4 1 Updated Oct 26, 2023

A suite of image and video neural tokenizers

Jupyter Notebook 1,723 88 Updated Feb 11, 2025