Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 17,551 1,590 Updated Sep 5, 2024

facebookresearch / sam3

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 9,389 1,411 Updated May 3, 2026

lllyasviel / IC-Light

More relighting!

Python 8,418 524 Updated Feb 20, 2025

song-wensong / insert-anything

Python 561 30 Updated Dec 5, 2025

facebookresearch / map-anything

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 3,325 251 Updated Mar 23, 2026

TuojingAI / ReconDrive

Python 110 9 Updated Mar 30, 2026

paulpanwang / Diff4Splat

[CVPR 2026] Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models.

Python 98 1 Updated Apr 9, 2026

wm-research / worldsplat

[ICLR 2026]WorldSplat: Gaussian-Centric Feed-Forward 4D Scene Generation for Autonomous Driving

141 7 Updated Mar 31, 2026

RyanHangZhou / PICS

PyTorch Implementation of "PICS: Pairwise Image Compositing with Spatial Interactions", ICLR 2026

Python 15 Updated Mar 2, 2026

xiaomi-research / dggt

DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images

Python 499 54 Updated Jan 15, 2026

facebookresearch / vggt

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 13,024 1,445 Updated Mar 3, 2026

3DAgentWorld / VGGT4D

The official implementation of the paper “VGGT4D: Mining Motion Cues in Visual Geometry Transformers for 4D Scene Reconstruction.”

Python 248 13 Updated Dec 2, 2025

MoonshotAI / Attention-Residuals

3,246 176 Updated Mar 17, 2026

xiaomoguhz / DeCLIP

[CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception

Python 161 6 Updated Jan 10, 2026

facebookresearch / dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 12,800 1,220 Updated Apr 8, 2026

PJLab-ADG / LoGoNet

[CVPR2023] LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion

Python 283 16 Updated Jun 4, 2023

BigCiLeng / bilateral-driving

[NeurIPS 2025] Official code of Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting

Python 149 5 Updated Dec 3, 2025

megvii-research / AnchorDETR

An official implementation of the Anchor DETR.

Python 362 43 Updated Jul 29, 2022

xmu-xiaoma666 / FightingCV-Paper-Reading

⭐⭐⭐FightingCV Paper Reading, which helps you understand the most advanced research work in an easier way 🍀 🍀 🍀

Shell 821 89 Updated Apr 20, 2023

IDEA-Research / DN-DETR

[CVPR 2022 Oral] Official implementation of DN-DETR

Python 605 72 Updated Dec 20, 2023

Linketic / CityGaussian

[ECCV`24&ICLR`25] CityGaussian Series for High-quality Large-Scale Scene Reconstruction with Gaussians

Jupyter Notebook 1,156 99 Updated Feb 7, 2026

wzzheng / LDM

Large Driving Models

293 12 Updated Jan 27, 2025

megvii-research / Far3D

[AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection

Jupyter Notebook 196 19 Updated Dec 13, 2023

HorizonRobotics / Sparse4D

Jupyter Notebook 639 81 Updated Jun 25, 2024

kyegomez / movie-gen

An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community to help implement this model!

Python 59 2 Updated Apr 13, 2026

piercus / object-tracking-measure

Object tracking measure in javascript (MOTA, IDF1 ...)

JavaScript 4 1 Updated Oct 26, 2023

NVIDIA / Cosmos-Tokenizer

A suite of image and video neural tokenizers

Jupyter Notebook 1,723 88 Updated Feb 11, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sophie SophieZhou

Block or report SophieZhou

Stars

hustvl / DiffusionDrive

NVlabs / physical_ai_av

YixinZhu042 / IntrinsicWeather

IDEA-Research / Grounded-Segment-Anything