Tianhao Wu · Chuanxia Zheng · Frank Guan · Andrea Vedaldi · Tat-Jen Cham
Paper | Project Page | Pretrain Weight | Demo
This code has been tested on Ubuntu 22.04 with torch 2.4.0 and CUDA 11.8. We sincerely thank TRELLIS for providing the environment setup, which we follow exactly in this work.
Create a new conda environment named amodal3r and install the dependencies:
```shell
. ./setup.sh --new-env --basic --xformers --flash-attn --diffoctreerast --spconv --mipgaussian --kaolin --nvdiffrast
```

The detailed usage of `setup.sh` can be found by running `. ./setup.sh --help`:
```
Usage: setup.sh [OPTIONS]
Options:
    -h, --help          Display this help message
    --new-env           Create a new conda environment
    --basic             Install basic dependencies
    --train             Install training dependencies
    --xformers          Install xformers
    --flash-attn        Install flash-attn
    --diffoctreerast    Install diffoctreerast
    --vox2seq           Install vox2seq
    --spconv            Install spconv
    --mipgaussian       Install mip-splatting
    --kaolin            Install kaolin
    --nvdiffrast        Install nvdiffrast
    --demo              Install all dependencies for demo
```

We have provided our pretrained weights of both the sparse structure module and the SLAT module on HuggingFace.
We use three datasets for training: ABO, 3D-FUTURE, and HSSD. To obtain the training data, please also refer to TRELLIS; thanks to them for the amazing work!
When the data is ready, combine the datasets and place them under ./dataset/abo_3dfuture_hssd. If you want to train on a single dataset, feel free to modify the dataloader. Training requires the rendered images, the sparse structure, and the SLAT representations.
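Before launching training, it can help to verify that every object folder in the combined dataset is complete. The sub-folder names below are purely illustrative assumptions (match them to whatever your TRELLIS preprocessing actually emits); a minimal sketch:

```python
from pathlib import Path

def check_dataset(root="./dataset/abo_3dfuture_hssd",
                  required=("renders", "ss_latents", "slat_latents")):
    """Report objects missing any of the required sub-folders.

    The sub-folder names are assumptions for illustration only;
    adjust them to the layout your preprocessing pipeline produces.
    """
    root = Path(root)
    missing = {}
    for obj in sorted(p for p in root.iterdir() if p.is_dir()):
        absent = [r for r in required if not (obj / r).exists()]
        if absent:
            missing[obj.name] = absent
    return missing
```

Objects listed in the returned dict can then be re-rendered or excluded before training.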
To train your own model, you can start from either our weights or the original TRELLIS weights. Please download the weights and put them under ./ckpts.
To train the sparse structure module with our designed mask-weighted cross-attention and occlusion-aware attention, please run:
```shell
. ./train_ss.sh
```

To train the SLAT module, please run:
```shell
. ./train_slat.sh
```

The output folder where the model will be saved can be changed by modifying the `--vis` parameter in the script.
We have prepared examples under the ./example folder. Both single and multiple images are supported as input. For inference, please run:
```shell
python ./inference.py
```

If you want to try your own data, you should prepare: 1) the original image, and 2) a mask image in which the background is white (255,255,255), the visible area is gray (188,188,188), and the occluded area is black (0,0,0).
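The tri-level mask described above can be assembled from two binary masks (e.g. one for the visible region and one for the occluded region). A minimal numpy sketch; the function name and the overlap rule (visible wins) are our own assumptions, not part of the repo:

```python
import numpy as np

# Grayscale codes expected for the mask image (from the README):
BACKGROUND, VISIBLE, OCCLUDED = 255, 188, 0

def build_amodal_mask(visible, occluded):
    """Combine two boolean HxW arrays into the tri-level mask.

    `visible`  - True where the object is directly seen.
    `occluded` - True where the object is hidden by an occluder.
    Pixels in neither set are treated as background.
    """
    mask = np.full(visible.shape, BACKGROUND, dtype=np.uint8)
    mask[occluded] = OCCLUDED
    mask[visible] = VISIBLE  # visible takes precedence if inputs overlap
    return mask
```

If the inference script expects an RGB image, the single-channel result can be stacked to three channels (e.g. with `np.stack([mask] * 3, axis=-1)`) before saving.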
You can use Segment Anything to obtain the corresponding mask; this is how we produced the in-the-wild examples in the paper and in our demo.
We render Toys4K and GSO in exactly the same way as the training data. To obtain the evaluation dataset, please modify the directory in 3d_mask_render.py and run:
```shell
python ./3d_mask_render.py
```

It will create a renders_mask folder with the 3D-consistent masks in it.
