This is an official implementation of our WACV 2024 paper, "Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation".
We propose a unified approach for online and near-online video panoptic segmentation (VPS). The meta architecture of the proposed Video-kMaX consists of two components: a within-clip segmenter (for clip-level segmentation) and a cross-clip associater (for association beyond clips). We instantiate the segmenter with clip-kMaX (clip k-means mask transformer) and the associater with HiLA-MB (Hierarchical Location-Aware Memory Buffer). This general formulation includes the online scenario as a special case by adopting a clip length of one.
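The following is a minimal, illustrative sketch of this segment-then-associate control flow. All names here (`ClipKMaX`, `HiLAMemoryBuffer`, `video_kmax_inference`) are hypothetical placeholders, not this repository's actual API; only the structure of the computation is the point.

```python
# Illustrative sketch of the Video-kMaX meta architecture described above.
# Class and function names are hypothetical, not this repo's actual classes.

class ClipKMaX:
    """Hypothetical stand-in for the within-clip segmenter (clip-kMaX)."""
    def __call__(self, clip):
        # Return per-frame panoptic predictions with clip-level instance IDs.
        return [{"frame": f, "masks": None, "ids": []} for f in clip]

class HiLAMemoryBuffer:
    """Hypothetical stand-in for the cross-clip associater (HiLA-MB)."""
    def associate(self, clip_preds):
        # Match clip-level IDs against buffered instances so that IDs stay
        # consistent across clip boundaries.
        return clip_preds

    def update(self, clip_preds):
        # Refresh the memory buffer with the latest associated predictions.
        pass

def video_kmax_inference(frames, clip_len=2):
    """Segment each clip, then associate instance IDs across clips.

    Setting clip_len=1 recovers the online scenario mentioned above.
    """
    segmenter, associater = ClipKMaX(), HiLAMemoryBuffer()
    results = []
    for start in range(0, len(frames), clip_len):
        clip = frames[start:start + clip_len]
        clip_preds = segmenter(clip)                   # within-clip segmentation
        clip_preds = associater.associate(clip_preds)  # cross-clip association
        associater.update(clip_preds)
        results.extend(clip_preds)
    return results
```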
The code base is verified with pytorch==1.12.1, torchvision==0.13.1, cudatoolkit==11.3, and detectron2==0.6. Please install the remaining libraries via `pip3 install -r requirements.txt`.
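As a quick sanity check (a sketch, not part of this repo), you can verify that the installed versions match the ones listed above:

```python
# Minimal check that the environment matches the verified versions above.
import torch
import torchvision
import detectron2

print("torch:", torch.__version__)              # expected 1.12.1
print("torchvision:", torchvision.__version__)  # expected 0.13.1
print("CUDA toolkit:", torch.version.cuda)      # expected 11.3
print("detectron2:", detectron2.__version__)    # expected 0.6
```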
Please refer to Mask2Former's script for data preparation.
Please refer to VIPSeg for downloading and preparing the VIPSeg dataset.
Near-online (pretrained model: COCO pseudo-videos)
| Backbone | SQ | AQ | STQ | VPQ | ckpt |
|---|---|---|---|---|---|
| ResNet-50 | 45.1 | 35.3 | 39.9 | 38.2 | download |
| ConvNeXt-Large | 61.4 | 43.5 | 51.7 | 51.9 | download |
Online (pretrained model: COCO)
| Backbone | SQ | AQ | STQ | VPQ | ckpt |
|---|---|---|---|---|---|
| ResNet-50 | 46.3 | 32.4 | 38.7 | 36.8 | download |
| ConvNeXt-Large | 60.7 | 40.2 | 49.4 | 49.4 | download |
If you find this code helpful in your research or wish to refer to the baseline results, please use the following BibTeX entry.
@misc{shin2023videokmax,
  title={Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation},
  author={Inkyu Shin and Dahun Kim and Qihang Yu and Jun Xie and Hong-Seok Kim and Bradley Green and In So Kweon and Kuk-Jin Yoon and Liang-Chieh Chen},
  year={2023},
  eprint={2304.04694},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}
