✨ MMComposition: Revisiting the Compositionality of Pre-trained Vision-Language Models

What is MMComposition?

MMComposition aims to provide a comprehensive assessment of compositionality for Vision-Language Models (VLMs) -- the ability to understand and produce novel combinations of known visual and textual components. This research endeavor is designed to help researchers and practitioners better understand the capabilities, limitations, and critical areas for model improvement in VLM. MMComposition comprises 13 complex vision-language composition tasks, including:

Attribute Perception
Object Perception
Counting Perception
Relation Perception
Difference Spotting
Text Rendering
Visual Similarity
Attribute Reasoning
Object Reasoning
Counting Reasoning
Relation Reasoning
Object Interaction
Compositional Probing

Getting Started

🏆 Leaderboard

📉 Statistics

✏️ Citation

@article{hua2024mmcomposition,
  title={MMCOMPOSITION: Revisiting the Compositionality of Pre-trained Vision-Language Models},
  author={Hua, Hang and Tang, Yunlong and Zeng, Ziyun and Cao, Liangliang and Yang, Zhengyuan and He, Hangfeng and Xu, Chenliang and Luo, Jiebo},
  journal={arXiv preprint arXiv:2410.09733},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
README.md		README.md
evaluation.py		evaluation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

✨ MMComposition: Revisiting the Compositionality of Pre-trained Vision-Language Models

What is MMComposition?

Getting Started

🏆 Leaderboard

📉 Statistics

✏️ Citation

Under construction...

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

✨ MMComposition: Revisiting the Compositionality of Pre-trained Vision-Language Models

What is MMComposition?

Getting Started

🏆 Leaderboard

📉 Statistics

✏️ Citation

Under construction...

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages