M5 – A Diverse Benchmark to Assess the Performance of Large Multimodal Models Across Multilingual and Multicultural Vision-Language Tasks

Published in In the proceedings of Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Access paper here

Recommended citation: Florian Schneider, Sunayana Sitaram, "M5 – A Diverse Benchmark to Assess the Performance of Large Multimodal Models Across Multilingual and Multicultural Vision-Language Tasks." In the proceedings of Findings of the Association for Computational Linguistics: EMNLP 2024, 2024.
Download Paper