M5 – A Diverse Benchmark to Assess the Performance of Large Multimodal Models Across Multilingual and Multicultural Vision-Language Tasks
Published in In the proceedings of Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Recommended citation: Florian Schneider, Sunayana Sitaram, "M5 – A Diverse Benchmark to Assess the Performance of Large Multimodal Models Across Multilingual and Multicultural Vision-Language Tasks." In the proceedings of Findings of the Association for Computational Linguistics: EMNLP 2024, 2024.
Download Paper