WISMIR3: A Multi-Modal Dataset to Challenge Text-Image Retrieval Approaches

Published in In the proceedings of Proceedings of the 3rd Workshop on Advances in Language and Vision Research (ALVR), 2024

Access paper here

Recommended citation: Florian Schneider, Chris Biemann, "WISMIR3: A Multi-Modal Dataset to Challenge Text-Image Retrieval Approaches." In the proceedings of Proceedings of the 3rd Workshop on Advances in Language and Vision Research (ALVR), 2024.
Download Paper