WISMIR3: A Multi-Modal Dataset to Challenge Text-Image Retrieval Approaches
Published in Proceedings of the 3rd Workshop on Advances in Language and Vision Research (ALVR), 2024
Recommended citation: Florian Schneider, Chris Biemann, "WISMIR3: A Multi-Modal Dataset to Challenge Text-Image Retrieval Approaches." In Proceedings of the 3rd Workshop on Advances in Language and Vision Research (ALVR), 2024.
Download Paper