This paper presents ViFi - a Video Finding System for Video Browser Showdown 2025. Our retrieval system is mainly based on the SigLIP, a most recent and robust visual-textual embedding model, and significantly outperforms CLIP in text-to-image retrieval on the MSCOCO data. Instead of using...
This paper presents ViFi - a Vi deo Fi nding System for Video Browser Showdown 2025. Our retrieval system is mainly based on the SigLIP, a most recent and robust visual-textual embedding model, and significantly outperforms CLIP in text-to-image retrieval on the MSCOCO data. Instead of ...