Official repository for the paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Updated Aug 11, 2024 · Python
Dual Cross Encoder for Dense Retrieval
Efficient interpolation-based ranking on CPUs
Undergraduate thesis-based project
Understand and build embedding models, focusing on word and sentence embeddings and dual encoder architectures. Learn to train embedding models using contrastive loss and to implement them in semantic search and RAG systems.
Dual-Encoder in TensorFlow
This repository provides the code for "Improving Query-by-Vocal Imitation with Contrastive Learning and Audio Pretraining", presented at DCASE 2024. The paper addresses the challenge of audio retrieval using vocal imitations as queries, proposing a dual encoder architecture that leverages pretrained CNNs and an adapted NT-Xent loss for fine-tuning.
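Several of the projects listed above train a dual encoder with a contrastive objective. As a rough illustration only, the sketch below computes a generic symmetric in-batch contrastive loss (an InfoNCE / NT-Xent-style objective) over paired query and item embeddings in plain Python. It is not taken from any of the listed repositories and is not the adapted NT-Xent loss of the DCASE 2024 paper; the function names and the `temperature` default are assumptions made for this example.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def log_softmax_at(logits, index):
    """Return log-softmax(logits)[index], using the max trick for stability."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return logits[index] - log_sum


def in_batch_contrastive_loss(query_embs, item_embs, temperature=0.1):
    """Symmetric in-batch contrastive loss for a dual encoder (illustrative).

    query_embs[i] and item_embs[i] are assumed to be a matching pair; every
    other item in the batch serves as a negative for query i, and vice versa.
    """
    if not query_embs or len(query_embs) != len(item_embs):
        raise ValueError("need two non-empty batches of equal size")

    q = [l2_normalize(v) for v in query_embs]
    d = [l2_normalize(v) for v in item_embs]
    n = len(q)

    # Temperature-scaled cosine similarity between every query and every item.
    sim = [
        [sum(a * b for a, b in zip(q[i], d[j])) / temperature for j in range(n)]
        for i in range(n)
    ]

    # Query-to-item direction: each row should peak on its diagonal entry.
    q2d = -sum(log_softmax_at(sim[i], i) for i in range(n)) / n
    # Item-to-query direction: each column should peak on its diagonal entry.
    d2q = -sum(
        log_softmax_at([sim[i][j] for i in range(n)], j) for j in range(n)
    ) / n

    return 0.5 * (q2d + d2q)
```

In this sketch the loss is small when each query embedding is closest to its own paired item and grows when the pairs are mismatched. In a real training setup the two embedding batches would come from the two encoders, and the loss would be computed with an autodiff framework rather than Python lists.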