Exploiting the PyTerrier library to build a Search Engine and resolve the Near Duplicate Detection tasks.
-
Updated
Sep 20, 2022 - Jupyter Notebook
Exploiting the PyTerrier library to build a Search Engine and resolve the Near Duplicate Detection tasks.
This repository hosts the implementation of a Simple Search Engine designed for efficient information retrieval. The project encompasses several stages from data collection to evaluation, ensuring a comprehensive approach to search and retrieval.
Word2vec, sentenceBert, BM25 and IVFFlat Index quality and speed comparison
This project creates a basic search engine for text documents, covering data collection, preprocessing, indexing, query processing, expansion, UI development, and performance evaluation. Its goal is to efficiently retrieve relevant information from the document collection.
This repository contains the code for a research project that implements and evaluates local word embeddings based on co-authorship and citations for query expansion in PyTerrier on the TREC-Covid dataset.
Fact Finder - a Fact Search Engine
Information retrieval techniques using Pyterrier
Multi-stage Retrieval using SPLADE or RM3 and T5.
Create PyTerrier compatible dense indices using any sentence_transformers model
Clinical Trials Search Engine for Information Retrieval and Recommender System Project
Add a description, image, and links to the pyterrier topic page so that developers can more easily learn about it.
To associate your repository with the pyterrier topic, visit your repo's landing page and select "manage topics."