A composable and fully extensible C++ execution engine library for data management systems.
Text preprocessing, indexer construction, and search engine implementation for information retrieval. Performance is analyzed by measuring indexer construction time.
Instructional materials (course files) for the BBT3104 course (Advanced Database Systems) and the MIT8107 course (Advanced Database Systems). Topic: Query Optimization. Based on the IMDb dataset using PostgreSQL and the Join Order Benchmark (JOB).
Enhance the performance and cost-efficiency of large-scale Retrieval Augmented Generation (RAG) applications. Learn to integrate vector search with traditional database operations and apply techniques like prefiltering, postfiltering, projection, and prompt compression.
Data Processing At Scale
Leverages multiple machine learning algorithms and an LLM to provide in-depth answers to medical queries and to predict conditions or diseases from patient symptoms.
A lightweight database engine supporting CRUD, simple queries, and concurrency control with a CLI.
A DBMS project on textile store management, using Streamlit (Python) for the frontend app.
Implementation and artifacts for "User-Defined Operators: Efficiently Integrating Custom Algorithms into Modern Databases"
A search engine that ranks documents by relevance to a query using a weighting scheme, tokenization, stop word removal, and stemming
Tunable Query Optimizer for Web APIs and User Preferences
An efficient layer to perform query processing on top of Terrier
All assignments, projects, and tasks from the Information Retrieval course at FAST NUCES, Spring 2023.
Efficient caching for non-volatile memory-backed search indices. A thesis submitted for undergraduate honours in Software Engineering.
Database Systems
Simple search system that includes inverted index builder and boolean query processor for information retrieval.
Falcon: The world's fastest data analytics engine
Inverted index and positional index over a document collection to support the Boolean model of IR. Inverted files and positional files are the primary data structures for efficiently determining which documents contain specified terms and at what proximity.
Indoor Spatial Queries: Modeling, Indexing, and Processing
An SQL engine that runs a subset of SQL queries through a command-line interface.
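Several of the projects listed above build inverted and positional indexes for Boolean retrieval. As a rough illustration of that idea, and not the implementation of any listed repository, the following minimal Python sketch maps each term to the documents and positions where it occurs, then answers a Boolean AND query and a phrase (proximity) query. The function names, the whitespace tokenizer, and the `SAMPLE_DOCS` collection are all hypothetical, chosen only for this example.

```python
from collections import defaultdict

# Hypothetical sample collection, used only for illustration.
SAMPLE_DOCS = {
    1: "query processing in database systems",
    2: "database query optimization",
    3: "processing query logs",
}


def tokenize(text):
    """Lowercase and split on whitespace.

    A real system would typically also strip punctuation,
    remove stop words, and apply stemming.
    """
    return text.lower().split()


def build_positional_index(docs):
    """Map each term to {doc_id: [positions of the term in that doc]}."""
    index = defaultdict(lambda: defaultdict(list))
    for doc_id, text in docs.items():
        for pos, term in enumerate(tokenize(text)):
            index[term][doc_id].append(pos)
    return index


def boolean_and(index, terms):
    """Return the sorted ids of documents that contain every term."""
    # Check membership first so lookups do not create empty entries.
    postings = [set(index[t]) if t in index else set() for t in terms]
    return sorted(set.intersection(*postings)) if postings else []


def phrase_query(index, phrase):
    """Return ids of documents where the terms occur consecutively."""
    terms = tokenize(phrase)
    hits = []
    # Only documents containing all terms can contain the phrase.
    for doc_id in boolean_and(index, terms):
        starts = index[terms[0]][doc_id]
        if any(
            all((p + i) in index[t][doc_id] for i, t in enumerate(terms))
            for p in starts
        ):
            hits.append(doc_id)
    return hits


if __name__ == "__main__":
    idx = build_positional_index(SAMPLE_DOCS)
    print(boolean_and(idx, ["query", "processing"]))
    print(phrase_query(idx, "query processing"))
```

The plain inverted index is the special case that keeps only the document ids. Storing positions as well is what makes phrase and proximity queries possible, at the cost of a larger index.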