document-indexing

Here are 11 public repositories matching this topic...

A highly efficient, isomorphic, full-featured, multilingual text search engine library, providing full-text search, fuzzy matching, phonetic scoring, document indexing and more, with micro JSON state hydration/dehydration in-browser and server-side.

  • Updated Jul 21, 2023
  • TypeScript

This repository highlights my learning journey in building Retrieval-Augmented Generation (RAG) pipelines using DeepSeek on Lightning AI, covering document ingestion, retrieval, and integration with generative AI. It showcases fine-tuning, evaluation, and optimization for accurate open-domain QA and knowledge management.

  • Updated Jan 24, 2025
  • Jupyter Notebook

A high-performance PDF document search application that extracts text from PDF files, indexes content using Whoosh, and provides a premium user interface with modern design elements. Features include context-aware search results, content highlighting, multi-format export options, and an interactive document viewer with match navigation.

  • Updated Mar 25, 2025
  • HTML
