@NanoNets

Nanonets

Popular repositories

  1. docext Public

    An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)

    Python 1.8k 135

  2. docstrange Public

    Extract and convert data from any document (images, PDFs, Word documents, PowerPoint files) or URL into multiple formats (Markdown, JSON, CSV, HTML), with intelligent structured data extraction and advanced OCR.

    Python 1.1k 100

  3. nanonets-ocr-sample-python Public

    NanoNets OCR API Example for Python

    Python 206 52

  4. RaspberryPi-ObjectDetection-TensorFlow Public

    Object Detection using TensorFlow on a Raspberry Pi

    Python 172 37

  5. ocr-with-tesseract Public

    A comprehensive tutorial for OCR in Python using Tesseract-OCR and OpenCV

    Jupyter Notebook 127 72

  6. ocr-python Public

    OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.

    Jupyter Notebook 122 17

Repositories

Showing 10 of 57 repositories
  • docstrange Public

    Extract and convert data from any document (images, PDFs, Word documents, PowerPoint files) or URL into multiple formats (Markdown, JSON, CSV, HTML), with intelligent structured data extraction and advanced OCR.

    Python 1,065 MIT 100 26 2 Updated Oct 31, 2025
  • Nanonets-OCR2 Public

    Evaluations for Nanonets-OCR-1.5

    Jupyter Notebook 15 1 1 0 Updated Oct 16, 2025
  • docext Public

    An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)

    Python 1,806 Apache-2.0 136 18 (1 issue needs help) 3 Updated Aug 25, 2025
  • llm-data-converter Public

    Convert any document format into LLM-ready data format (markdown) with advanced intelligent document processing capabilities powered by pre-trained models.

    Python 5 MIT 1 0 0 Updated Aug 14, 2025
  • nanonets-go Public

    Code samples in Go for the Nanonets API

    Go 1 MIT 0 0 0 Updated May 29, 2025
  • DocAIAgent Public

    Code from a workshop on building your own Document AI Agent using open-source LLMs

    Jupyter Notebook 15 8 0 0 Updated May 8, 2025
  • table-metrics Public

    A repo with all metrics related to table extraction accuracy computation

    0 MIT 0 0 0 Updated Apr 24, 2025
  • nn-auto-bench Public

    AutoBench: Benchmarking Automation for Intelligent Document Processing (IDP) with confidence

    Python 10 4 0 0 Updated Mar 18, 2025
  • search-kb Public
    Python 0 0 0 0 Updated Feb 2, 2025
  • Jupyter Notebook 8 2 0 0 Updated Nov 15, 2024