@CatchTheTornado

Catch The Tornado

eCommerce Startup Studio

Popular repositories

  1. pdf-extract-api Public

    Document (PDF) extraction and parsing API using state-of-the-art OCR engines and Ollama-supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown.

    Python · 1.1k stars · 63 forks

  2. askql Public

    AskQL is a query language that can express any data request

    TypeScript · 390 stars · 27 forks

  3. opensourcetipsbook Public

    Open Source book on Open Source. How to create a successful OSS product: tips and tricks that simply work.

    MDX · 56 stars · 7 forks

  4. doctor-dok Public

    Doctor Dok is an AI-based medical data framework and patient's med vault. Parse any health-related PDF/image to JSON and then use ChatGPT / Llama to discuss it! WARNING: Don't decide on your healt…

    TypeScript · 46 stars · 8 forks

  5. llm-pdf-ocr-anonimizer Public

    This project uses Tesseract OCR to convert images to text, then removes PII information.

    TypeScript · 11 stars

  6. ai-product-descriptor Public

    This is a PoC of a REST service that extracts a text description and structured attributes from a product image.

    Python · 9 stars

Repositories

Showing 6 of 6 repositories
  • pdf-extract-api Public

    Document (PDF) extraction and parsing API using state-of-the-art OCR engines and Ollama-supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown.

    Python · 1,131 stars · GPL-3.0 · 63 forks · 10 issues · 3 pull requests · Updated Nov 7, 2024
  • doctor-dok Public

    Doctor Dok is an AI-based medical data framework and patient's med vault. Parse any health-related PDF/image to JSON and then use ChatGPT / Llama to discuss it! WARNING: Don't make health decisions based on AI chat; it's for research purposes only.

    TypeScript · 46 stars · MIT · 8 forks · 60 issues (3 issues need help) · 0 pull requests · Updated Oct 22, 2024
  • llm-pdf-ocr-anonimizer Public

    This project uses Tesseract OCR to convert images to text, then removes PII information.

    TypeScript · 11 stars · MIT · 0 forks · 3 issues · 0 pull requests · Updated Jul 28, 2024
  • ai-product-descriptor Public

    This is a PoC of a REST service that extracts a text description and structured attributes from a product image.

    Python · 9 stars · 0 forks · 0 issues · 0 pull requests · Updated Dec 4, 2023
  • opensourcetipsbook Public

    Open Source book on Open Source. How to create a successful OSS product: tips and tricks that simply work.

    MDX · 56 stars · CC-BY-4.0 · 7 forks · 2 issues · 0 pull requests · Updated Oct 10, 2023
  • askql Public

    AskQL is a query language that can express any data request

    TypeScript · 390 stars · MIT · 27 forks · 155 issues (14 issues need help) · 24 pull requests · Updated Mar 8, 2023

People

This organization has no public members. You must be a member to see who’s a part of this organization.
