dwani.ai - Knowledge from Curiosity

Overview

dwani.ai is a self-hosted, multimodal inference system designed for Indian Languages.

Interact using Text, Voice, Image and Documents for learning, analysis and exploration.

pip install --upgrade dwani

import dwani
import os

dwani.api_key = os.getenv("DWANI_API_KEY")

dwani.api_base = os.getenv("DWANI_API_BASE_URL")
  • gemma3 - with translation
resp = dwani.Chat.create(prompt="Hello!", src_lang="english", tgt_lang="kannada", model="gemma3")
print(resp)
{'response': 'ನಮಸ್ತೆ! ಭಾರತ ಮತ್ತು ಕರ್ನಾಟಕವನ್ನು ಗಮನದಲ್ಲಿಟ್ಟುಕೊಂಡು ಇಂದು ನಿಮ್ಮ ಪ್ರಶ್ನೆಗಳಿಗೆ ನಾನು ನಿಮಗೆ ಹೇಗೆ ಸಹಾಯ ಮಾಡಲಿ?'}
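
The setup above reads both settings from the environment, and `os.getenv` quietly returns `None` when a variable is unset. A minimal sketch of a pre-flight check follows; the helper name `read_dwani_settings` is hypothetical and is not part of the dwani SDK.

```python
import os

# The two variable names used in the setup snippet above.
REQUIRED_VARS = ("DWANI_API_KEY", "DWANI_API_BASE_URL")


def read_dwani_settings(env=None):
    """Return the settings as a dict, or raise if one is missing or empty.

    Hypothetical helper for illustration; `env` defaults to os.environ
    and can be replaced with a plain dict for testing.
    """
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {
        "api_key": env["DWANI_API_KEY"],
        "api_base": env["DWANI_API_BASE_URL"],
    }
```

The returned values could then be assigned to `dwani.api_key` and `dwani.api_base` as shown earlier, so a missing variable fails at startup rather than on the first request.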

Website -> dwani.ai

Chat UI -> chat.dwani.ai

Workshop PPT -> workshop-ppt

Why dwani? - Video

Download App on Google Play

Research Goals

  • Measure and improve the Time to First Token Generation (TTFTG) for model architectures in ASR, Translation, and TTS systems.
  • Develop and enhance a Kannada voice + OCR model that meets the industry standards set by OpenAI, Google, ElevenLabs, and xAI.
  • Create robust voice solutions for Indian languages, with a specific emphasis on Kannada.
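
As a rough illustration of the first goal, time to first token can be measured by timing how long a streaming response takes to yield its first item. The sketch below assumes the model output is exposed as a Python iterable of tokens; `measure_first_token` and `fake_stream` are hypothetical names, and no dwani API is called.

```python
import time


def measure_first_token(stream, clock=time.perf_counter):
    """Return (first_token, seconds_until_first_token, remaining_iterator).

    `stream` is any iterable of tokens; `clock` is injectable so the
    measurement can be tested deterministically.
    """
    start = clock()
    iterator = iter(stream)
    try:
        first = next(iterator)  # blocks until the first token is produced
    except StopIteration:
        raise ValueError("stream produced no tokens") from None
    elapsed = clock() - start
    return first, elapsed, iterator


def fake_stream(delay_s=0.05):
    """Stand-in for a model stream: waits, then yields two tokens."""
    time.sleep(delay_s)  # simulated latency before the first token
    yield "ನಮಸ್ತೆ"
    yield "!"


if __name__ == "__main__":
    token, seconds, rest = measure_first_token(fake_stream())
    print(f"first token {token!r} after {seconds:.3f}s; rest: {list(rest)}")
```

Returning the remaining iterator lets the caller keep consuming the stream after the measurement, so the timing does not discard output.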

Models and Tools

The project utilizes the following open-source tools:

| Open-Source Tool | Source Repository |
| --- | --- |
| Automatic Speech Recognition (ASR) | ASR Indic Server |
| Text to Speech (TTS) | TTS Indic Server |
| Translation | Indic Translate Server |
| Document Parser | Indic Document Server |
| dwani API Server | Dwani Server |
| dwani Android | Android |
| dwani Python SDK | python-sdk |
| vllm-arm64 | vllm-arm64 |

Features

| Feature | Description | Components |
| --- | --- | --- |
| Voice AI | Provides answers to voice queries using an LLM. | ASR + LLM |
| Text Translate | Translates text from one language to another. | Translation |
| Text Query | Allows querying text data for specific information. | LLM |
| Voice to Text Translation | Converts spoken language to text and translates it. | ASR, Translation |
| PDF Translate | Translates content from PDF documents. | |
| Text to Speech | Generates speech from text. | TTS |
| Voice to Voice Translation | Converts spoken language to text, translates it, and then generates speech. | ASR, Translation, TTS |
| Answer Engine with Translate | Provides answers to queries with translation capabilities. | ASR, LLM, Translation, TTS |
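
Several of the features above are compositions of the same stages. As a sketch only, Voice to Voice Translation can be pictured as ASR, then Translation, then TTS chained in order. Every function below is a hypothetical placeholder standing in for a real service call; none of these names come from the dwani SDK.

```python
def voice_to_voice(audio, src_lang, tgt_lang, asr, translate, tts):
    """Chain the three stages: speech -> text -> translated text -> speech.

    `asr`, `translate`, and `tts` are caller-supplied callables
    (placeholders here, not dwani APIs).
    """
    text = asr(audio, src_lang)
    translated = translate(text, src_lang, tgt_lang)
    return tts(translated, tgt_lang)


# Stub stages so the sketch runs without any model or server.
def stub_asr(audio, lang):
    return audio.decode("utf-8")  # pretend the audio bytes are the transcript


def stub_translate(text, src_lang, tgt_lang):
    return f"[{src_lang}->{tgt_lang}] {text}"


def stub_tts(text, lang):
    return text.encode("utf-8")  # pretend the encoded text is synthesized audio


if __name__ == "__main__":
    out = voice_to_voice(b"Hello!", "english", "kannada",
                         stub_asr, stub_translate, stub_tts)
    print(out.decode("utf-8"))
```

Passing the stages in as arguments keeps the composition independent of how each stage is served, which also makes it possible to time each stage separately.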

Contact

  • For any questions or issues, please open an issue on GitHub or contact us via email.
  • For collaborations
  • For business queries, email: sachin (at) dwani (dot) ai
