Example: a folder with all your e-books, from which you run a technical chat with the AI.
Example: a folder with the documentation for your next work activity, from which you perform an AI-assisted discovery.
DocTalk is an orchestration of three principal components:

- Whisper AI
  - Thanks to OpenAI Whisper for the algorithm @officialsite
  - Thanks to the WhisperNet team @github
- LLama AI
  - Thanks to Meta LLama2 for the algorithm @officialsite
  - Thanks to TheBlokeAI for the model @officialsite
  - Thanks to the LLamaSharp team for the .NET wrapper @github
- Microsoft.KernelMemory @github
All of this builds what is intended to be an example of how you can exploit the advantages of generative AI to start a discussion about your documentation with an AI assistant, as sketched below.
Everything runs locally, without taking advantage of the Cloud, purely to avoid the difficult problems raised by the privacy of the information the documents contain.
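A rough sketch of how the three components fit together, assuming the `Whisper.net`, `LLamaSharp.kernel-memory` and `Microsoft.KernelMemory` NuGet packages; the file names, model paths and document id below are placeholders, not DocTalk's actual values:

```csharp
// Sketch of the DocTalk pipeline (.NET 8 top-level program, implicit usings).
using System.Text;
using Whisper.net;
using LLamaSharp.KernelMemory;
using Microsoft.KernelMemory;

// 1) Transcribe a media file to text with Whisper.
using var whisperFactory = WhisperFactory.FromPath("ggml-base.bin"); // placeholder model file
await using var processor = whisperFactory.CreateBuilder()
    .WithLanguage("auto")
    .Build();

var transcript = new StringBuilder();
using var audio = File.OpenRead("talk.wav"); // placeholder media file
await foreach (var segment in processor.ProcessAsync(audio))
    transcript.Append(segment.Text);

// 2) Index the transcript with Kernel Memory, backed by the local LLama model.
var memory = new KernelMemoryBuilder()
    .WithLLamaSharpDefaults(new LLamaSharpConfig("llama-2-7b-chat.Q4_K_M.gguf"))
    .Build<MemoryServerless>();

await memory.ImportTextAsync(transcript.ToString(), documentId: "talk");

// 3) Ask a question about the ingested content.
var answer = await memory.AskAsync("What is the talk about?");
Console.WriteLine(answer.Result);
```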
- Build the project and restore the NuGet packages.
- Download FFMpeg and put all 3 executables in the `bin` folder inside the application root directory (example: `DocTalk\bin\Debug\net8.0`).
- Start the program.
- The program asks you for a directory to discuss.
- The program will start downloading the AI models.
  - ℹ️: This operation may take some time (more info in the "tweak" section below).
- All the media files will be read using Whisper AI and converted to text format.
- The LLama model is loaded and the chat with the AI begins (a minimal loop is sketched below).
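The final chat step boils down to a small console loop; a minimal sketch, reusing the `memory` instance built in the pipeline sketch above:

```csharp
// Minimal chat loop over the ingested documents; an empty line exits.
while (true)
{
    Console.Write("You: ");
    var question = Console.ReadLine();
    if (string.IsNullOrWhiteSpace(question)) break;

    var answer = await memory.AskAsync(question);
    Console.WriteLine($"AI: {answer.Result}");
}
```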
- **LLama model**
  The model used is the one provided by TheBlokeAI on Hugging Face, specifically `llama-2-7b-chat.Q4_K_M.gguf` (approximately 4 GB). In my opinion this offers the best computational cost/benefit ratio (see the loading sketch after this list).
- **LLamaSharp speedup**
  For compatibility, the project starts with the `LLamaSharp.Backend.Cpu` library, which uses the CPU for computation. For better performance, if you have an Nvidia RTX video card you can use the CUDA suite, which runs the calculations on the GPU instead:
  - Install the CUDA Toolkit available at Nvidia
  - Remove the `LLamaSharp.Backend.Cpu` NuGet package
  - Install the `LLamaSharp.Backend.Cuda12` package (or 11 for backwards compatibility)
- **Move to Cloud**
  `Microsoft.KernelMemory` offers various computing options via the Cloud, all of which are more efficient than what can be done locally (see the sketch after this list). However, be careful, because the contents of the documents will be sent to the Cloud for analysis. In many scenarios this is completely safe, but it is worth evaluating on a case-by-case basis.
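For reference, this is roughly how the GGUF model can be loaded directly with LLamaSharp; the path and parameter values here are illustrative assumptions, and `GpuLayerCount` only has an effect once a CUDA backend package replaces the CPU one:

```csharp
using LLama;
using LLama.Common;

// Sketch: loading the quantized LLama 2 model with LLamaSharp.
var parameters = new ModelParams("llama-2-7b-chat.Q4_K_M.gguf")
{
    ContextSize = 4096, // LLama 2 context window
    GpuLayerCount = 32  // > 0 offloads layers to the GPU (CUDA backend only)
};

using var weights = LLamaWeights.LoadFromFile(parameters);
using var context = weights.CreateContext(parameters);
var executor = new InteractiveExecutor(context);
var session = new ChatSession(executor);
```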
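And moving to the Cloud is essentially a one-line swap in the builder; a sketch using Kernel Memory's OpenAI connector (reading the API key from an environment variable is an assumption, not DocTalk's configuration):

```csharp
using Microsoft.KernelMemory;

// Sketch: the same builder, pointed at OpenAI instead of the local model.
// Remember: with this setup the documents are sent to the Cloud for analysis.
var apiKey = Environment.GetEnvironmentVariable("OPENAI_API_KEY")!;

var memory = new KernelMemoryBuilder()
    .WithOpenAIDefaults(apiKey)
    .Build<MemoryServerless>();
```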
- `LLamaSharp.kernel-memory` always uses the `Console` to log.
- Some AI responses may not always be great. The model used is a previous version of LLama; the latest version uses models of up to 47 GB and offers the best performance (TheBlokeAI on Hugging Face).
- Martin Evans (@martindevans) for his quick support with LLamaSharp.
- Marco Minerva (@marcominerva) for his consistently useful support and for the quality of the code he publishes.
- Copilot (@officialsite) for the logo and the support during the coding sessions.