Local LLMs Presentation Examples

Overview

This repository contains the code examples described during the Local LLMs presentation at UW Madison ITPC 2024. The aim of these examples is to build a hands-on understanding of how LLMs work, how they can be used, and their limitations. All of them rely on freely available tools and models that can run locally on most developers' computers.

Examples

  • just-curl
    • Just a simple curl command to get the LLM to write a story
    • Simple example of Ollama's OpenAI-compatible API
  • simplest_chat
    • Simple JavaScript REST example
    • Demonstrates use of a System Prompt
    • Demonstrates some Model Parameters
  • simple_chat
    • Demonstrates the use and maintenance of Chat History to provide ongoing context to the LLM (a request of this shape is sketched after this list)
  • chat-ui
    • Simple Chat web application
    • Demonstrates streaming output to make 'next word' prediction a bit more obvious
    • Allows easy experimentation with System Prompts
  • rag-embed
    • Demonstrates a very basic Retrieval-Augmented Generation (RAG) system
  • llama-index
    • Demonstrates the use of the LlamaIndex library to implement a RAG system
  • alt-text-generator
    • Demonstrates the use of both a BLIP captioning model and a multi-modal LLM to generate image captions
  • xkcd-explainer
    • Leverages a multi-modal model to attempt to explain random xkcd comics.
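
The chat examples above (simplest_chat, simple_chat, chat-ui) all boil down to variations on one HTTP request to the local Ollama server. The following is a rough Python sketch of that request shape, not the repo's actual code, using Ollama's native /api/chat endpoint (the OpenAI-compatible /v1/chat/completions endpoint accepts a very similar payload); the prompt, temperature, and variable names are illustrative.

    import json
    import requests  # assumed third-party dependency, not part of the repo

    # The system prompt and any prior turns travel together as the message list.
    history = [
        {"role": "system", "content": "You are a terse storyteller."},
        {"role": "user", "content": "Write a one-sentence story about a badger."},
    ]

    response = requests.post(
        "http://localhost:11434/api/chat",    # Ollama's default local address
        json={
            "model": "mistral:latest",
            "messages": history,
            "options": {"temperature": 0.7},  # one example of a model parameter
            "stream": True,                   # the reply arrives one token at a time
        },
        stream=True,
    )

    # Print tokens as they stream in, as chat-ui does to make
    # 'next word' prediction visible.
    reply = ""
    for line in response.iter_lines():
        if not line:
            continue
        chunk = json.loads(line)
        if chunk.get("done"):
            break
        token = chunk["message"]["content"]
        print(token, end="", flush=True)
        reply += token

    # Appending the reply to the history is what gives the model 'memory'
    # on the next turn, as simple_chat demonstrates.
    history.append({"role": "assistant", "content": reply})

Ollama's chat API also accepts base64-encoded images alongside a message, which is how a multi-modal model such as llava can be asked about a picture (the idea behind alt-text-generator and xkcd-explainer).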

Preparation & Requirements

  • Hardware
    • 4-8 CPU cores
    • 8-32 GB RAM
    • GPU with 8 GB+ VRAM or Apple Silicon (M1, M2, M3) recommended, but not required
    • Roughly 20 GB of free disk space (less may work)
  • Software
    • Python 3.9.x - Some examples have dependencies that may have trouble under Python 3.10 at the moment (3/2024).
    • Node 18.x
    • Ollama desktop application - Cross-platform LLM manager.
    • Mistral 7B LLM - Available through Ollama once Ollama is installed
      • ollama pull mistral:latest
    • LLaVA multi-modal model - Available through Ollama once Ollama is installed
      • ollama pull llava:latest
    • Nomic embedding model - Available through Ollama once Ollama is installed (see the sketch after this list)
      • ollama pull nomic-embed-text
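
Once the models are pulled, a quick sanity check is to request an embedding directly. Below is a minimal Python sketch, assuming the third-party requests package and Ollama running on its default port; the prompt text is illustrative.

    import requests  # assumed third-party dependency, not part of the repo

    # Ask the local embedding model to turn a sentence into a vector.
    resp = requests.post(
        "http://localhost:11434/api/embeddings",
        json={"model": "nomic-embed-text", "prompt": "Badgers dig extensive burrows."},
    )
    vector = resp.json()["embedding"]  # a plain list of floats
    print(len(vector))                 # nomic-embed-text produces 768-dimensional vectors

In broad strokes, rag-embed builds on calls like this: the documents and the user's question are each embedded, and the documents whose vectors sit closest to the question's are handed to the LLM as context.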

Running the examples

JavaScript Examples

  1. Ensure that you are using Node 18+ (older versions may work but are untested)
    1. node -v
  2. Change to the directory of one of the JavaScript examples (simple_chat or simplest_chat).
  3. Install the dependencies
    1. npm install
  4. Run the script
    1. node main.js

Python Examples

For best results, I'd suggest creating a separate Python virtual environment in each example directory.

  1. Ensure that you are using Python 3.9.x
    1. python -V
  2. Change to the example directory
  3. Create and activate a new python virtual environment
    1. python -m venv .venv
    2. . .venv/bin/activate
  4. Install the requirements
    1. pip install -r requirements.txt
  5. Run the script
    1. python main.py

For examples that provide a web interface, the interface will be available at http://localhost:7861.

NOTE: Some examples will need to download a relatively large model file (about 5 GB) upon first run.
