Skip to content

Latest commit

 

History

History
101 lines (46 loc) · 3.96 KB

README.md

File metadata and controls

101 lines (46 loc) · 3.96 KB

ChadGPT

OpenAI GPT Link

ARCHIVE

Run llama.cpp distributed over MPI

Centeral Iowa Linux Users Group
19 April 2023

News for April 2023

Presentation: Self-hosting large language models

  • What is a large language model(LLM)?

  • What can I do with a LLM?

  • Run your own LLM - llama.cpp and web-llm

What is a large languge model (LLM)?

What can I do with an LLM?

  • ChatGPT4 Demo - GitHub Copilot demo if you want

  • whisper.cpp - use OpenAI whisper to transcribe audio to text - WANTED live transcription for meetings.

  • AutoGPT - BabyAGI - use GPT and scripts to drive other GPTs and scripts.

  • Reddit GPT has good weekly briefings.

Run your own LLM - llama.cpp and web-llm

  • llama.cpp - a fork of whisper.cpp - most widely used C++ code to host your own LLM.

  • Huggingface - stores open models as Git LFS.

  • web-llm - uses WebGPU to run in the LLM in your browser.

Linux Predictions

  • Linux 7 will have a LLM of various sizes and an SMT solver to prove responses correct.

  • CGROUPS3 - closer to AWS Zelkova and AWS IAM

  • Kernel LLM will be used as a dictionary for data compression.

  • Oxide Computer size racks will have distrubuted linux schedulers. Kubernetes goes extinct.

  • More systems code like compilers will run on GPU.