transformers: correctly load automodels #1643

mudler · 2024-01-25T18:19:17Z

Description

This PR makes the transformers backend functional, and adds an example with tinyllama. It also expands the examples section with animagine-xl.

Notes for Reviewers

Signed commits

Yes, I signed my commits.

netlify · 2024-01-25T18:19:22Z

✅ Deploy Preview for localai canceled.

Name	Link
🔨 Latest commit	`e02f2f0`
🔍 Latest deploy log	https://app.netlify.com/sites/localai/deploys/65b2b7239985330008a04279

….0@b689c91 by renovate (#17756) This PR contains the following updates: | Package | Update | Change | |---|---|---| | [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) | minor | `v2.6.1` -> `v2.7.0` | --- > [!WARNING] > Some dependencies could not be looked up. Check the Dependency Dashboard for more information. --- ### Release Notes <details> <summary>mudler/LocalAI (docker.io/localai/localai)</summary> ### [`v2.7.0`](https://togithub.com/mudler/LocalAI/releases/tag/v2.7.0) [Compare Source](https://togithub.com/mudler/LocalAI/compare/v2.6.1...v2.7.0)  This release adds support to the transformer backend for LLM as well! For now instance you can run codellama-7b with transformers with: docker run -ti -p 8080:8080 --gpus all localai/localai:v2.7.0-cublas-cuda12 codellama-7b In the quickstart there are more examples available https://localai.io/basics/getting_started/#running-models. Note: As llama.cpp is ongoing with changes that could possible cause breakage, this release does not includes changes from [ggerganov/llama.cpp#5138 (the future versions will). #### What's Changed ##### Bug fixes 🐛 - fix(paths): automatically create paths by [@mudler](https://togithub.com/mudler) in [mudler/LocalAI#1650 ##### Exciting New Features 🎉 - feat(transformers): support also text generation by [@mudler](https://togithub.com/mudler) in [mudler/LocalAI#1630 - transformers: correctly load automodels by [@mudler](https://togithub.com/mudler) in [mudler/LocalAI#1643 - feat(startup): fetch model definition remotely by [@mudler](https://togithub.com/mudler) in [mudler/LocalAI#1654 ##### 👒 Dependencies - ⬆️ Update ggerganov/llama.cpp by [@localai-bot](https://togithub.com/localai-bot) in [mudler/LocalAI#1642 - ⬆️ Update ggerganov/llama.cpp by [@localai-bot](https://togithub.com/localai-bot) in [mudler/LocalAI#1644 - ⬆️ Update ggerganov/llama.cpp by [@localai-bot](https://togithub.com/localai-bot) in [mudler/LocalAI#1652 - ⬆️ Update ggerganov/llama.cpp by [@localai-bot](https://togithub.com/localai-bot) in [mudler/LocalAI#1655 ##### Other Changes - ⬆️ Update ggerganov/llama.cpp by [@localai-bot](https://togithub.com/localai-bot) in [mudler/LocalAI#1632 - ⬆️ Update docs version mudler/LocalAI by [@localai-bot](https://togithub.com/localai-bot) in [mudler/LocalAI#1631 **Full Changelog**: mudler/LocalAI@v2.6.1...v2.6.2 </details> --- ### Configuration 📅 **Schedule**: Branch creation - "before 10pm on monday" in timezone Europe/Amsterdam, Automerge - At any time (no schedule defined). 🚦 **Automerge**: Enabled. ♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox. 🔕 **Ignore**: Close this PR and you won't be reminded about this update again. --- - [ ] If you want to rebase/retry this PR, check this box --- This PR has been generated by [Renovate Bot](https://togithub.com/renovatebot/renovate).

….0@b689c91 by renovate (truecharts#17756) This PR contains the following updates: | Package | Update | Change | |---|---|---| | [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) | minor | `v2.6.1` -> `v2.7.0` | --- > [!WARNING] > Some dependencies could not be looked up. Check the Dependency Dashboard for more information. --- ### Release Notes <details> <summary>mudler/LocalAI (docker.io/localai/localai)</summary> ### [`v2.7.0`](https://togithub.com/mudler/LocalAI/releases/tag/v2.7.0) [Compare Source](https://togithub.com/mudler/LocalAI/compare/v2.6.1...v2.7.0)  This release adds support to the transformer backend for LLM as well! For now instance you can run codellama-7b with transformers with: docker run -ti -p 8080:8080 --gpus all localai/localai:v2.7.0-cublas-cuda12 codellama-7b In the quickstart there are more examples available https://localai.io/basics/getting_started/#running-models. Note: As llama.cpp is ongoing with changes that could possible cause breakage, this release does not includes changes from [ggerganov/llama.cpp#5138 (the future versions will). #### What's Changed ##### Bug fixes 🐛 - fix(paths): automatically create paths by [@&truecharts#8203;mudler](https://togithub.com/mudler) in [mudler/LocalAI#1650 ##### Exciting New Features 🎉 - feat(transformers): support also text generation by [@&truecharts#8203;mudler](https://togithub.com/mudler) in [mudler/LocalAI#1630 - transformers: correctly load automodels by [@&truecharts#8203;mudler](https://togithub.com/mudler) in [mudler/LocalAI#1643 - feat(startup): fetch model definition remotely by [@&truecharts#8203;mudler](https://togithub.com/mudler) in [mudler/LocalAI#1654 ##### 👒 Dependencies - ⬆️ Update ggerganov/llama.cpp by [@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in [mudler/LocalAI#1642 - ⬆️ Update ggerganov/llama.cpp by [@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in [mudler/LocalAI#1644 - ⬆️ Update ggerganov/llama.cpp by [@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in [mudler/LocalAI#1652 - ⬆️ Update ggerganov/llama.cpp by [@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in [mudler/LocalAI#1655 ##### Other Changes - ⬆️ Update ggerganov/llama.cpp by [@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in [mudler/LocalAI#1632 - ⬆️ Update docs version mudler/LocalAI by [@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in [mudler/LocalAI#1631 **Full Changelog**: mudler/LocalAI@v2.6.1...v2.6.2 </details> --- ### Configuration 📅 **Schedule**: Branch creation - "before 10pm on monday" in timezone Europe/Amsterdam, Automerge - At any time (no schedule defined). 🚦 **Automerge**: Enabled. ♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox. 🔕 **Ignore**: Close this PR and you won't be reminded about this update again. --- - [ ] If you want to rebase/retry this PR, check this box --- This PR has been generated by [Renovate Bot](https://togithub.com/renovatebot/renovate).

backends(transformers): use AutoModel with LLM types

47bca84

examples: animagine-xl

ee79fce

mudler force-pushed the transformers_automodelllm branch 3 times, most recently from 853c88e to f1c2875 Compare January 25, 2024 19:15

Add codellama examples

e02f2f0

mudler force-pushed the transformers_automodelllm branch from f1c2875 to e02f2f0 Compare January 25, 2024 19:31

mudler merged commit cb75127 into master Jan 25, 2024
24 checks passed

mudler deleted the transformers_automodelllm branch January 25, 2024 23:13

mudler added the enhancement New feature or request label Jan 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

transformers: correctly load automodels #1643

transformers: correctly load automodels #1643

mudler commented Jan 25, 2024

netlify bot commented Jan 25, 2024 •

edited

Loading

transformers: correctly load automodels #1643

transformers: correctly load automodels #1643

Conversation

mudler commented Jan 25, 2024

netlify bot commented Jan 25, 2024 • edited Loading

✅ Deploy Preview for localai canceled.

netlify bot commented Jan 25, 2024 •

edited

Loading