Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

transformers: correctly load automodels #1643

Merged
merged 3 commits into from
Jan 25, 2024
Merged

Conversation

mudler
Copy link
Owner

@mudler mudler commented Jan 25, 2024

Description

This PR makes the transformers backend functional, and adds an example with tinyllama. It also expands the examples section with animagine-xl.

Notes for Reviewers

Signed commits

  • Yes, I signed my commits.

Copy link

netlify bot commented Jan 25, 2024

Deploy Preview for localai canceled.

Name Link
🔨 Latest commit e02f2f0
🔍 Latest deploy log https://app.netlify.com/sites/localai/deploys/65b2b7239985330008a04279

@mudler mudler force-pushed the transformers_automodelllm branch 3 times, most recently from 853c88e to f1c2875 Compare January 25, 2024 19:15
@mudler mudler merged commit cb75127 into master Jan 25, 2024
24 checks passed
@mudler mudler deleted the transformers_automodelllm branch January 25, 2024 23:13
@mudler mudler added the enhancement New feature or request label Jan 29, 2024
truecharts-admin added a commit to truecharts/charts that referenced this pull request Jan 29, 2024
….0@b689c91 by renovate (#17756)

This PR contains the following updates:

| Package | Update | Change |
|---|---|---|
| [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) |
minor | `v2.6.1` -> `v2.7.0` |

---

> [!WARNING]
> Some dependencies could not be looked up. Check the Dependency
Dashboard for more information.

---

### Release Notes

<details>
<summary>mudler/LocalAI (docker.io/localai/localai)</summary>

### [`v2.7.0`](https://togithub.com/mudler/LocalAI/releases/tag/v2.7.0)

[Compare
Source](https://togithub.com/mudler/LocalAI/compare/v2.6.1...v2.7.0)

<!-- Release notes generated using configuration in .github/release.yml
at master -->

This release adds support to the transformer backend for LLM as well!

For now instance you can run codellama-7b with transformers with:

docker run -ti -p 8080:8080 --gpus all
localai/localai:v2.7.0-cublas-cuda12 codellama-7b

In the quickstart there are more examples available
https://localai.io/basics/getting_started/#running-models.

Note: As llama.cpp is ongoing with changes that could possible cause
breakage, this release does not includes changes from
[ggerganov/llama.cpp#5138
(the future versions will).

#### What's Changed

##### Bug fixes 🐛

- fix(paths): automatically create paths by
[@&#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1650

##### Exciting New Features 🎉

- feat(transformers): support also text generation by
[@&#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1630
- transformers: correctly load automodels by
[@&#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1643
- feat(startup): fetch model definition remotely by
[@&#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1654

##### 👒 Dependencies

- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1642
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1644
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1652
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1655

##### Other Changes

- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1632
- ⬆️ Update docs version mudler/LocalAI by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1631

**Full Changelog**:
mudler/LocalAI@v2.6.1...v2.6.2

</details>

---

### Configuration

📅 **Schedule**: Branch creation - "before 10pm on monday" in timezone
Europe/Amsterdam, Automerge - At any time (no schedule defined).

🚦 **Automerge**: Enabled.

♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the
rebase/retry checkbox.

🔕 **Ignore**: Close this PR and you won't be reminded about this update
again.

---

- [ ] <!-- rebase-check -->If you want to rebase/retry this PR, check
this box

---

This PR has been generated by [Renovate
Bot](https://togithub.com/renovatebot/renovate).

<!--renovate-debug:eyJjcmVhdGVkSW5WZXIiOiIzNy4xNTYuNCIsInVwZGF0ZWRJblZlciI6IjM3LjE1Ni40IiwidGFyZ2V0QnJhbmNoIjoibWFzdGVyIn0=-->
GabrielBarzen pushed a commit to GabrielBarzen/charts that referenced this pull request Feb 2, 2024
….0@b689c91 by renovate (truecharts#17756)

This PR contains the following updates:

| Package | Update | Change |
|---|---|---|
| [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) |
minor | `v2.6.1` -> `v2.7.0` |

---

> [!WARNING]
> Some dependencies could not be looked up. Check the Dependency
Dashboard for more information.

---

### Release Notes

<details>
<summary>mudler/LocalAI (docker.io/localai/localai)</summary>

### [`v2.7.0`](https://togithub.com/mudler/LocalAI/releases/tag/v2.7.0)

[Compare
Source](https://togithub.com/mudler/LocalAI/compare/v2.6.1...v2.7.0)

<!-- Release notes generated using configuration in .github/release.yml
at master -->

This release adds support to the transformer backend for LLM as well!

For now instance you can run codellama-7b with transformers with:

docker run -ti -p 8080:8080 --gpus all
localai/localai:v2.7.0-cublas-cuda12 codellama-7b

In the quickstart there are more examples available
https://localai.io/basics/getting_started/#running-models.

Note: As llama.cpp is ongoing with changes that could possible cause
breakage, this release does not includes changes from
[ggerganov/llama.cpp#5138
(the future versions will).

#### What's Changed

##### Bug fixes 🐛

- fix(paths): automatically create paths by
[@&truecharts#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1650

##### Exciting New Features 🎉

- feat(transformers): support also text generation by
[@&truecharts#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1630
- transformers: correctly load automodels by
[@&truecharts#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1643
- feat(startup): fetch model definition remotely by
[@&truecharts#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1654

##### 👒 Dependencies

- ⬆️ Update ggerganov/llama.cpp by
[@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1642
- ⬆️ Update ggerganov/llama.cpp by
[@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1644
- ⬆️ Update ggerganov/llama.cpp by
[@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1652
- ⬆️ Update ggerganov/llama.cpp by
[@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1655

##### Other Changes

- ⬆️ Update ggerganov/llama.cpp by
[@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1632
- ⬆️ Update docs version mudler/LocalAI by
[@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1631

**Full Changelog**:
mudler/LocalAI@v2.6.1...v2.6.2

</details>

---

### Configuration

📅 **Schedule**: Branch creation - "before 10pm on monday" in timezone
Europe/Amsterdam, Automerge - At any time (no schedule defined).

🚦 **Automerge**: Enabled.

♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the
rebase/retry checkbox.

🔕 **Ignore**: Close this PR and you won't be reminded about this update
again.

---

- [ ] <!-- rebase-check -->If you want to rebase/retry this PR, check
this box

---

This PR has been generated by [Renovate
Bot](https://togithub.com/renovatebot/renovate).

<!--renovate-debug:eyJjcmVhdGVkSW5WZXIiOiIzNy4xNTYuNCIsInVwZGF0ZWRJblZlciI6IjM3LjE1Ni40IiwidGFyZ2V0QnJhbmNoIjoibWFzdGVyIn0=-->
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant