Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Use ubuntu as base for container images, drop deprecated ggml-transformers backends #1689

Merged
merged 12 commits into from
Feb 8, 2024

Conversation

mudler
Copy link
Owner

@mudler mudler commented Feb 6, 2024

Description

This PR also closes #1651 as it tidies up old backends relying on old versions of ggml libraries

Notes for Reviewers

Disables automated bark tests as are CPU intensive and most of the time fails as being killed

Signed commits

  • Yes, I signed my commits.

Copy link

netlify bot commented Feb 6, 2024

Deploy Preview for localai canceled.

Name Link
🔨 Latest commit dcc02d1
🔍 Latest deploy log https://app.netlify.com/sites/localai/deploys/65c50667f877dd000849657b

@mudler mudler changed the title Sycl fixes 2 Dockerfile: change base image, add sycl deps from repositories Feb 6, 2024
@mudler mudler changed the title Dockerfile: change base image, add sycl deps from repositories Use ubuntu as base for container images, drop old backends Feb 7, 2024
@mudler mudler changed the title Use ubuntu as base for container images, drop old backends Use ubuntu as base for container images, drop deprecated ggml backends Feb 7, 2024
@mudler mudler changed the title Use ubuntu as base for container images, drop deprecated ggml backends feat: Use ubuntu as base for container images, drop deprecated ggml backends Feb 7, 2024
@mudler mudler added the enhancement New feature or request label Feb 7, 2024
@mudler mudler force-pushed the sycl_fixes_2 branch 7 times, most recently from 3263d2d to 0c3a0b8 Compare February 7, 2024 17:07
@mudler mudler force-pushed the sycl_fixes_2 branch 3 times, most recently from de146e5 to 2cee605 Compare February 8, 2024 12:00
@mudler mudler changed the title feat: Use ubuntu as base for container images, drop deprecated ggml backends feat: Use ubuntu as base for container images, drop deprecated ggml-transformers backends Feb 8, 2024
@mudler
Copy link
Owner Author

mudler commented Feb 8, 2024

This PR finally makes functional sycl images with Intel GPUs!

@mudler mudler merged commit ddd21f1 into master Feb 8, 2024
24 checks passed
@mudler mudler deleted the sycl_fixes_2 branch February 8, 2024 19:12
@mudler mudler mentioned this pull request Feb 8, 2024
truecharts-admin added a commit to truecharts/charts that referenced this pull request Feb 12, 2024
….0 by renovate (#18178)

This PR contains the following updates:

| Package | Update | Change |
|---|---|---|
| [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) |
minor | `v2.7.0-cublas-cuda11-ffmpeg-core` ->
`v2.8.0-cublas-cuda11-ffmpeg-core` |
| [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) |
minor | `v2.7.0-cublas-cuda11-core` -> `v2.8.0-cublas-cuda11-core` |
| [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) |
minor | `v2.7.0-cublas-cuda12-ffmpeg-core` ->
`v2.8.0-cublas-cuda12-ffmpeg-core` |
| [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) |
minor | `v2.7.0-cublas-cuda12-core` -> `v2.8.0-cublas-cuda12-core` |
| [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) |
minor | `v2.7.0-ffmpeg-core` -> `v2.8.0-ffmpeg-core` |
| [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) |
minor | `v2.7.0` -> `v2.8.0` |

---

> [!WARNING]
> Some dependencies could not be looked up. Check the Dependency
Dashboard for more information.

---

### Release Notes

<details>
<summary>mudler/LocalAI (docker.io/localai/localai)</summary>

### [`v2.8.0`](https://togithub.com/mudler/LocalAI/releases/tag/v2.8.0)

[Compare
Source](https://togithub.com/mudler/LocalAI/compare/v2.7.0...v2.8.0)

This release adds support for Intel GPUs, and it deprecates old
ggml-based backends which are by now superseded by llama.cpp (that now
supports more architectures out-of-the-box). See also
[mudler/LocalAI#1651.

Images are now based on Ubuntu 22.04 LTS instead of Debian bullseye.

##### Intel GPUs

There are now images tagged with "sycl". There are sycl-f16 and sycl-f32
images indicating f16 or f32 support.

For example, to start phi-2 with an Intel GPU it is enough to use the
container image like this:

docker run -e DEBUG=true -ti -v $PWD/models:/build/models -p 8080:8080
-v /dev/dri:/dev/dri --rm
quay.io/go-skynet/local-ai:master-sycl-f32-ffmpeg-core phi-2

##### What's Changed

##### Exciting New Features 🎉

- feat(sycl): Add support for Intel GPUs with sycl
([#&#8203;1647](https://togithub.com/mudler/LocalAI/issues/1647)) by
[@&#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1660
- Drop old falcon backend (deprecated) by
[@&#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1675
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1678
- Drop ggml-based gpt2 and starcoder (supported by llama.cpp) by
[@&#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1679
- fix(Dockerfile): sycl dependencies by
[@&#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1686
- feat: Use ubuntu as base for container images, drop deprecated
ggml-transformers backends by
[@&#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1689

##### 👒 Dependencies

- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1656
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1665
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1669
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1673
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1683
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1688
- ⬆️ Update mudler/go-stable-diffusion by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1674

##### Other Changes

- ⬆️ Update docs version mudler/LocalAI by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1661
- feat(mamba): Add bagel-dpo-2.8b by
[@&#8203;richiejp](https://togithub.com/richiejp) in
[mudler/LocalAI#1671
- fix (docs): fixed broken links `github/` -> `github.com/` by
[@&#8203;Wansmer](https://togithub.com/Wansmer) in
[mudler/LocalAI#1672
- Fix HTTP links in README.md by
[@&#8203;vfiftyfive](https://togithub.com/vfiftyfive) in
[mudler/LocalAI#1677
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1681
- ci: cleanup worker before run by
[@&#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1685
- Revert "fix(Dockerfile): sycl dependencies" by
[@&#8203;mudler](https://togithub.com/mudler) in
[mudler/LocalAI#1687
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[mudler/LocalAI#1691

##### New Contributors

- [@&#8203;richiejp](https://togithub.com/richiejp) made their first
contribution in
[mudler/LocalAI#1671
- [@&#8203;Wansmer](https://togithub.com/Wansmer) made their first
contribution in
[mudler/LocalAI#1672
- [@&#8203;vfiftyfive](https://togithub.com/vfiftyfive) made their first
contribution in
[mudler/LocalAI#1677

**Full Changelog**:
mudler/LocalAI@v2.7.0...v2.8.0

</details>

---

### Configuration

📅 **Schedule**: Branch creation - "before 10pm on monday" in timezone
Europe/Amsterdam, Automerge - At any time (no schedule defined).

🚦 **Automerge**: Enabled.

♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the
rebase/retry checkbox.

🔕 **Ignore**: Close this PR and you won't be reminded about these
updates again.

---

- [ ] <!-- rebase-check -->If you want to rebase/retry this PR, check
this box

---

This PR has been generated by [Renovate
Bot](https://togithub.com/renovatebot/renovate).

<!--renovate-debug:eyJjcmVhdGVkSW5WZXIiOiIzNy4xODMuMCIsInVwZGF0ZWRJblZlciI6IjM3LjE4My4wIiwidGFyZ2V0QnJhbmNoIjoibWFzdGVyIn0=-->
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Deprecation of old backends
1 participant