
feat: inline templates and accept URLs in models #1452

Merged on Dec 18, 2023 (4 commits)

@mudler (Owner) commented Dec 17, 2023

Description

This PR makes it possible to use inline templates and to specify URLs in models. As a result, the YAML file can now be a single, portable config file representing the model.

A model YAML file could now look like this:

name: mixtral

parameters:
  model: https://huggingface.co/TheBloke/Mixtral-8x7B-v0.1-GGUF/resolve/main/mixtral-8x7b-v0.1.Q2_K.gguf
# or just model: huggingface://TheBloke/Mixtral-8x7B-v0.1-GGUF/mixtral-8x7b-v0.1.Q2_K.gguf@main
template:
  completion: |
            Complete the following: {{.Input}}
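
The two `model:` forms in the example above point at the same file, which suggests a simple mapping from the `huggingface://` shorthand to the full `https://huggingface.co/.../resolve/...` URL. The following is a minimal sketch of that mapping, not the code from this PR: the function name `expandHuggingFaceURI` is hypothetical, and defaulting to the `main` branch when no `@branch` suffix is given is an assumption.

```go
package main

import (
	"fmt"
	"strings"
)

// expandHuggingFaceURI is a hypothetical helper, not LocalAI's actual code.
// It rewrites huggingface://<owner>/<repo>/<file>[@<branch>] into
// https://huggingface.co/<owner>/<repo>/resolve/<branch>/<file>.
// Any string without the huggingface:// prefix is returned unchanged.
func expandHuggingFaceURI(s string) string {
	const prefix = "huggingface://"
	if !strings.HasPrefix(s, prefix) {
		return s
	}
	rest := strings.TrimPrefix(s, prefix)

	// Split off an optional "@branch" suffix; "main" is an assumed default.
	branch := "main"
	if i := strings.LastIndex(rest, "@"); i >= 0 {
		rest, branch = rest[:i], rest[i+1:]
	}

	// Expect owner/repo/file; the file part may itself contain slashes.
	parts := strings.SplitN(rest, "/", 3)
	if len(parts) != 3 {
		return s // not in the expected shape, leave it untouched
	}
	return fmt.Sprintf("https://huggingface.co/%s/%s/resolve/%s/%s",
		parts[0], parts[1], branch, parts[2])
}

func main() {
	fmt.Println(expandHuggingFaceURI(
		"huggingface://TheBloke/Mixtral-8x7B-v0.1-GGUF/mixtral-8x7b-v0.1.Q2_K.gguf@main"))
}
```

Run on the shorthand from the example, this prints the same `https://` URL used in the first `model:` line. Plain file names and regular URLs pass through untouched, so such a helper could sit in front of any existing download logic.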

TODO: tests, examples and docs updates

Partly related to #1373


netlify bot commented Dec 17, 2023

Deploy Preview for localai canceled.

| Name | Link |
|---|---|
| 🔨 Latest commit | 47471ec |
| 🔍 Latest deploy log | https://app.netlify.com/sites/localai/deploys/658034adeb152e0008481b2f |

@mudler mudler added the enhancement (New feature or request) label Dec 17, 2023
Review thread on api/config/config.go (outdated, resolved)
@mudler mudler force-pushed the enhancements branch 3 times, most recently from a8eeda5 to 18e581f Compare December 17, 2023 09:09
@mudler mudler changed the title from "feat: inline templates and accepts URLs in models" to "feat: inline templates and accept URLs in models" Dec 17, 2023
@mudler mudler force-pushed the enhancements branch 2 times, most recently from 831566f to 47425cb Compare December 18, 2023 11:59
@mudler mudler merged commit 1fc3a37 into master Dec 18, 2023
29 checks passed
@mudler mudler deleted the enhancements branch December 18, 2023 17:58
truecharts-admin added a commit to truecharts/charts that referenced this pull request Dec 25, 2023
…2.0 by renovate (#16499)

This PR contains the following updates:

| Package | Update | Change |
|---|---|---|
| [quay.io/go-skynet/local-ai](https://togithub.com/mudler/LocalAI) | minor | `v2.1.0-cublas-cuda11-ffmpeg` -> `v2.2.0-cublas-cuda11-ffmpeg` |

---

> [!WARNING]
> Some dependencies could not be looked up. Check the Dependency
Dashboard for more information.

---

### Release Notes

<details>
<summary>mudler/LocalAI (quay.io/go-skynet/local-ai)</summary>

### [`v2.2.0`](https://togithub.com/mudler/LocalAI/releases/tag/v2.2.0)

[Compare
Source](https://togithub.com/mudler/LocalAI/compare/v2.1.0...v2.2.0)

##### What's Changed

This release brings updates to the backends and includes a fix for
recompilation of LocalAI with go-rwkv
(mudler/LocalAI#1473).
It also tries to reduce the image size by allowing some
backends (transformers-based) to share the same environment.

With this release, inline templates and models as URLs are supported, for
example:

```yaml
name: mixtral

parameters:
  model: https://huggingface.co/TheBloke/Mixtral-8x7B-v0.1-GGUF/resolve/main/mixtral-8x7b-v0.1.Q2_K.gguf

# or huggingface://TheBloke/Mixtral-8x7B-v0.1-GGUF/mixtral-8x7b-v0.1.Q2_K.gguf@main
template:
  completion: |
            Complete the following: {{.Input}}
```
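
The `{{.Input}}` placeholder in the inline `completion` template looks like Go `text/template` syntax. Assuming that is the engine in use, the sketch below shows how such an inline string could be rendered. The `promptData` struct and the `renderInline` function are hypothetical names for illustration; only the `Input` field name comes from the example.

```go
package main

import (
	"bytes"
	"fmt"
	"text/template"
)

// promptData is a hypothetical struct; only the Input field name
// is taken from the example template.
type promptData struct {
	Input string
}

// renderInline parses an inline template string and executes it
// against the given data, returning the rendered prompt.
func renderInline(tmpl string, data promptData) (string, error) {
	t, err := template.New("completion").Parse(tmpl)
	if err != nil {
		return "", err
	}
	var buf bytes.Buffer
	if err := t.Execute(&buf, data); err != nil {
		return "", err
	}
	return buf.String(), nil
}

func main() {
	// A YAML literal block scalar ("|") strips the common indentation
	// and keeps one trailing newline, so the parsed string looks like this.
	inline := "Complete the following: {{.Input}}\n"

	out, err := renderInline(inline, promptData{Input: "Once upon a time"})
	if err != nil {
		panic(err)
	}
	fmt.Print(out)
}
```

Because the template travels inside the YAML file as a plain string, no separate template file needs to ship alongside the model config.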

##### Exciting New Features 🎉

- docs: automatically track latest versions by [@mudler](https://togithub.com/mudler) in mudler/LocalAI#1451
- feat: inline templates and accept URLs in models by [@mudler](https://togithub.com/mudler) in mudler/LocalAI#1452
- feat(conda): share envs with transformer-based backends by [@mudler](https://togithub.com/mudler) in mudler/LocalAI#1465

##### 👒 Dependencies

- ⬆️ Update ggerganov/llama.cpp by [@localai-bot](https://togithub.com/localai-bot) in mudler/LocalAI#1450
- ⬆️ Update ggerganov/llama.cpp by [@localai-bot](https://togithub.com/localai-bot) in mudler/LocalAI#1455
- ⬆️ Update ggerganov/llama.cpp by [@localai-bot](https://togithub.com/localai-bot) in mudler/LocalAI#1457
- ⬆️ Update ggerganov/whisper.cpp by [@localai-bot](https://togithub.com/localai-bot) in mudler/LocalAI#1462
- ⬆️ Update ggerganov/llama.cpp by [@localai-bot](https://togithub.com/localai-bot) in mudler/LocalAI#1461
- ⬆️ Update donomii/go-rwkv.cpp by [@localai-bot](https://togithub.com/localai-bot) in mudler/LocalAI#1470

##### Other Changes

- How To (Updates and Fixes) by [@lunamidori5](https://togithub.com/lunamidori5) in mudler/LocalAI#1456
- Revert ":arrow_up: Update donomii/go-rwkv.cpp" by [@mudler](https://togithub.com/mudler) in mudler/LocalAI#1474

**Full Changelog**:
mudler/LocalAI@v2.1.0...v2.2.0

</details>

---

### Configuration

📅 **Schedule**: Branch creation - "before 10pm on monday" in timezone
Europe/Amsterdam, Automerge - At any time (no schedule defined).

🚦 **Automerge**: Enabled.

♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the
rebase/retry checkbox.

🔕 **Ignore**: Close this PR and you won't be reminded about this update
again.

---

- [ ] <!-- rebase-check -->If you want to rebase/retry this PR, check
this box

---

This PR has been generated by [Renovate
Bot](https://togithub.com/renovatebot/renovate).
