
Update binaries feb 2024 #479

Merged
6 commits merged into SciSharp:master on Feb 6, 2024

Conversation

martindevans
Member

The previous update (#460) was cancelled due to other work that changed things (introducing CLBLAST binaries and renaming the Windows binaries).

@martindevans
Member Author

martindevans commented Feb 1, 2024

Testing:

  • Windows (CPU)
  • Windows (Nvidia GPU)
  • Windows (AMDGPU)
  • Linux (CPU)
  • Linux (GPU)
  • macOS (ARM)
  • macOS (x86)

@m0nsky
Contributor

m0nsky commented Feb 2, 2024

My application seems to run fine on both avx2 and cu12.1.0; however, raising the batch size from 4096 to 8192 still throws:

System.AccessViolationException: Attempted to read or write protected memory. This is often an indication that other memory is corrupt.

when using cu12.1.0 (avx2 has no issues with an 8192 batch size). I don't think I'll need an 8192 batch size, and I'm not sure whether this is expected, but I just wanted to let you know.

Also, clblast throws a LLama.Exceptions.RuntimeError ("The native library cannot be correctly loaded.") for me. Does it require anything else to be installed?

Edit: I'm on Windows 10 with an RTX 3080.
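
For reference, the batch size being discussed is the one set through LLamaSharp's ModelParams. A minimal sketch of the setup that triggered the error (the model path is a placeholder, and the property names are assumptions based on the API around this release):

```csharp
using LLama.Common;

// Hypothetical repro setup: raising BatchSize from 4096 to 8192 is what
// triggered the AccessViolationException on the cu12.1.0 backend, while
// the avx2 backend handled the same value without issues.
var parameters = new ModelParams("model.gguf")
{
    ContextSize = 8192,
    BatchSize = 8192  // was 4096 before
};
```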

@martindevans
Member Author

@jasoncouture any ideas about the CLBLAST loading issue?

@zsogitbe
Contributor

zsogitbe commented Feb 3, 2024

I have tested the new code with several models on Windows 10 & GPU and it works fine. Thank you Martin!
The above errors are most likely bugs in llama.cpp; try another version. I remember having issues like this with some versions of the C++ code. Also, it seems they corrected a bug in the batch processing code just yesterday.

@m0nsky
Contributor

m0nsky commented Feb 3, 2024

@martindevans Got clblast to work. I needed to copy clblast.dll to my executable dir (alongside the existing llama.dll); I just grabbed it from the latest llama.cpp release for now.

@martindevans
Member Author

Thanks for testing that @m0nsky. I've added a note to #464 (the tracking issue for OpenCL support) about distributing clblast.dll in the NuGet packages.

@ladeak

ladeak commented Feb 4, 2024

Is there a preview NuGet package to test?

@martindevans
Member Author

@ladeak there's no preview package; if you want to test it, your best option is to clone this branch.

@jasoncouture
Contributor

@martindevans should we use something like nbgv and publish preview builds?

@ladeak

ladeak commented Feb 4, 2024

@martindevans I was able to test Phi-2 on Windows 11 x64. However, I noticed a breaking change: if I want to load a model for both the LLamaEmbedder and the LLamaSharpChatCompletion, I need to load it with EmbeddingMode set to true. Is that right?

EDIT: I tested through the semantic-kernel integration.

@martindevans
Member Author

I noticed a breaking change: if I want to load a model for both the LLamaEmbedder and the LLamaSharpChatCompletion, I need to load it with EmbeddingMode set to true. Is that right?

Yep, that's a deliberate change 👍
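
For anyone hitting the same thing, here is a minimal sketch of loading a model for embedding use after this change (the model path is a placeholder, and the exact type and property names are assumptions based on the LLamaSharp API around this release):

```csharp
using LLama;
using LLama.Common;

// With this change, a model backing an LLamaEmbedder must be loaded
// with EmbeddingMode enabled, even if the same weights are also used
// for chat completion.
var parameters = new ModelParams("model.gguf")
{
    EmbeddingMode = true
};
using var weights = LLamaWeights.LoadFromFile(parameters);
var embedder = new LLamaEmbedder(weights, parameters);
```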

@jasoncouture
Contributor

@martindevans I posted the PRs for full CLBLAST support, including the nuspec (see #489). It solves the missing clblast.dll issue and adds Linux AMDGPU support as well.

@martindevans
Member Author

New run started here, using the same commit version.

@jasoncouture
Contributor

@martindevans looks like it failed due to that sha256 check. I'll put up a PR to pull it for now and get it figured out.

@martindevans
Member Author

Started another run here

@martindevans martindevans merged commit 17385e1 into SciSharp:master Feb 6, 2024
3 checks passed
@martindevans martindevans deleted the update_binaries_feb_2024 branch February 6, 2024 01:08
@ladeak

ladeak commented Feb 7, 2024

When do you plan to release the next version?

@martindevans
Member Author

@jasoncouture is making some tweaks to CLBLAST support; I think if that's done soon, we'll release then.

@liqngliz

Any update on when this release will be out?

@martindevans
Member Author

When it's ready! Hopefully this week.

I think if it's not done soon we'll push CLBLAST to the next release and get everything else out the door :)

6 participants