
Bump llama cpp b1656 #3095

Merged

merged 2 commits into master from hydai/bump_llama_cpp_b1656 on Dec 20, 2023

Conversation

hydai
Member

@hydai hydai commented Dec 18, 2023

No description provided.

@hydai hydai requested a review from dm4 December 18, 2023 04:34
Member

juntao commented Dec 18, 2023

Hello, I am a code review bot on flows.network. Here are my reviews of code commits in this PR.


The pull request titled "Bump llama cpp b1656" comprises two sets of changes. The first updates the llama library version pinned in plugins/wasi_nn/CMakeLists.txt from b1616 to b1656, presumably to pick up new features or bug fixes. However, the update carries potential issues: incompatibilities if the newer releases contain breaking changes, errors from applying the unmodified ggml.patch to the newer sources, runtime or build-process implications that call for thorough testing, and harder debugging because GIT_SHALLOW is set to TRUE and only a snapshot of the repository history is fetched.

The second set of changes adds device-specific configuration to the model-parsing function in the WASI-NN plugin, with a focus on macOS compatibility. The code now forces Metal (Apple's hardware-accelerated graphics API) on macOS by setting the number of GPU layers to 1, and the hack-workaround comment has been moved into the non-Apple branch. However, the change has potential drawbacks: it does not verify that the Metal API is actually available on the machine, it makes behavior platform-dependent and thus prone to inconsistency, and a fixed GPU layer setting may be suboptimal for some macOS use cases.

Overall the modifications are straightforward, but given these potential implications it is recommended to review and test the changes carefully and to consider offering configuration options to end users, which would also improve maintainability.

Details

Commit 47ec62392e44137b4f4e63d06414be0a04f78878

The key change in this patch updates the version of the llama library used by the WASI-NN plugin's ggml backend: the GIT_TAG in plugins/wasi_nn/CMakeLists.txt moves from b1616 to b1656, so the build system fetches a newer llama.cpp snapshot when setting up the project dependencies, presumably to take advantage of new features or bug fixes in the updated version.
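
For context, the declaration being bumped has roughly the following shape. This is a minimal sketch using CMake's FetchContent; the repository URL, target name, and patch wiring are illustrative assumptions, not the actual contents of plugins/wasi_nn/CMakeLists.txt.

    # Minimal sketch of a pinned llama.cpp dependency; the names and
    # paths here are assumptions, not the real file contents.
    include(FetchContent)
    FetchContent_Declare(
      llama
      GIT_REPOSITORY https://github.com/ggerganov/llama.cpp.git
      GIT_TAG        b1656   # bumped from b1616 by this PR
      GIT_SHALLOW    TRUE    # fetch only the tagged snapshot
      # ggml.patch is applied unchanged on top of the fetched sources.
      PATCH_COMMAND  git apply ${CMAKE_CURRENT_SOURCE_DIR}/ggml.patch
    )
    FetchContent_MakeAvailable(llama)

Pinning a tag with GIT_SHALLOW TRUE keeps fetches fast, but it is also why the full git history is unavailable for later debugging, as point 4 below notes.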

However, there are a few potential problems:

  1. Potential Incompatibility: The update from b1616 to b1656 could introduce incompatibilities if any of the intervening releases contain breaking changes.

  2. Testing: The change may affect the runtime or the build process, so thorough testing is advisable to ensure the new version does not introduce bugs or unintended side effects.

  3. Patch File: ggml.patch is applied without any changes. If the newer llama version touches the code that ggml.patch modifies, the patch could fail to apply cleanly or introduce unexpected behavior.

  4. Shallow Clone: GIT_SHALLOW is set to TRUE, so only a snapshot of the specified version is fetched rather than the complete git history. This makes debugging harder if something goes wrong later, because the full history is not available locally.

Overall, this change appears straightforward, but its potential implications make it necessary to carefully review and test it before merging into the main codebase.

Commit 77805792903bc9ccf43e1c3f29e0a22e39f78b07

The patch to ggml.cpp implements device-specific configuration for the model-parsing function in the WASI-NN plugin, adjusting the code's behavior depending on whether it runs on macOS.

Key Changes:

  1. The developer added an Apple-device branch in which the number of GPU layers (GraphRef.NGPULayers) is set to 1, force-enabling Metal (Apple's hardware-accelerated graphics API) on macOS; see the sketch after this list.

  2. The code comment describing the hack workaround was moved into the non-Apple section.

  3. The commentary about the workaround was expanded to make clear that it applies to non-macOS devices. In addition to the original note about the limitation of the WASI-NN proposal, the comment now warns other developers not to use this approach if the model parameters are updated in the Config stage, to avoid reloading the model.
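
A minimal sketch of that platform split, assuming the Graph type and NGPULayers field named in the review text and omitting the surrounding model-loading logic:

    // Minimal sketch of the described platform split; the Graph type
    // and field name are assumptions taken from the review text.
    #include <cstdint>

    struct Graph {
      int64_t NGPULayers = 0;
    };

    void applyPlatformDefaults(Graph &GraphRef) {
    #if defined(__APPLE__)
      // Force Metal on macOS by offloading one layer to the GPU.
      GraphRef.NGPULayers = 1;
    #else
      // Non-Apple devices keep the previous default (no GPU layers);
      // the hack-workaround comment described above lives here.
      GraphRef.NGPULayers = 0;
    #endif
    }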

Potential Problems:

  1. The code does not check whether the Metal API is actually available. Running macOS does not guarantee Metal support; older Mac machines may lack it.

  2. Behavior now differs by platform, which could lead to inconsistencies if not properly managed.

  3. Forcing a single GPU layer on macOS may not be the optimal setting for all use cases.

Finally, as a reviewer, I would suggest that a better practice is to offer configuration options to the end user rather than hardcoding the settings in the source code, giving users control over their own systems. This would require a more substantial code change but would generally be a more maintainable solution; one possible shape is sketched below.
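
As a hypothetical sketch of that suggestion, the GPU layer count could be read from a caller-supplied metadata string instead of being hardcoded. The "n-gpu-layers" key and both helper functions are invented for illustration and are not the plugin's actual API.

    // Hypothetical sketch: let the caller override the GPU layer count.
    // The "n-gpu-layers" key and these helpers are illustrative only.
    #include <cstdint>
    #include <optional>
    #include <string>

    std::optional<int64_t> parseNGPULayers(const std::string &Metadata) {
      // Look for a "n-gpu-layers": <integer> entry in a JSON-like
      // metadata string supplied at the Config stage.
      const std::string Key = "\"n-gpu-layers\":";
      const auto Pos = Metadata.find(Key);
      if (Pos == std::string::npos) {
        return std::nullopt; // Not set: use the platform default.
      }
      return std::stoll(Metadata.substr(Pos + Key.size()));
    }

    int64_t selectNGPULayers(const std::string &Metadata) {
      if (const auto N = parseNGPULayers(Metadata)) {
        return *N; // An explicit user setting wins on every platform.
      }
    #if defined(__APPLE__)
      return 1; // Current hardcoded default: force Metal on macOS.
    #else
      return 0; // Current default elsewhere: CPU only.
    #endif
    }

With this shape, selectNGPULayers("{\"n-gpu-layers\": 4}") returns 4 on any platform, while selectNGPULayers("{}") falls back to the platform default.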

@github-actions github-actions bot added the c-Plugin (An issue related to WasmEdge Plugin) label Dec 18, 2023
dm4 previously approved these changes Dec 18, 2023
@dm4 dm4 self-requested a review December 18, 2023 05:54
plugins/wasi_nn/ggml.cpp (review thread outdated and resolved)

codecov bot commented Dec 20, 2023

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (f576ba7) 80.84% compared to head (7780579) 80.84%.
Report is 1 commit behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #3095      +/-   ##
==========================================
- Coverage   80.84%   80.84%   -0.01%     
==========================================
  Files         159      159              
  Lines       23035    23035              
  Branches     4734     4734              
==========================================
- Hits        18623    18622       -1     
  Misses       3131     3131              
- Partials     1281     1282       +1     

☔ View full report in Codecov by Sentry.

Commits 47ec623 and 7780579, each signed off by hydai <z54981220@gmail.com>.
@hydai hydai merged commit cc038b4 into master Dec 20, 2023
49 of 50 checks passed
@hydai hydai deleted the hydai/bump_llama_cpp_b1656 branch December 20, 2023 09:36