Skip to content

-ngl to load ·last n layers· to gpu #12577

Closed
@Bobchenyx

Description

@Bobchenyx

Hi there,

I would like to know if we could use -ngl to load last N layers to GPU instead of first N.

If possible can someone please point me to a place where I should modify the source code?

llama-bench for example.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions