【help】why  function  llama_build_graph  is internal function  llama_decode？

  I read the llama.cpp source code。
  I am confused as to why the function llama_build_graph needs to be called every time the function llama_decode is called.
 The function llama_build_graph cannot be called during program initialization, which will reduce the inference time.

static int llama_decode_internal(
         llama_context & lctx,
           llama_batch   batch) {
    ....
   ggml_cgraph * gf = llama_build_graph(lctx, batch, false);
.....
}

Thanks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

【help】why function llama_build_graph is internal function llama_decode？ #5916

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

【help】why function llama_build_graph is internal function llama_decode？ #5916

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions