gemma-7b-it-lora.md

model:
  id: 337170b7-bd2f-4631-9a57-688b579cf6d3
  source: 1
  name: "@cf/google/gemma-7b-it-lora"
  description: This is a Gemma-7B base model that Cloudflare dedicates for inference with LoRA adapters. Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.
  task:
    id: c329a1f9-323d-4e91-b2aa-582dd4188d34
    name: Text Generation
    description: Family of generative text models, such as large language models (LLM), that can be adapted for a variety of natural language tasks.
  tags: []
  properties:
    - property_id: beta
      value: true
    - property_id: lora
      value: true
task_type: text-generation
model_display_name: gemma-7b-it-lora
layout: model
weight: 0
title: gemma-7b-it-lora
json_schema:
  input:
    {
      "type": "object",
      "oneOf": [
        {
          "properties": {
            "prompt": { "type": "string", "minLength": 1, "maxLength": 6144 },
            "raw": { "type": "boolean", "default": false },
            "stream": { "type": "boolean", "default": false },
            "max_tokens": { "type": "integer", "default": 256 },
            "temperature": { "type": "number", "minimum": 0, "maximum": 5 },
            "top_p": { "type": "number", "minimum": 0, "maximum": 2 },
            "top_k": { "type": "integer", "minimum": 1, "maximum": 50 },
            "seed": { "type": "integer", "minimum": 1, "maximum": 9999999999 },
            "repetition_penalty": { "type": "number", "minimum": 0, "maximum": 2 },
            "frequency_penalty": { "type": "number", "minimum": 0, "maximum": 2 },
            "presence_penalty": { "type": "number", "minimum": 0, "maximum": 2 },
            "lora": { "type": "string" }
          },
          "required": [ "prompt" ]
        },
        {
          "properties": {
            "messages": {
              "type": "array",
              "items": {
                "type": "object",
                "properties": {
                  "role": { "type": "string" },
                  "content": { "type": "string", "maxLength": 6144 }
                },
                "required": [ "role", "content" ]
              }
            },
            "stream": { "type": "boolean", "default": false },
            "max_tokens": { "type": "integer", "default": 256 },
            "temperature": { "type": "number", "minimum": 0, "maximum": 5 },
            "top_p": { "type": "number", "minimum": 0, "maximum": 2 },
            "top_k": { "type": "integer", "minimum": 1, "maximum": 50 },
            "seed": { "type": "integer", "minimum": 1, "maximum": 9999999999 },
            "repetition_penalty": { "type": "number", "minimum": 0, "maximum": 2 },
            "frequency_penalty": { "type": "number", "minimum": 0, "maximum": 2 },
            "presence_penalty": { "type": "number", "minimum": 0, "maximum": 2 }
          },
          "required": [ "messages" ]
        }
      ]
    }
  output:
    {
      "oneOf": [
        {
          "type": "object",
          "contentType": "application/json",
          "properties": { "response": { "type": "string" } }
        },
        {
          "type": "string",
          "contentType": "text/event-stream",
          "format": "binary"
        }
      ]
    }
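
The input schema accepts two request forms: a `prompt` form, which is the only one that lists `raw` and `lora`, and a `messages` form. A minimal Python sketch of building and checking request bodies against those bounds might look like the following. The helper names (`build_prompt_request`, `build_messages_request`, `extract_response`) and the adapter name `my-adapter` are hypothetical and not part of any Cloudflare SDK, and the sketch sends nothing over the network. It is also stricter than the schema, which does not forbid extra properties: it rejects any parameter the schema does not list for the chosen form.

```python
import json

# (minimum, maximum) per sampling parameter, copied from the input schema.
NUMERIC_BOUNDS = {
    "temperature": (0, 5),
    "top_p": (0, 2),
    "top_k": (1, 50),
    "seed": (1, 9999999999),
    "repetition_penalty": (0, 2),
    "frequency_penalty": (0, 2),
    "presence_penalty": (0, 2),
}
INTEGER_PARAMS = {"top_k", "seed"}
MAX_TEXT_LENGTH = 6144  # maxLength of "prompt" and of each message "content"


def _check_options(options, flags):
    """Validate optional parameters; `flags` names the booleans this form allows."""
    for name, value in options.items():
        if name in flags:
            if not isinstance(value, bool):
                raise ValueError(f"{name} must be a boolean")
        elif name == "max_tokens":
            if isinstance(value, bool) or not isinstance(value, int):
                raise ValueError("max_tokens must be an integer")
        elif name in NUMERIC_BOUNDS:
            low, high = NUMERIC_BOUNDS[name]
            if isinstance(value, bool) or not isinstance(value, (int, float)):
                raise ValueError(f"{name} must be a number")
            if name in INTEGER_PARAMS and not isinstance(value, int):
                raise ValueError(f"{name} must be an integer")
            if not low <= value <= high:
                raise ValueError(f"{name} must be between {low} and {high}")
        else:
            # Design choice of this sketch, not a rule stated by the schema.
            raise ValueError(f"{name} is not listed in the schema for this form")


def build_prompt_request(prompt, lora=None, **options):
    """Build a body for the `prompt` form, the only form that lists `lora`."""
    if not isinstance(prompt, str) or not 1 <= len(prompt) <= MAX_TEXT_LENGTH:
        raise ValueError(f"prompt must be a string of 1 to {MAX_TEXT_LENGTH} characters")
    _check_options(options, flags={"raw", "stream"})
    body = {"prompt": prompt, **options}
    if lora is not None:
        if not isinstance(lora, str):
            raise ValueError("lora must be a string")
        body["lora"] = lora
    return body


def build_messages_request(messages, **options):
    """Build a body for the `messages` form, which lists neither `raw` nor `lora`."""
    if not isinstance(messages, list):
        raise ValueError("messages must be a list")
    for message in messages:
        role, content = message.get("role"), message.get("content")
        if not isinstance(role, str) or not isinstance(content, str):
            raise ValueError("each message needs string 'role' and 'content' fields")
        if len(content) > MAX_TEXT_LENGTH:
            raise ValueError(f"message content exceeds {MAX_TEXT_LENGTH} characters")
    _check_options(options, flags={"stream"})
    return {"messages": messages, **options}


def extract_response(body):
    """Read the `response` string from a non-streaming application/json body."""
    return json.loads(body)["response"]


if __name__ == "__main__":
    # "my-adapter" is a placeholder adapter name used only for illustration.
    request = build_prompt_request(
        "Write a haiku about adapters.", lora="my-adapter", temperature=0.7
    )
    print(json.dumps(request, indent=2))
```

`extract_response` covers only the `application/json` output described by the schema. The `text/event-stream` output, returned when `stream` is `true`, would need a separate reader, and any envelope a particular API endpoint wraps around the body is not handled here.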