tritonserver return error result for codellama #98

@Lzhang-hub

I launched tritonserver with codellama-7b-hf, following the README, and sent a request over HTTP:

curl -X POST localhost:8000/v2/models/ensemble/generate -d '{"text_input": "write a quick sort", "max_tokens": 20, "bad_words": "", "stop_words": ""}'

I get this result, which just echoes the prompt and then repeats "Jane":

{"model_name":"ensemble","model_version":"1","sequence_end":false,"sequence_id":0,"sequence_start":false,"text_output":"<s> write a quick sort Jane Jane Jane Jane Jane Jane Jane Jane Jane Jane Jane Jane Jane Jane Jane Jane Jane Jane Jane Jane"}
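For anyone who wants to script the reproduction, here is a minimal Python sketch of the same request. The URL and JSON fields are copied from the curl command above; the function names `build_request` and `generate` are hypothetical helpers, and the explicit `Content-Type: application/json` header is an assumption (curl's `-d` sends a form content type by default).

```python
import json
import urllib.request

# Assumption: same endpoint as the curl command in this issue.
URL = "http://localhost:8000/v2/models/ensemble/generate"


def build_request(prompt, max_tokens=20, url=URL):
    """Build the POST request with the same JSON fields as the curl command."""
    payload = {
        "text_input": prompt,
        "max_tokens": max_tokens,
        "bad_words": "",
        "stop_words": "",
    }
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},  # assumption, see above
        method="POST",
    )


def generate(prompt, max_tokens=20, url=URL, timeout=30):
    """Send the request and return the "text_output" field of the response."""
    request = build_request(prompt, max_tokens, url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.loads(response.read().decode("utf-8"))
    return body["text_output"]
```

With the server running, `print(generate("write a quick sort"))` should print the same `text_output` as the curl call.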

When I run run.py directly with the same prompt, I get the expected result:

Input: "write a quick sort"
Output: "algorithm in C++.

I have a vector of integers and I want to sort it using quick sort.

I have written the following code:

\begin{code}
#include <iostream>
#include <vector>

using namespace std;

void quickSort(vector<int> &v, int left, int right)
{
    int i = left, j = right;
    int pivot = v[left];

    while (i < j)
    {
        while (v[j] >= pivot)
            j--;
        while (v[i] <= pivot)
            i++;
        if (i < j)
        {
            int temp = v[i];
            v[i] = v[j];
            v[j] = temp;
        }
    }

Labels: triaged (Issue has been triaged by maintainers)
