the output of bls is unstable

### System Info

Ubuntu 22.04
Triton image: nvcr.io/nvidia/tritonserver:24.06-trtllm-python-py3 (trtllm-backend version 0.10.0)
Model: qwen2-7b-instruct

### Who can help?

_No response_

### Information

- [ ] The official example scripts
- [ ] My own modified scripts

### Tasks

- [ ] An officially supported task in the `examples` folder (such as GLUE/SQuAD, ...)
- [ ] My own task or dataset (give details below)

### Reproduction

launch qwen2-7b-instruct in a container from the image nvcr.io/nvidia/tritonserver:24.06-trtllm-python-py3, then test the ensemble model and the bls model.

1 ensemble model test
1.1 there are no characters between the two sentences in the input text
![image](https://github.com/user-attachments/assets/becf6718-dcb0-47b3-9397-380ec3ee6e5f)
1.2 there are some spaces or '\n' between the two sentences in the input text; the semantics of the input text do not change
![image](https://github.com/user-attachments/assets/26bb087c-56d0-40ae-aaf5-69a077d3172f)
![image](https://github.com/user-attachments/assets/1f36d191-8606-46b8-91fd-495d766440f4)

2 bls model test
2.1 there are no characters between the two sentences in the input text
![image](https://github.com/user-attachments/assets/ce22dd19-a8c6-49c5-8c31-9974c3265c14)
2.2 there are some spaces or '\n' between the two sentences in the input text; the semantics of the input text do not change
![image](https://github.com/user-attachments/assets/1fc89761-a6c0-4454-b348-d2f5aa45a3bc)
![image](https://github.com/user-attachments/assets/c2be8791-37bb-4ee0-bbac-ac22916b8d1f)
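
since the cases above are only shown as screenshots, here is a minimal sketch of how they could be scripted. It is not the exact script used for the screenshots. Assumptions: the server listens on `localhost:8000`, the models are deployed under the names `ensemble` and `tensorrt_llm_bls`, requests go through Triton's HTTP generate endpoint with the `text_input` / `max_tokens` / `text_output` fields, and the two sentences passed to `make_variants` are placeholders rather than the real input text. The helper names (`build_payload`, `make_variants`, `generate`, `compare`, `main`) are hypothetical.

```python
import json
import urllib.request

# Assumption: Triton's HTTP endpoint on the default port.
TRITON_URL = "http://localhost:8000"


def build_payload(text, max_tokens=64):
    """Request body for the generate endpoint (field names are assumed)."""
    return {
        "text_input": text,
        "max_tokens": max_tokens,
        "bad_words": "",
        "stop_words": "",
    }


def make_variants(first, second):
    """Join two sentences with no separator, a space, and a newline."""
    return {
        "no_separator": first + second,
        "space": first + " " + second,
        "newline": first + "\n" + second,
    }


def generate(model, text, base_url=TRITON_URL):
    """Send one request to the given model and return its text output."""
    req = urllib.request.Request(
        f"{base_url}/v2/models/{model}/generate",
        data=json.dumps(build_payload(text)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["text_output"]


def compare(model, variants, generate_fn=generate):
    """Run every variant through one model; report whether all outputs match."""
    outputs = {name: generate_fn(model, text) for name, text in variants.items()}
    all_same = len(set(outputs.values())) == 1
    return outputs, all_same


def main():
    # Placeholder sentences; substitute the real input text.
    variants = make_variants("First sentence.", "Second sentence.")
    for model in ("ensemble", "tensorrt_llm_bls"):
        outputs, all_same = compare(model, variants)
        print(model, "-> identical" if all_same else "-> differs")
        for name, text in outputs.items():
            print(f"  [{name}] {text!r}")
```

calling `main()` with the server running prints, for each model, whether the three input variants produce the same output. `compare` takes `generate_fn` as a parameter so the comparison logic can be checked without a running server.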


### Expected behavior

the results of 1.1 and 1.2 are the same
the results of 2.1 and 2.2 are the same

### actual behavior

the results of 2.1 and 2.2 are not the same

### additional notes

 no
