
[Eval] Get logits output. #319

Merged: 2 commits into main from eval/test_with_dataset on Apr 17, 2024
Conversation

marvin-Yu
Contributor

No description provided.

include/models.h Outdated
@@ -70,6 +76,7 @@ class Model {
     std::vector<int32_t> inputIds;
     int batchSize;
     int seqLen;
+    int vocabSize_;
Contributor

Why do we need vocabSize in the Model class?
If it is needed, could you please make it follow the same naming style?

Contributor Author

Done

int vocabSize = model->getVocabSize();
int logitsN = batchSize * seqLen * vocabSize; // total logits: one vocab-sized row per position

if (model->getRank() == 0) { input(inputIds); } // only rank 0 feeds the input token ids
Contributor

Regarding the original code: why is the 'input' method not called 'setInput'? 'input' reads more like a keyword. :(

@pujiang2018
Contributor

Overall LGTM. @Duyi-Wang, please help review this, as it is closely related to the interface.

@Duyi-Wang
Contributor

I think it's better to stay aligned with transformers and use output_scores=True in generate() instead of forward().

output_scores (bool, optional, defaults to False) — Whether or not to return the prediction scores. See scores under returned tensors for more details.
output_logits (bool, optional) — Whether or not to return the unprocessed prediction logit scores. See logits under returned tensors for more details.
https://huggingface.co/docs/transformers/main_classes/text_generation#transformers.GenerationConfig.output_scores

int sampleSize = std::get<2>(result);

// Create a torch::Tensor from the C array
int64_t tdims[3] = {batchSize, seqLen, vocabSize};
Contributor

Is the shape correct? The decoder returns only the last token's logits.

Contributor Author

It sets "logitsAll" to true so that logits for all positions are output:

decoder->forward(..., logitsAll = true);
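
For reference, a minimal sketch of how the full-logits buffer might then be wrapped into a tensor, following the "// Create a torch::Tensor from the C array" comment in the snippet above; the torch::from_blob call and the clone() are assumptions, not the verbatim code:

#include <torch/torch.h>

// Hypothetical sketch: wrap the raw logits returned by forward() into a tensor.
float *outBuf = std::get<0>(model->forward());     // raw logits buffer
int64_t tdims[3] = {batchSize, seqLen, vocabSize}; // all positions, since logitsAll = true
// from_blob() does not copy the data; clone() detaches the tensor from the
// model-owned buffer before that buffer is reused by the next forward().
torch::Tensor logits = torch::from_blob(outBuf, tdims, torch::kFloat32).clone();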

std::tuple<float *, int, int> result = model->forward();
float *outBuf = std::get<0>(result);    // raw output logits buffer
int sampleOffset = std::get<1>(result); // offset of the sample region within the buffer
int sampleSize = std::get<2>(result);   // size of the sample region
Contributor

Is this synchronized across multiple ranks?

Contributor Author

Not supported in this PR.

@marvin-Yu
Contributor Author

I think it's better to stay aligned with transformers and use output_scores=True in generate() instead of forward().

output_scores (bool, optional, defaults to False) — Whether or not to return the prediction scores. See scores under returned tensors for more details.
output_logits (bool, optional) — Whether or not to return the unprocessed prediction logit scores. See logits under returned tensors for more details.
https://huggingface.co/docs/transformers/main_classes/text_generation#transformers.GenerationConfig.output_scores

This implementation aligns with the Hugging Face approach: it is simply the model's execution from input (token ids) to output (logits), without involving the searcher component; see the sketch below.
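
As a rough illustration, here is a minimal sketch of the input-to-logits flow described above, assembled from the snippets in this thread; the model method names come from those snippets, while the surrounding setup (the token ids, the shapes, and calling input() through the model pointer) is assumed:

// Hypothetical driver for the evaluation path (not the verbatim implementation).
std::vector<int32_t> inputIds = {1, 2, 3}; // example token ids: batchSize = 1, seqLen = 3
int vocabSize = model->getVocabSize();

if (model->getRank() == 0) { model->input(inputIds); } // feed token ids on rank 0

std::tuple<float *, int, int> result = model->forward(); // runs with logitsAll = true
float *outBuf = std::get<0>(result);

// With logitsAll enabled, outBuf holds batchSize * seqLen * vocabSize floats:
// one vocab-sized row of logits per input position.
int logitsN = 1 * 3 * vocabSize;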

@Duyi-Wang merged commit a87a55b into main on Apr 17, 2024
1 check passed
@Duyi-Wang deleted the eval/test_with_dataset branch on April 17, 2024 02:43