Skip to content

Conversation

@Abhishek-TAMU
Copy link
Contributor

As Iterations to generate tokens request is controlled by iters, currently the value assigns to 1024 // max_new_tokens which caps the usage of max_new_tokens to 1024. To increase this added a new env var FMS_TEST_SHAPES_CUMULATIVE_TEST_TOKENS_PER_SEQUENCE to be used in test_decoders for the same.

Signed-off-by: Abhishek <maurya.abhishek@ibm.com>
Signed-off-by: Abhishek <maurya.abhishek@ibm.com>
USE_DISTRIBUTED = os.environ.get("FMS_TEST_SHAPES_DISTRIBUTED", "0") == "1"
TIMING = os.environ.get("TIMING", "")

CUMULATIVE_TEST_TOKENS_PER_SEQUENCE = os.environ.get(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we should wrap this in an int as the below statement will be doing "1024" // max_new_tokens

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@JRosenkranz Thanks for the review. Pushed the change.

Signed-off-by: Abhishek <maurya.abhishek@ibm.com>
@JRosenkranz
Copy link
Contributor

bot:test
TEST_FILE=test_decoders.py MODEL_ID=ibm-granite/granite-3.3-8b-instruct BATCH_SIZE=1 SEQUENCE_LENGTH=2048 USE_TINY_MODEL=1 NUM_AIU=4

@Abhishek-TAMU
Copy link
Contributor Author

bot:test
TEST_FILE=test_decoders.py MODEL_ID=ibm-granite/granite-3.3-8b-instruct BATCH_SIZE=1 SEQUENCE_LENGTH=2048 USE_TINY_MODEL=1 NUM_AIU=4 IMAGE_URL=icr.io/ibmaiu_internal/1.0/x86_64/spyre:dev FMS_TEST_SHAPES_VALIDATION_INFO_DIR=/home/senuser/models/validation_info/sharegpt

@Abhishek-TAMU
Copy link
Contributor Author

@JRosenkranz Spyre-ci test is successful now.

Copy link
Contributor

@JRosenkranz JRosenkranz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@JRosenkranz JRosenkranz merged commit ea7efd1 into foundation-model-stack:main Sep 10, 2025
3 checks passed
@Abhishek-TAMU Abhishek-TAMU deleted the add_cumulative_env branch October 7, 2025 19:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants