# Add fmperf Library and Update Dependencies #42

wangchen615 · 2025-03-27T18:27:04Z

This PR adds the fmperf library and updates project dependencies to support it.

Changes

Added fmperf library with its core components:
- Cluster.py: Kubernetes cluster management
- ModelSpecs.py: Model specification handling
- WorkloadSpecs.py: Workload configuration
- utils/: Utility modules for benchmarking, logging, and data processing
Updated project dependencies in pyproject.toml:
- Added pandas>=2.2.0 for data processing
- Added kubernetes>=29.0.0 for cluster management
- Added pyyaml>=6.0.1 for configuration handling
Fixed code quality issues:
- Added proper type hints and imports
- Fixed bare except statements
- Improved boolean comparisons
- Added __all__ exports for better module organization
- Fixed loop control variable conflicts

Features

Kubernetes cluster management for model deployment
Support for different workload types:
- Homogeneous workloads
- Heterogeneous workloads
- Realistic workloads
Benchmarking utilities for performance testing
Data processing and analysis tools

Testing

Verified all dependencies install correctly
Tested fmperf library functionality
Confirmed code quality checks pass

Notes

The fmperf library is designed to work with Kubernetes clusters
Supports both TGI and vLLM model deployments
Includes utilities for workload generation and performance analysis

Signed-off-by: Chen Wang <Chen.Wang1@ibm.com>

k8s-ci-robot · 2025-03-27T18:27:11Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: wangchen615

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [wangchen615]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

achandrasekar

Thanks for sending this out! Having the ability to deploy the model server and benchmark with different configurations makes sense. It would be good to get this working with the inference-perf library and clean up additional logic like report generation that is handled separately by inference-perf.

achandrasekar · 2025-03-31T21:53:29Z

fmperf/Cluster.py

+
+from kubernetes import client
+
+from fmperf.ModelSpecs import ModelSpec, TGISModelSpec, vLLMModelSpec


Should we call this library something else instead of fmperf? Maybe a name that makes it clear that it simplifies deployment of model server and the benchmarking tool?

@achandrasekar , what would be the good library name?

achandrasekar · 2025-03-31T21:57:27Z

fmperf/WorkloadSpecs.py

+from fmperf.Cluster import DeployedModel
+
+
+class WorkloadSpec:


Can we have this deploy the inference-perf tool instead?

Will take a look at your tool. Thanks, @achandrasekar

achandrasekar · 2025-03-31T22:00:22Z

fmperf/utils/Parsing.py

+pd.set_option("future.no_silent_downcasting", True)
+
+
+def parse_results(results, print_df=False, print_csv=False):


Would be good to replace this with the reportgen in inference-perf.

k8s-ci-robot · 2025-04-01T06:12:54Z

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

Add fmperf library and update dependencies

ece4dc9

Signed-off-by: Chen Wang <Chen.Wang1@ibm.com>

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Mar 27, 2025

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Mar 27, 2025

k8s-ci-robot requested review from ArangoGutierrez and SergeyKanzhelev March 27, 2025 18:27

k8s-ci-robot added the size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. label Mar 27, 2025

achandrasekar reviewed Mar 31, 2025

View reviewed changes

k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Apr 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

# Add fmperf Library and Update Dependencies #42

# Add fmperf Library and Update Dependencies #42

wangchen615 commented Mar 27, 2025

Uh oh!

k8s-ci-robot commented Mar 27, 2025

Uh oh!

achandrasekar left a comment

Uh oh!

achandrasekar Mar 31, 2025

Uh oh!

wangchen615 May 22, 2025

Uh oh!

achandrasekar Mar 31, 2025

Uh oh!

wangchen615 May 22, 2025

Uh oh!

achandrasekar Mar 31, 2025

Uh oh!

k8s-ci-robot commented Apr 1, 2025

Uh oh!

Uh oh!


		from kubernetes import client

		from fmperf.ModelSpecs import ModelSpec, TGISModelSpec, vLLMModelSpec

		pd.set_option("future.no_silent_downcasting", True)


		def parse_results(results, print_df=False, print_csv=False):

# Add fmperf Library and Update Dependencies #42

Are you sure you want to change the base?

# Add fmperf Library and Update Dependencies #42

Conversation

wangchen615 commented Mar 27, 2025

Changes

Features

Testing

Notes

Uh oh!

k8s-ci-robot commented Mar 27, 2025

Uh oh!

achandrasekar left a comment

Choose a reason for hiding this comment

Uh oh!

achandrasekar Mar 31, 2025

Choose a reason for hiding this comment

Uh oh!

wangchen615 May 22, 2025

Choose a reason for hiding this comment

Uh oh!

achandrasekar Mar 31, 2025

Choose a reason for hiding this comment

Uh oh!

wangchen615 May 22, 2025

Choose a reason for hiding this comment

Uh oh!

achandrasekar Mar 31, 2025

Choose a reason for hiding this comment

Uh oh!

k8s-ci-robot commented Apr 1, 2025

Uh oh!

Uh oh!