Skip to content

# Add fmperf Library and Update Dependencies #42

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

wangchen615
Copy link
Contributor

This PR adds the fmperf library and updates project dependencies to support it.

Changes

  • Added fmperf library with its core components:

    • Cluster.py: Kubernetes cluster management
    • ModelSpecs.py: Model specification handling
    • WorkloadSpecs.py: Workload configuration
    • utils/: Utility modules for benchmarking, logging, and data processing
  • Updated project dependencies in pyproject.toml:

    • Added pandas>=2.2.0 for data processing
    • Added kubernetes>=29.0.0 for cluster management
    • Added pyyaml>=6.0.1 for configuration handling
  • Fixed code quality issues:

    • Added proper type hints and imports
    • Fixed bare except statements
    • Improved boolean comparisons
    • Added __all__ exports for better module organization
    • Fixed loop control variable conflicts

Features

  • Kubernetes cluster management for model deployment
  • Support for different workload types:
    • Homogeneous workloads
    • Heterogeneous workloads
    • Realistic workloads
  • Benchmarking utilities for performance testing
  • Data processing and analysis tools

Testing

  • Verified all dependencies install correctly
  • Tested fmperf library functionality
  • Confirmed code quality checks pass

Notes

  • The fmperf library is designed to work with Kubernetes clusters
  • Supports both TGI and vLLM model deployments
  • Includes utilities for workload generation and performance analysis

Signed-off-by: Chen Wang <Chen.Wang1@ibm.com>
@k8s-ci-robot k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Mar 27, 2025
@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: wangchen615

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Mar 27, 2025
@k8s-ci-robot k8s-ci-robot added the size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. label Mar 27, 2025
Copy link
Contributor

@achandrasekar achandrasekar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for sending this out! Having the ability to deploy the model server and benchmark with different configurations makes sense. It would be good to get this working with the inference-perf library and clean up additional logic like report generation that is handled separately by inference-perf.


from kubernetes import client

from fmperf.ModelSpecs import ModelSpec, TGISModelSpec, vLLMModelSpec
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we call this library something else instead of fmperf? Maybe a name that makes it clear that it simplifies deployment of model server and the benchmarking tool?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@achandrasekar , what would be the good library name?

from fmperf.Cluster import DeployedModel


class WorkloadSpec:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we have this deploy the inference-perf tool instead?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will take a look at your tool. Thanks, @achandrasekar

pd.set_option("future.no_silent_downcasting", True)


def parse_results(results, print_df=False, print_csv=False):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would be good to replace this with the reportgen in inference-perf.

@k8s-ci-robot k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Apr 1, 2025
@k8s-ci-robot
Copy link
Contributor

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants