POWER10: Add optimized dgemm kernel by RajalakshmiSR · Pull Request #9652 · microsoft/onnxruntime

RajalakshmiSR · 2021-11-02T21:58:39Z

This patch makes use of POWER10 matrix multiply assist feature and
adds new DGEMM kernel.

This patch makes use of POWER10 matrix multiply assist feature and adds new DGEMM kernel.

RajalakshmiSR · 2021-11-04T12:57:07Z

@yufenglee Requesting review. I also have handled common header changes in this PR that you commented in DGEMM PR last week.

RajalakshmiSR · 2021-11-10T15:51:58Z

@yufenglee @snnn Requesting review.

RajalakshmiSR · 2021-11-18T02:36:03Z

@yufenglee Just a reminder on this review.

yufenglee · 2021-11-22T16:32:17Z

onnxruntime/core/mlas/lib/power/DgemmKernelPOWER10.cpp

+    MLAS_FLOAT64X2 ABroadcast[RowCount]
+    )
+{
+        ABroadcast[0] = vec_mergee (AElements[0], AElements[1]);


[](http://example.com/codeflow?start=0&length=8)

nit: space

yufenglee · 2021-11-22T16:38:15Z

/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux Nuphar CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, Windows CPU CI Pipeline, Windows GPU TensorRT CI Pipeline, onnxruntime-python-checks-ci-pipeline, orttraining-amd-gpu-ci-pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed

azure-pipelines · 2021-11-22T16:38:21Z

You have several pipelines (over 10) configured to build pull requests in this repository. Specify which pipelines you would like to run by using /azp run [pipelines] command. You can specify multiple pipelines using a comma separated list.

yufenglee · 2021-11-22T16:38:27Z

/azp run Windows GPU TensorRT CI Pipeline, onnxruntime-python-checks-ci-pipeline, orttraining-amd-gpu-ci-pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed

azure-pipelines · 2021-11-22T16:39:01Z

Azure Pipelines successfully started running 6 pipeline(s).

yufenglee · 2021-11-22T21:22:16Z

/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux Nuphar CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, Windows CPU CI Pipeline, Windows GPU TensorRT CI Pipeline, onnxruntime-python-checks-ci-pipeline, orttraining-amd-gpu-ci-pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed

azure-pipelines · 2021-11-22T21:22:24Z

You have several pipelines (over 10) configured to build pull requests in this repository. Specify which pipelines you would like to run by using /azp run [pipelines] command. You can specify multiple pipelines using a comma separated list.

yufenglee · 2021-11-22T21:22:28Z

/azp run Windows GPU TensorRT CI Pipeline, onnxruntime-python-checks-ci-pipeline, orttraining-amd-gpu-ci-pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed

azure-pipelines · 2021-11-22T21:23:02Z

Azure Pipelines successfully started running 6 pipeline(s).

yufenglee · 2021-11-22T23:44:01Z

/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux Nuphar CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, Windows CPU CI Pipeline, Windows GPU CI Pipeline

azure-pipelines · 2021-11-22T23:44:51Z

Azure Pipelines successfully started running 10 pipeline(s).

jingyanwangms · 2021-11-23T00:56:32Z

/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux Nuphar CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, Windows CPU CI Pipeline, Windows GPU CI Pipeline

azure-pipelines · 2021-11-23T00:57:19Z

Azure Pipelines successfully started running 10 pipeline(s).

RajalakshmiSR · 2021-11-23T12:50:42Z

@yufenglee Thanks for the review.

POWER10: Add optimized dgemm kernel

2aced19

This patch makes use of POWER10 matrix multiply assist feature and adds new DGEMM kernel.

snnn requested a review from yufenglee November 10, 2021 22:26

yufenglee reviewed Nov 22, 2021

View reviewed changes

Indentation update

cb33de8

yufenglee approved these changes Nov 22, 2021

View reviewed changes

jingyanwangms added the release:1.10 label Nov 23, 2021

jingyanwangms merged commit 8564fc1 into microsoft:master Nov 23, 2021

jingyanwangms removed the release:1.10 label Nov 24, 2021

Conversation

RajalakshmiSR commented Nov 2, 2021

Uh oh!

RajalakshmiSR commented Nov 4, 2021

Uh oh!

RajalakshmiSR commented Nov 10, 2021

Uh oh!

RajalakshmiSR commented Nov 18, 2021

Uh oh!

yufenglee Nov 22, 2021

Choose a reason for hiding this comment

Uh oh!

yufenglee commented Nov 22, 2021

Uh oh!

azure-pipelines bot commented Nov 22, 2021

Uh oh!

yufenglee commented Nov 22, 2021

Uh oh!

azure-pipelines bot commented Nov 22, 2021

Uh oh!

yufenglee commented Nov 22, 2021

Uh oh!

azure-pipelines bot commented Nov 22, 2021

Uh oh!

yufenglee commented Nov 22, 2021

Uh oh!

azure-pipelines bot commented Nov 22, 2021

Uh oh!

yufenglee commented Nov 22, 2021

Uh oh!

azure-pipelines bot commented Nov 22, 2021

Uh oh!

jingyanwangms commented Nov 23, 2021

Uh oh!

azure-pipelines bot commented Nov 23, 2021

Uh oh!

RajalakshmiSR commented Nov 23, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants