fix(//core/conversion/converters/Weights): Fix buffer allocation for weights data #378
Conversation
Fix buffer allocation for weights data that occasionally may cause segfaults and causes issues with importing FP16 weights. Signed-off-by: Naren Dasan <naren@narendasan.com> Signed-off-by: Naren Dasan <narens@nvidia.com>
@andi4191 can you verify this change fixes the observed segfaults?
I would still say the compiler should not be considered thread-safe yet. There are at least a few parts that may still have issues (Logger, Runtime).
Code conforms to Python style guidelines
Code conforms to C++ style guidelines
Tested it for a few iterations. It passes every time.
Minor comment. LGTM
```diff
@@ -80,9 +80,29 @@ Weights::Weights(ConversionCtx* ctx, at::Tensor t) {
   // Store the data in the conversion context so it remains until building is
   // complete
-  void* buf = malloc(t_cpu.numel() * sizeof(float));
+  void* buf;
```
Should be initialized to nullptr.
Signed-off-by: Naren Dasan <naren@narendasan.com> Signed-off-by: Naren Dasan <narens@nvidia.com>
Code conforms to C++ style guidelines
Code conforms to Python style guidelines
Description
The memcpy for weights tensors over-copies when the tensor is not an FP32 tensor. This seems to be the root cause of the segfaults observed in #326, as well as of the failing DLA tests on aarch64 when users try to compile a model that is already in FP16.
Fixes #326