fix: declare the core.lock to be a pointer to make core concurrency safe #1626

ZetaoZhuang · 2022-09-20T19:23:23Z

Reason for Change:

To make the AI Core concurrency safe, we need to declare the core.lock to be a pointer so in cloned core we can point to the same lock
Issue Fixed:

Requirements:

uses conventional commit messages
includes documentation
adds unit tests

Notes:

to the same pointer

rbtr · 2022-09-22T16:10:56Z

@timraymond that this is necessary feels like something is big wrong, wdyt?

timraymond · 2022-09-22T18:40:06Z

@rbtr well, the patch is correct as it stands in isolation, because bd236e7 introduced a defect to zapai.(*Core).clone() where the mutex is copied. I guess what I don't understand is why the mutex exists in the first place. It's not clear what it's protecting...

ZetaoZhuang · 2022-09-22T18:52:44Z

@timraymond currently we are only able to have a single encoder shared between a core and its derived. so the mutex here is to protect the traceTelemetry of encoder while there is a process working on resetting/updating it

ZetaoZhuang · 2022-10-03T22:57:22Z

@timraymond currently we are only able to have a single encoder shared between a core and its derived. so the mutex here is to protect the traceTelemetry of encoder while there is a process working on resetting/updating it

@rbtr any thoughts on this?

rbtr · 2022-10-04T00:56:56Z

@ZetaoZhuang I regret the singleton gobber, I don't think this is sustainable. I discussed this with @timraymond who had the idea that maybe we could use protobuf to encode/decode the TraceTelemetry<->[]byte which could get us away from this synchronization mess...wdyt? Could you investigate some alternatives?

timraymond · 2022-10-07T17:56:45Z

@rbtr btw, I approved this because the code in its current state is provably broken, and this patch does provide a minimal fix. I agree that we should investigate using protobufs, but I don't think this PR should be a forcing function for that investigation. I'll open an issue with the idea so that it can be handled separately.

ZetaoZhuang · 2022-10-07T18:01:04Z

@ZetaoZhuang I regret the singleton gobber, I don't think this is sustainable. I discussed this with @timraymond who had the idea that maybe we could use protobuf to encode/decode the TraceTelemetry<->[]byte which could get us away from this synchronization mess...wdyt? Could you investigate some alternatives?

+1 on the protobuf. we can do the investigation on protobufs separately.

rbtr · 2022-10-07T18:48:59Z

no objection

…afe (Azure#1626) declare core.lock to be a pointer, so the lock in cloned core can point to the same pointer

declare core.lock to be a pointer, so the lock in cloned core can point

a4a12c1

to the same pointer

ZetaoZhuang requested a review from rbtr as a code owner September 20, 2022 19:23

ZetaoZhuang changed the title ~~declare the core.lock to be a pointer~~ fix: declare the core.lock to be a pointer to make core concurrency safe Sep 20, 2022

timraymond approved these changes Oct 5, 2022

View reviewed changes

Merge branch 'master' into fix_core_lock

ddfcc22

ZetaoZhuang added 3 commits October 7, 2022 12:11

Merge branch 'master' into fix_core_lock

3d0af2e

Merge branch 'master' into fix_core_lock

b3fdc6f

Merge branch 'master' into fix_core_lock

95a83d3

ZetaoZhuang enabled auto-merge (squash) October 10, 2022 20:48

ZetaoZhuang added 2 commits October 18, 2022 11:46

Merge branch 'master' into fix_core_lock

ef9d843

Merge branch 'master' into fix_core_lock

3cd0de2

ZetaoZhuang merged commit e9d143d into master Oct 19, 2022

ZetaoZhuang deleted the fix_core_lock branch October 19, 2022 01:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: declare the core.lock to be a pointer to make core concurrency safe #1626

fix: declare the core.lock to be a pointer to make core concurrency safe #1626

Uh oh!

ZetaoZhuang commented Sep 20, 2022 •

edited by bohuini

Loading

Uh oh!

rbtr commented Sep 22, 2022

Uh oh!

timraymond commented Sep 22, 2022

Uh oh!

ZetaoZhuang commented Sep 22, 2022 •

edited

Loading

Uh oh!

ZetaoZhuang commented Oct 3, 2022

Uh oh!

rbtr commented Oct 4, 2022

Uh oh!

timraymond commented Oct 7, 2022

Uh oh!

ZetaoZhuang commented Oct 7, 2022

Uh oh!

rbtr commented Oct 7, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

fix: declare the core.lock to be a pointer to make core concurrency safe #1626

fix: declare the core.lock to be a pointer to make core concurrency safe #1626

Uh oh!

Conversation

ZetaoZhuang commented Sep 20, 2022 • edited by bohuini Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rbtr commented Sep 22, 2022

Uh oh!

timraymond commented Sep 22, 2022

Uh oh!

ZetaoZhuang commented Sep 22, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ZetaoZhuang commented Oct 3, 2022

Uh oh!

rbtr commented Oct 4, 2022

Uh oh!

timraymond commented Oct 7, 2022

Uh oh!

ZetaoZhuang commented Oct 7, 2022

Uh oh!

rbtr commented Oct 7, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ZetaoZhuang commented Sep 20, 2022 •

edited by bohuini

Loading

ZetaoZhuang commented Sep 22, 2022 •

edited

Loading