Add seconds to engine program id generation #3906

dstrain115 · 2021-03-11T14:33:42Z

Clients were occasionally getting duplicate program ids
when running successive programs.
The root cause is unknown, but a likely cause is that a sub-module
was calling random.seed() somewhere.
This changes program id generation to include seconds in the id,
so that successive calls will generate distinct ids even if the
random number generator is re-seeded.
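
The suspected failure mode can be reproduced in isolation. The following is a minimal sketch, not the Cirq code itself: `make_random_id` is a hypothetical stand-in for the id helper, and the explicit `random.seed(1234)` calls simulate a sub-module resetting the global generator.

```python
import random
import string


def make_random_id(prefix: str, length: int = 16) -> str:
    """Hypothetical stand-in for the id helper: prefix plus random characters."""
    alphabet = string.ascii_uppercase + string.digits
    return prefix + ''.join(random.choice(alphabet) for _ in range(length))


# Simulate a sub-module that resets the global generator before each call.
random.seed(1234)
first = make_random_id('prog-')
random.seed(1234)
second = make_random_id('prog-')

print(first == second)  # True: same seed, same "random" id
```

Because `random.choice` draws from the shared module-level generator, any code that re-seeds it with a fixed value makes the next id repeat.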


balopat · 2021-03-11T14:49:48Z

cirq/google/engine/engine.py

@@ -65,7 +65,7 @@ class ProtoVersion(enum.Enum):
 def _make_random_id(prefix: str, length: int = 16):
    random_digits = [random.choice(string.ascii_uppercase + string.digits) for _ in range(length)]
    suffix = ''.join(random_digits)
-    suffix += datetime.date.today().strftime('%y%m%d')
+    suffix += datetime.date.today().strftime('%y%m%d-%H%M%S')
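
One detail worth checking when reading this change: `datetime.date` objects carry no time of day, and per the Python `datetime` documentation the hour, minute, and second format codes are rendered as zeros for them. The sketch below is not part of the PR; it uses fixed example dates to compare a `date` with a `datetime` under the same format string. Whether the merged code was later adjusted is not shown in this thread.

```python
import datetime

fmt = '%y%m%d-%H%M%S'

# A date has no time-of-day, so strftime fills the time fields with zeros.
date_suffix = datetime.date(2021, 3, 11).strftime(fmt)

# A datetime carries hours, minutes, and seconds.
datetime_suffix = datetime.datetime(2021, 3, 11, 14, 33, 42).strftime(fmt)

print(date_suffix)      # 210311-000000
print(datetime_suffix)  # 210311-143342
```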


Why not use UUID?

The problem with this second-based approach is that if the faulty (presumably seeding) client program generates more than one program per second, you'll still get collisions. With UUID, neither seeding nor speed will be a problem, though the id will be longer (36 characters).

A UUID based on what? I am not sure if that solves the problem.
Would you hash based on the time + circuit? If we use a random uuid, then that would be based on random() and also fail if seeded.

Seems that https://docs.python.org/3/library/uuid.html#uuid.uuid4 is based on os.urandom(). It can't be seeded or manipulated.
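
A quick way to check this claim is to re-seed `random` and compare two UUIDs. This is only an illustrative sketch; the seed value is arbitrary, and it relies on `uuid.uuid4()` drawing from the operating system's randomness rather than the `random` module.

```python
import random
import uuid

# Re-seed the global generator before each call, as the faulty client might.
random.seed(1234)
first = str(uuid.uuid4())
random.seed(1234)
second = str(uuid.uuid4())

print(first != second)  # True: uuid4 ignores the random module's seed
print(len(first))       # 36 characters in the canonical hyphenated form
```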

balopat

LGTM for experimenting but I think it might still not really fix the issue for clients that generate a lot of programs fast.

mrwojtek

I'm also fine with this as a temporary fix. I like the UUID approach; the extra length is marginal compared to the rest of the request payload.

dstrain115 · 2021-03-12T23:59:39Z

LGTM for experimenting but I think it might still not really fix the issue for clients that generate a lot of programs fast.

Possibly, but you would have to generate more than one program in a second and also set random.seed() accidentally before generating each program.
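
The two conditions named here can be sketched side by side. This is a hedged illustration rather than the Cirq implementation: `make_id` is a hypothetical helper that takes the timestamp as a parameter so the example is deterministic, and the fixed seed stands in for an accidental `random.seed()` call.

```python
import datetime
import random
import string


def make_id(prefix: str, now: datetime.datetime, length: int = 16) -> str:
    """Hypothetical variant: random characters plus a second-resolution timestamp."""
    alphabet = string.ascii_uppercase + string.digits
    suffix = ''.join(random.choice(alphabet) for _ in range(length))
    return prefix + suffix + now.strftime('%y%m%d-%H%M%S')


t0 = datetime.datetime(2021, 3, 12, 23, 59, 39)
t1 = t0 + datetime.timedelta(seconds=1)

random.seed(7)
a = make_id('prog-', t0)
random.seed(7)
b = make_id('prog-', t0)  # re-seeded, same second
random.seed(7)
c = make_id('prog-', t1)  # re-seeded, one second later

print(a == b)  # True: both conditions hold, so the ids collide
print(a == c)  # False: the timestamp differs
```

A collision requires both a re-seed and a second call within the same clock second; removing either condition yields distinct ids.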

balopat · 2021-03-15T20:45:51Z

Let's experiment then!

CirqBot · 2021-03-15T20:46:48Z

Automerge cancelled: A required status check is not present.

Missing statuses: ['Build docs', 'Build protos', 'Changed Notebooks Isolated Test against Cirq stable', 'Changed files test', 'Coverage check', 'Doc test', 'Format check', 'Lint check', 'Misc check', 'Notebook formatting', 'Pytest MacOS (3.7)', 'Pytest MacOS (3.8)', 'Pytest Ubuntu (3.7)', 'Pytest Ubuntu (3.8)', 'Pytest Windows (3.7)', 'Pytest Windows (3.8)', 'Type check']

dstrain115 requested review from cduck, vtomole, wcourtney and a team as code owners March 11, 2021 14:33

google-cla bot added the cla: yes label Mar 11, 2021

dstrain115 requested a review from mrwojtek March 11, 2021 14:33

balopat reviewed Mar 11, 2021


balopat approved these changes Mar 11, 2021


mrwojtek approved these changes Mar 11, 2021


balopat added the automerge label Mar 15, 2021

CirqBot added the front_of_queue_automerge label Mar 15, 2021

Merge branch 'master' into random_not_random

754a154

CirqBot removed the automerge and front_of_queue_automerge labels Mar 15, 2021

Merge branch 'master' into random_not_random

49980a3

dstrain115 merged commit 8cf7825 into quantumlib:master Mar 16, 2021
