Revert upgrade to "hashicorp/go-msgpack v2" #17047

Merged on May 1, 2023 (1 commit)

@lgfa29 (Contributor) commented May 1, 2023

After some internal testing we noticed that this change causes a server upgrade to fail with an error like this:

2023-05-01T16:39:31.542-0400 [ERROR] nomad: failed to start Raft: error="failed to get last log at index 20: Time.UnmarshalBinary: unsupported version"
2023-05-01T16:39:31.542-0400 [ERROR] agent: error starting agent: error="server setup failed: Failed to start Raft: failed to get last log at index 20: Time.UnmarshalBinary: unsupported version"

go-msgpack/v2 was supposed to be the exact same code as v1.1.5 that we were using before, so I'm not sure what is causing this issue. Reverting these changes fixes the problem.

Reverts #16810

@lgfa29 lgfa29 added the backport/1.3.x, backport/1.4.x, and backport/1.5.x labels May 1, 2023
@lgfa29 lgfa29 requested review from schmichael and shoenig May 1, 2023 20:53
@schmichael (Member) left a comment

Yikes! Are you able to follow up on what went wrong?

@lgfa29 (Contributor, Author) commented May 1, 2023

Yup, I've reopened #16808 while I investigate this further.

@lgfa29 lgfa29 merged commit ee5a08d into main May 1, 2023
@lgfa29 lgfa29 deleted the revert-16810-go-msgpack-v2 branch May 1, 2023 21:18
lgfa29 added a commit that referenced this pull request May 1, 2023
This reverts commit 3b2da71.

Backport of #17047.

Co-authored-by: Luiz Aoqui <luiz@hashicorp.com>
lgfa29 added a commit that referenced this pull request May 1, 2023
This reverts commit a6e3a0c.

Backport of #17047.

Co-authored-by: Luiz Aoqui <luiz@hashicorp.com>
lgfa29 added a commit that referenced this pull request May 1, 2023
This reverts commit a76ac25.

Backport of #17047.

Co-authored-by: Luiz Aoqui <luiz@hashicorp.com>
@IamTheFij (Contributor)

That's really strange... It's not just supposed to be the same exact code, but it is the same exact code: hashicorp/go-msgpack@v1.1.5...v2.0.0.

Is there a local proxy you're using aside from the Google one? I could try to verify a test from outside your infra, if that helps rule out some cached package with additional diffs.

@lgfa29 (Contributor, Author) commented May 2, 2023

I'm really not sure what happened. I don't have any proxy configured, but I do have GOPRIVATE=github.com/hashicorp set to help with work on some of our private repos, which I believe bypasses any cache.
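
As a rough illustration of why that setting would cover the module in question: GOPRIVATE is a comma-separated list of glob patterns matched against module path prefixes, and matching modules are fetched directly rather than through the module proxy or checksum database. The sketch below approximates that prefix matching with the standard library only. `matchesPrivatePattern` is a hypothetical helper written for this example, not the go command's own implementation.

```go
package main

import (
	"fmt"
	"path"
	"strings"
)

// matchesPrivatePattern (hypothetical helper) reports whether modulePath
// matches any glob in a comma-separated GOPRIVATE-style list. A glob with
// N path elements is compared against the first N elements of modulePath.
func matchesPrivatePattern(patterns, modulePath string) bool {
	elems := strings.Split(modulePath, "/")
	for _, glob := range strings.Split(patterns, ",") {
		if glob == "" {
			continue
		}
		n := strings.Count(glob, "/") + 1
		if n > len(elems) {
			continue // module path has too few elements to match this glob
		}
		prefix := strings.Join(elems[:n], "/")
		if ok, _ := path.Match(glob, prefix); ok {
			return true
		}
	}
	return false
}

func main() {
	const goprivate = "github.com/hashicorp" // value quoted in the comment above

	for _, mod := range []string{
		"github.com/hashicorp/go-msgpack/v2",
		"github.com/hashicorp/raft-boltdb",
		"golang.org/x/sys",
	} {
		fmt.Printf("%-40s private=%v\n", mod, matchesPrivatePattern(goprivate, mod))
	}
}
```

Under this approximation both hashicorp modules match the pattern and `golang.org/x/sys` does not, so a stale proxy copy of go-msgpack would be an unlikely explanation on this machine.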

We decided to revert this because we're approaching a release and this was the quickest fix for now. I will need some time to investigate this further. It may be some incompatibility between the versions used in Nomad and other libraries, like https://github.com/hashicorp/raft-boltdb.

If you have a chance to help us investigate, we would very much appreciate it 🙂

You can reproduce the problem by starting a (non-dev) cluster with the official 1.5.3 binary, running a job, stopping the agent, and then starting it again with the custom binary.
