Report resource usage counts by handling heartbeat events #35968

zmb3 · 2023-12-21T16:04:46Z

Buddy PR for #34954
Closes #34954

github-actions · 2023-12-21T16:05:17Z

The PR changelog entry failed validation: Changelog entry not found in the PR body. Please add a "no-changelog" label to the PR, or changelog lines starting with changelog: followed by the changelog entries for the PR.

proto/prehog/v1/teleport.proto

lib/usagereporter/teleport/aggregating/reporter.go

rosstimothy · 2023-12-21T20:12:39Z

lib/usagereporter/teleport/aggregating/service.go

+	for _, item := range result.Items {
+		report := &prehogv1.ResourcePresenceReport{}
+		if err := proto.Unmarshal(item.Value, report); err != nil {
+			return nil, trace.Wrap(err)


Do we want to abort the listing operation if there is one bad report in storage? Can we log the failure and keep trying the rest of the reports instead?

Potentially, but if we make that change we should do it for user activity reports too, so for now I think it's best to be consistent with current behavior.

Probably something we want to look into. I know we've had various bugs caused by getting resources failing due to one bad resource aborting the entire operation.

Failing usage data submission will result in a cluster alert, which will hopefully prompt the customer into calling us. That'd still work if we skipped over invalid data, but we would need to tweak the logic around creating and deleting the cluster alerts (to still create one), since we don't want to keep ignoring some logic bug.

lib/usagereporter/teleport/aggregating/service_test.go

proto/prehog/v1/teleport.proto

espadolini

do-not-merge because the .protos need to be updated in cloud master first, then copied here, other than that LGTM.

Buddy PR for #34954 Closes #34954 Co-authored-by: Edoardo Spadolini <edoardo.spadolini@goteleport.com> Signed-off-by: Zac Bergquist <zac.bergquist@goteleport.com>

zmb3 · 2024-01-02T21:47:41Z

do-not-merge because the .protos need to be updated in cloud master first, then copied here, other than that LGTM.

Protos were already merged in cloud master (see https://github.com/gravitational/cloud/pull/6823) but I pulled the latest in here (only differences were in comments).

zmb3 · 2024-01-02T21:49:05Z

cc @timothyb89 - this pulls in some of your new bot protos. Let me know if that's okay.

timothyb89 · 2024-01-02T22:42:39Z

cc @timothyb89 - this pulls in some of your new bot protos. Let me know if that's okay.

I think it's fine, it'll just be competing with #35881 so one of us will have a conflict to resolve 🙂

public-teleport-github-review-bot · 2024-01-03T20:22:03Z

@zmb3 See the table below for backport results.

Branch	Result
branch/v12	Failed
branch/v13	Failed
branch/v14	Failed

…se-anon-key * origin/master: (344 commits) Undelete CreateHostUserMode_HOST_USER_MODE_DROP (gravitational#36273) allow cwd to be changed in difftest (gravitational#35946) Auth device list component (gravitational#36235) make unified resources responsive (gravitational#35961) Support running Teleport in a "hot reload" mode (gravitational#35040) Prevent deleting enum values, allow deleting enum reservations in types.proto (gravitational#36248) Remove support for legacy (Amazon Linux 2) AMIs (gravitational#36153) Bump version(s) used for teleport-lab and teleport-quickstart (gravitational#36167) Allow Reconciler update handler to examine old value during update (gravitational#36171) Validate the user still exists during account reset (gravitational#35676) ButtonTextWithAddIcon shared component (gravitational#36103) Refactor hostname resolution for SSH connections via the WebUI (gravitational#35773) add structuredClone to jest JSDOMEnvironment (gravitational#36213) fix flaky `lib/auth` cache-enabled tests (gravitational#36216) Report resource usage counts by handling heartbeat events (gravitational#35968) Reviewer bot should use the stable version of Go (gravitational#36242) RFD 0153 Resource Guidelines (gravitational#34103) Use cmp and cmpots properly in operator tests (gravitational#36215) Relax Kubernetes CRD discovery when building cache (gravitational#36214) Add Access List messages to TAG protobuf (gravitational#36176) ...

zmb3 requested a review from espadolini December 21, 2023 16:04

github-actions bot requested review from avatus and rudream December 21, 2023 16:05

github-actions bot added the size/md label Dec 21, 2023

zmb3 added the no-changelog Indicates that a PR does not require a changelog entry label Dec 21, 2023

zmb3 force-pushed the zmb3/buddy-resource-reporting branch from 457c248 to f37ed08 Compare December 21, 2023 16:57

zmb3 requested a review from rosstimothy December 21, 2023 17:34

rosstimothy reviewed Dec 21, 2023

View reviewed changes

rosstimothy self-requested a review December 21, 2023 20:14

zmb3 force-pushed the zmb3/buddy-resource-reporting branch from f37ed08 to a03f93d Compare December 24, 2023 18:28

strideynet approved these changes Dec 27, 2023

View reviewed changes

proto/prehog/v1/teleport.proto Outdated Show resolved Hide resolved

rosstimothy approved these changes Jan 2, 2024

View reviewed changes

public-teleport-github-review-bot bot removed request for espadolini, avatus and rudream January 2, 2024 13:19

espadolini approved these changes Jan 2, 2024

View reviewed changes

espadolini added the do-not-merge label Jan 2, 2024

Report resource usage counts by handling heartbeat events

54cd766

Buddy PR for #34954 Closes #34954 Co-authored-by: Edoardo Spadolini <edoardo.spadolini@goteleport.com> Signed-off-by: Zac Bergquist <zac.bergquist@goteleport.com>

Copy latest protos from cloud and regenerate

89c35c9

zmb3 force-pushed the zmb3/buddy-resource-reporting branch from 6e930ff to 89c35c9 Compare January 2, 2024 21:47

zmb3 removed the do-not-merge label Jan 3, 2024

zmb3 added this pull request to the merge queue Jan 3, 2024

zmb3 added backport/branch/v12 backport/branch/v13 backport/branch/v14 labels Jan 3, 2024

Merged via the queue into master with commit 711fa4e Jan 3, 2024
41 checks passed

zmb3 deleted the zmb3/buddy-resource-reporting branch January 3, 2024 20:20

This was referenced Jan 4, 2024

[v14] Report resource usage counts by handling heartbeat events #36256

Merged

[v13] Report resource usage counts by handling heartbeat events #36257

Merged

[v12] Report resource usage counts by handling heartbeat events #36258

Merged

This was referenced Feb 1, 2024

Fix oversized report submission #37680

Merged

[v15] Fix oversized report submission #37687

Merged

[v14] Fix oversized report submission #37688

Merged

[v13] Fix oversized report submission #37689

Merged

[v12] Fix oversized report submission #37690

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Report resource usage counts by handling heartbeat events #35968

Report resource usage counts by handling heartbeat events #35968

zmb3 commented Dec 21, 2023

github-actions bot commented Dec 21, 2023

rosstimothy Dec 21, 2023

zmb3 Dec 24, 2023

rosstimothy Jan 2, 2024

espadolini Jan 2, 2024

espadolini left a comment

zmb3 commented Jan 2, 2024

zmb3 commented Jan 2, 2024

timothyb89 commented Jan 2, 2024

public-teleport-github-review-bot bot commented Jan 3, 2024

Report resource usage counts by handling heartbeat events #35968

Report resource usage counts by handling heartbeat events #35968

Conversation

zmb3 commented Dec 21, 2023

github-actions bot commented Dec 21, 2023

rosstimothy Dec 21, 2023

Choose a reason for hiding this comment

zmb3 Dec 24, 2023

Choose a reason for hiding this comment

rosstimothy Jan 2, 2024

Choose a reason for hiding this comment

espadolini Jan 2, 2024

Choose a reason for hiding this comment

espadolini left a comment

Choose a reason for hiding this comment

zmb3 commented Jan 2, 2024

zmb3 commented Jan 2, 2024

timothyb89 commented Jan 2, 2024

public-teleport-github-review-bot bot commented Jan 3, 2024