feat(processor): enhance reports to hold transformation and tracking plan metrics #3138

Jayachand · 2023-03-29T12:19:02Z

Description

Enhancing reports table to hold transformation and tracking plan information
New connectionDetails columns

TransformationId
TrannsformationVersionId
TrackingPlanId
TrackingPlanVersion

New statusDetails columns

ViolationCount
ErrorType

Notion Ticket

https://www.notion.so/rudderstacks/Send-violation-information-tp-id-version-violation-message-payload-c213bd096b1744189ce570d33026cdd2

https://www.notion.so/rudderstacks/Send-Transformation-Id-information-in-report-metrics-912964eb39144675b8905639e471c8a8?pvs=4

Security

The code changed/added as part of this pull request won't create any security issues with how the software is being used.

codecov · 2023-04-03T09:32:38Z

Codecov Report

Patch coverage: 59.34% and project coverage change: +0.07 🎉

Comparison is base (1b4698e) 51.26% compared to head (cd9826c) 51.33%.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #3138      +/-   ##
==========================================
+ Coverage   51.26%   51.33%   +0.07%     
==========================================
  Files         311      311              
  Lines       52463    52637     +174     
==========================================
+ Hits        26893    27022     +129     
- Misses      23943    23979      +36     
- Partials     1627     1636       +9

Impacted Files	Coverage Δ
processor/trackingplan.go	`53.38% <0.00%> (ø)`
router/batchrouter/batchrouter.go	`37.75% <0.00%> (ø)`
enterprise/reporting/reporting.go	`30.76% <27.27%> (+14.87%)`	⬆️
processor/processor.go	`86.84% <80.76%> (-0.30%)`	⬇️
processor/transformer/transformer.go	`73.54% <100.00%> (ø)`
router/router.go	`78.45% <100.00%> (ø)`
utils/types/reporting_types.go	`90.00% <100.00%> (+1.76%)`	⬆️

... and 4 files with indirect coverage changes

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

utils/types/reporting_types.go

sql/migrations/reports/000004_alter_reports_add_tr_tp_columns.up.sql

enterprise/reporting/reporting.go

router/router.go

lvrach

Approved with minor comments

lvrach · 2023-04-17T12:44:14Z

utils/types/reporting_types.go

+	TransformationId        string `json:"transformationId"`
+	TransformationVersionId string `json:"transformationVersionId"`
+	TrackingPlanId          string `json:"trackingPlanId"`
+	TrackingPlanVersion     int    `json:"trackingPlanVersion"`


[minor] to keep with the idiomatic naming of go Id -> ID:

Suggested change

TransformationId string `json:"transformationId"`

TransformationVersionId string `json:"transformationVersionId"`

TrackingPlanId string `json:"trackingPlanId"`

TrackingPlanVersion int `json:"trackingPlanVersion"`

TransformationID string `json:"transformationId"`

TransformationVersionID string `json:"transformationVersionId"`

TrackingPlanID string `json:"trackingPlanId"`

TrackingPlanVersion int `json:"trackingPlanVersion"`

I would propose the same for:

SourceDefinitionId string `json:"sourceDefinitionId"` DestinationDefinitionId string `string:"destinationDefinitionId"` SourceCategory string `json:"sourceCategory"` ```

converted Id to ID format for newly added properties.

lvrach · 2023-04-17T12:50:28Z

utils/types/reporting_types.go

@@ -99,7 +113,7 @@ type PUReportedMetric struct {
 	StatusDetail *StatusDetail
 }

-func CreateConnectionDetail(sid, did, strid, sjid, sjrid, sdid, ddid, sc string) *ConnectionDetails {
+func CreateConnectionDetail(sid, did, strid, sjid, sjrid, sdid, ddid, sc, trid, trvid, tpid string, tpv int) *ConnectionDetails {


I would suggest against using this function and directly define: ConnectionDetails.

Having functions with more than 4 arguments is considered a bad practice. When dealing with many arguments is hard to understand which parameter has much with which arguments—making writing and reading harder. The struct, where you can provide the name of the fields is a better option.

router/batchrouter/batchrouter.go

processor/processor.go

processor/trackingplan.go

enterprise/reporting/reporting.go

atzoum · 2023-04-18T06:35:11Z

utils/types/reporting_types.go

+	SUCCEEDED                    = "succeeded"
+	SUCCEEDED_WITHOUT_VIOLATIONS = "succeeded_without_violations"


what is the difference between succeeded and succeeded_without_violations?

In trackingplan stage, there will be no status SUCCEEDED unlike other stages.
SUCCEEDED is split into SUCCEEDED_WITHOUT_VIOLATIONS + SUCCEEDED_WITH_VIOLATIONS

Can't we skip SUCCEEDED_WITHOUT_VIOLATIONS? If it succeeded, it succeeded. If there are violations, I would also skip the SUCCEEDED part and use something more straightforward, e.g. VIOLATIONS_FOUND, since what you are interested in is mostly that violations were found, not that we are ignoring these violations by considering the events as succeeded and letting them flow further in the pipeline. This, from what I understand is about to change in the future anyway, no?

All events sent to tracking plan stage can be categorized into 3 disjoint sets SUCCEEDED_WITHOUT_VIOLATIONS + SUCCEEDED_WITH_VIOLATIONS + ABORTED (due to violations). User is more interested in VIOLATIONS_FOUND (SUCCEEDED_WITH_VIOLATIONS+ ABORTED) and ABORTED.Also User might ask for only succeeded without violations. In view of this, 3 disjoint status data are sent to reporting, graphs shown in UI can be constructed based on user requirements

Letting events succeed with violations is a configuration setting given to user. User might this setting always without dropping events.

utils/types/reporting_types.go

processor/processor.go

atzoum · 2023-04-18T13:27:30Z

processor/processor.go

+			messages = append(messages, eventsByMessageID[userTransformedEvent.Metadata.MessageID].SingularEvent)
+		}
+
+		for _, message := range messages {


Question: Why are we repeating this for every message? The only thing the message affects is the sample payload. Would it be more efficient to increment counters in bulk and use a random sample?

atzoum · 2023-04-18T13:29:53Z

processor/processor.go

@@ -1289,10 +1370,10 @@ func (proc *Handle) processJobsForDest(partition string, subJobs subJob, parsedE
 	inCountMap := make(map[string]int64)
 	inCountMetadataMap := make(map[string]MetricMetadata)
 	connectionDetailsMap := make(map[string]*types.ConnectionDetails)
-	statusDetailsMap := make(map[string]*types.StatusDetail)
+	statusDetailsMap := make(map[string]map[string]*types.StatusDetail)


What kind of key does this second map level holds? I find it hard to understand

second map holds this key:fmt.Sprintf("%s:%d:%s:%s:%s", status, event.StatusCode, eventName, eventType, violationErrorType)
For each violationType, one report needs to be sent

We also appear to be capturing another, aggregate report too for all types :/

Yeah, complexity is ever growing.
I think we should plan for complete refactor of reporting metrics

utils/types/reporting_types.go

atzoum · 2023-04-19T07:38:33Z

utils/types/reporting_types.go

+	SUCCEEDED                    = "succeeded"
+	SUCCEEDED_WITHOUT_VIOLATIONS = "succeeded_without_violations"


Can't we skip SUCCEEDED_WITHOUT_VIOLATIONS? If it succeeded, it succeeded. If there are violations, I would also skip the SUCCEEDED part and use something more straightforward, e.g. VIOLATIONS_FOUND, since what you are interested in is mostly that violations were found, not that we are ignoring these violations by considering the events as succeeded and letting them flow further in the pipeline. This, from what I understand is about to change in the future anyway, no?

atzoum

Collection of reporting metrics through metric maps was already a complicated affair. Unfortunately, now it appears to be even more fragile, complex and hard to follow, comprehend or reason about.
With the current design, I fear that every time there is a need to capture even more granular statistics, the complexity will grow exponentially.

processor/processor.go

atzoum · 2023-04-19T14:05:07Z

processor/processor.go

@@ -1289,10 +1370,10 @@ func (proc *Handle) processJobsForDest(partition string, subJobs subJob, parsedE
 	inCountMap := make(map[string]int64)
 	inCountMetadataMap := make(map[string]MetricMetadata)
 	connectionDetailsMap := make(map[string]*types.ConnectionDetails)
-	statusDetailsMap := make(map[string]*types.StatusDetail)
+	statusDetailsMap := make(map[string]map[string]*types.StatusDetail)


We also appear to be capturing another, aggregate report too for all types :/

feat(reports): add tracking plan diff status reports

e3a9277

github-actions bot added the server-team label Mar 29, 2023

Jayachand added 3 commits March 29, 2023 20:00

nested map

015dece

stage identifier

ec41346

Merge branch 'master' into feat.tpStateReportMetrics

6c7ed75

feat: add event payload, aggreagte

ad1228a

Jayachand changed the title ~~feat(reports): add tracking plan diff status reports~~ feat(processor): add tracking plan diff status reports Apr 10, 2023

Jayachand added 2 commits April 10, 2023 11:14

Merge branch 'master' into feat.tpStateReportMetrics

40ceab6

Merge branch 'master' into feat.tpStateReportMetrics

4712f97

Jayachand marked this pull request as ready for review April 10, 2023 08:36

Jayachand requested review from Sidddddarth and abhimanyubabbar April 10, 2023 08:36

test cases

8b4940a

github-actions bot added the with tests label Apr 10, 2023

lvrach requested a review from atzoum April 11, 2023 14:23

Jayachand added 2 commits April 12, 2023 16:45

Merge branch 'master' into feat.tpStateReportMetrics

ef3f929

transformation metric changes

64481c8

Jayachand changed the title ~~feat(processor): add tracking plan diff status reports~~ feat(processor): enhance reports to hold transformation and tracking plan metrics Apr 12, 2023

Jayachand added 5 commits April 12, 2023 22:57

minor changes

84a9a2c

Merge branch 'master' into feat.tpStateReportMetrics

9bd588b

Merge branch 'master' into feat.tpStateReportMetrics

40a7cb7

Merge branch 'master' into feat.tpStateReportMetrics

91e4bfa

Merge branch 'master' into feat.tpStateReportMetrics

c5a026b

satishrudderstack reviewed Apr 17, 2023

View reviewed changes

utils/types/reporting_types.go Outdated Show resolved Hide resolved

sql/migrations/reports/000004_alter_reports_add_tr_tp_columns.up.sql Show resolved Hide resolved