[PROD-399] Handle edge case of single rollover file with index > 0 #9

rmoneys · 2022-08-04T03:13:51Z

https://synccomputing.atlassian.net/browse/PROD-399

Handles edge case in which a single rollover log file is provide and it has a non-zero index

sonarqubecloud · 2022-08-04T03:14:17Z

Kudos, SonarCloud Quality Gate passed!

0 Bugs
0 Vulnerabilities
0 Security Hotspots
2 Code Smells

No Coverage information
0.0% Duplication

NKSync · 2022-08-04T18:30:09Z

spark_log_parser/__main__.py

-if not os.path.isdir(args.result_dir):
+if not args.result_dir.is_dir():
    logger.error("%s is not a directory", args.result_dir)
    sys.exit(1)


I think we should refrain from using sys.exit() here. In the case the spark_log_parser is used anywhere in the backend or celery task, it would terminate the process, and likely cause unintended side effects.

I think we're ok here. This code is only executed by users on the command line, e.g.

python -m spark_log_parser

ok, makes sense. It caught my eye because some areas we raise a ValueError but here we exit the process.

NKSync · 2022-08-04T18:43:27Z

I assume these scenarios encompass the different possibilities we can see. Is it still the case where we're unable to detect if the last rollover file is missing?

a.) The first rollover file is missing
b.) a rollover file in-between is missing
c.) the last rollover file is missing

rmoneys · 2022-08-04T18:46:44Z

Yeah, we can't tell if we're missing the last rollover log files

NKSync

Based on my initial glance, the PR looks good to me. Nice work on handling the different cases!

gorskysd · 2022-08-04T19:04:43Z

spark_log_parser/eventlog.py

-
-        diffs = df.rollover_index.diff()[1:]
-
-        if any(diffs > 1) or df.rollover_index[0] > 0:


I remember seeing this in the last PR to check for the case this PR is addressing. Did this logic just not get run if there was only one log?

Exactly. With only 1 log file rollover validation was skipped.

gorskysd

Looks good to me.

I can confirm that we can't outright detect if a log is missing at the end. However, if that is the case another error will be thrown downstream during the parsing which indicates missing rollover as a possibility.

[PROD-399] Handle edge case of single rollover file with index > 0

9232866

rmoneys requested review from NKSync and gorskysd August 4, 2022 03:19

NKSync suggested changes Aug 4, 2022

View reviewed changes

NKSync self-requested a review August 4, 2022 18:50

NKSync approved these changes Aug 4, 2022

View reviewed changes

gorskysd reviewed Aug 4, 2022

View reviewed changes

gorskysd approved these changes Aug 4, 2022

View reviewed changes

rmoneys merged commit d4525b9 into main Aug 4, 2022

rmoneys deleted the PROD-399-validation branch August 4, 2022 21:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[PROD-399] Handle edge case of single rollover file with index > 0 #9

[PROD-399] Handle edge case of single rollover file with index > 0 #9

Uh oh!

rmoneys commented Aug 4, 2022

Uh oh!

sonarqubecloud bot commented Aug 4, 2022

Uh oh!

NKSync Aug 4, 2022

Uh oh!

rmoneys Aug 4, 2022

Uh oh!

NKSync Aug 4, 2022

Uh oh!

NKSync commented Aug 4, 2022

Uh oh!

rmoneys commented Aug 4, 2022

Uh oh!

NKSync left a comment

Uh oh!

gorskysd Aug 4, 2022

Uh oh!

rmoneys Aug 4, 2022

Uh oh!

gorskysd left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants


		diffs = df.rollover_index.diff()[1:]

		if any(diffs > 1) or df.rollover_index[0] > 0:

[PROD-399] Handle edge case of single rollover file with index > 0 #9

[PROD-399] Handle edge case of single rollover file with index > 0 #9

Uh oh!

Conversation

rmoneys commented Aug 4, 2022

Uh oh!

sonarqubecloud bot commented Aug 4, 2022

Uh oh!

NKSync Aug 4, 2022

Choose a reason for hiding this comment

Uh oh!

rmoneys Aug 4, 2022

Choose a reason for hiding this comment

Uh oh!

NKSync Aug 4, 2022

Choose a reason for hiding this comment

Uh oh!

NKSync commented Aug 4, 2022

Uh oh!

rmoneys commented Aug 4, 2022

Uh oh!

NKSync left a comment

Choose a reason for hiding this comment

Uh oh!

gorskysd Aug 4, 2022

Choose a reason for hiding this comment

Uh oh!

rmoneys Aug 4, 2022

Choose a reason for hiding this comment

Uh oh!

gorskysd left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants