Smoke tests: Fix random seed on smoke tests and add asserts on results #97

lotif · 2024-01-15T21:50:21Z

PR Type

Other

Short Description

Clickup Ticket(s): https://app.clickup.com/t/8686mur37

Passing a random seed to FedProx, APFL and SCAFFOLD
Making them save the metrics to a file at the end of their execution
Adding assertions to their metrics values in the smoke tests

Tests Added

The smoke tests themselves.

tests/smoke_tests/run_smoke_test.py

emersodb · 2024-01-17T14:59:57Z

tests/smoke_tests/run_smoke_test.py

+            continue
+
+        metrics_found = True
+        _assert_metrics_dict(metrics_to_assert, metrics)


I may be missing something, so correct me if I'm wrong, but this looks like we're going to compare any metrics json dumped by a client to a single metrics_to_assert dictionary. For some of our example, that's definitely fine, because the different clients are loading the same dataset and running the same training cycle. However, there are instances where we might want to test when that isn't the case. That is, each client loads a distinct dataset. So their metrics won't be the same as each other.

Again, correct me if that interpretation is wrong.

That's correct, but I think the solution might be somewhere else. I think for that use case we could pass in different dictionaries for each client, giving each a client name or id, and then compare them appropriately when pulling their json files. Thoughts?

Yes, I think that makes sense. We can do another smoke test where two clients load slightly different datasets and include it as a separate test.

tests/smoke_tests/run_smoke_test.py

emersodb

Overall it looks really great. Some fairly minor comments. The only one of moderate significance is the assumption that each client will produce matching metrics. In the general case, they won't. So we probably want that flexibility built into the smoke test infra.

lotif added 27 commits January 10, 2024 10:47

Adding metrics basics

68866c0

Improving implementation

eb739e6

Return on error

f77fa5d

Adding to client, cleaner implementation

f8c8b46

Fixing issues, reverting the example code

c80215c

Merge branch 'main' into metrics-reporter

1f05c0a

Adding metrics to evaluate

fa28b45

Merge branch 'main' into metrics-reporter

86cc6d9

Adding documentation

ff30322

Adding first unit tests

0dd2f21

Adding test for metrics.py

d3e21ea

Adding test for evaluate_server

1ca8f7b

Adding tests for evaluate client, simplifying basic client tests a bit

bd0f7f1

Simpler mocking

12f2fbf

Merge branch 'main' into metrics-reporter

7ea8cb6

Implementing checks for metrics

2d7a1d1

CR by David

0e21726

Merge branch 'metrics-reporter' into fix-seed

a1495b5

Updating code, adding debug to see all values

3be6ea3

Trying some scaffold tolerances

0603502

Trying some scaffold tolerances [2]

1b30bcb

Trying some scaffold tolerances [3]

026f10e

Trying some scaffold tolerances [4]

8ccf3e3

Adding proximal loss to fedprox

84d0852

Reverting log level back to INFO

7ac024f

Merge branch 'main' into fix-seed

3de2d3b

Updating docstrings

e356776

lotif changed the title ~~Fix seed~~ Smoke tests: Fix random seed on smoke tests and add asserts on results Jan 16, 2024

lotif marked this pull request as ready for review January 16, 2024 17:47

lotif requested a review from fatemetkl January 16, 2024 17:48

lotif requested review from emersodb, jewelltaylor, sanaAyrml, yuchongzhang and zxj-c January 16, 2024 17:48

emersodb reviewed Jan 17, 2024

View reviewed changes

tests/smoke_tests/run_smoke_test.py Outdated Show resolved Hide resolved

emersodb reviewed Jan 17, 2024

View reviewed changes

tests/smoke_tests/run_smoke_test.py Outdated Show resolved Hide resolved

emersodb reviewed Jan 17, 2024

View reviewed changes

CR by David

da72d36

emersodb self-requested a review January 19, 2024 20:35

emersodb approved these changes Jan 19, 2024

View reviewed changes

lotif merged commit 5229a88 into main Jan 19, 2024

lotif deleted the fix-seed branch January 19, 2024 20:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Smoke tests: Fix random seed on smoke tests and add asserts on results #97

Smoke tests: Fix random seed on smoke tests and add asserts on results #97

Uh oh!

lotif commented Jan 15, 2024 •

edited

Loading

Uh oh!

Uh oh!

emersodb Jan 17, 2024

Uh oh!

lotif Jan 17, 2024

Uh oh!

emersodb Jan 18, 2024

Uh oh!

Uh oh!

emersodb left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Smoke tests: Fix random seed on smoke tests and add asserts on results #97

Smoke tests: Fix random seed on smoke tests and add asserts on results #97

Uh oh!

Conversation

lotif commented Jan 15, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Type

Short Description

Tests Added

Uh oh!

Uh oh!

emersodb Jan 17, 2024

Choose a reason for hiding this comment

Uh oh!

lotif Jan 17, 2024

Choose a reason for hiding this comment

Uh oh!

emersodb Jan 18, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

emersodb left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

lotif commented Jan 15, 2024 •

edited

Loading