Fix basic pipeline test + Pipeline benchmark #104
Conversation
Codecov Report
@@ Coverage Diff @@
## main #104 +/- ##
==========================================
- Coverage 54.80% 54.63% -0.18%
==========================================
Files 37 37
Lines 2332 2332
==========================================
- Hits 1278 1274 -4
- Misses 981 984 +3
- Partials 73 74 +1
pkg/pipeline/pipeline_test.go
generic:
  rules:
  - input: Bytes
    output: fl2m_bytes
The rules themselves now should be indented another 2 spaces.
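For illustration only (the exact columns are assumed, not taken from the file), the extra indentation being asked for would look something like:

```yaml
generic:
  rules:
    - input: Bytes
      output: fl2m_bytes
```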
func BenchmarkPipeline(b *testing.B) {
	t := &testing.T{}
	loadGlobalConfig(t)
@mariomac can you look at https://github.com/netobserv/flowlogs-pipeline/blob/main/Makefile#L88? Maybe we can somehow improve this. I started to create some benchmark area for the project and I totally agree we need to improve that.
@mariomac In any event, can we agree to create dedicated Go files just for benchmarks, split from the rest of the tests, so we can run them stand-alone?
I had a look but it doesn't seem to work for me... on each invocation I got:
panic: unexpected call to os.Exit(0) during test
github.com/netobserv/flowlogs2metrics/cmd/flowlogs2metrics.run()
/vagrant/code/flowlogs2metrics/cmd/flowlogs2metrics/main.go:190 +0x1f2
So I also decided to create a very pipeline-specific test to compare the part we are evaluating to change.
With respect to your second question, we could do that if you prefer. Anyway, benchmarks are not run by default even if they are in the same file as the tests. If you mean skipping tests when you run benchmarks, you can add the -test.run=^$ argument to the go test command to skip any test.
But I'm fine if you feel it's better to organize everything in the same benchmarks file.
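As a usage sketch (the package path below is just an example), a command that runs only the benchmarks and skips all tests would look like:

```sh
# -run '^$' matches no test names, -bench=. matches every benchmark
go test -run '^$' -bench=. ./pkg/pipeline/...
```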
@eranra I see what's happening with the benchmark in benchmark_test.go. It runs the whole command many times, so it would basically measure the performance of starting and finishing the FL2M process, not the actual FL2M processing performance.
Go benchmarks are more a sort of "micro-benchmark" aimed at testing specific parts of the code, and that's why they are usually located in the test files of the components they benchmark. For example, the benchmark in this PR just measures the time of sending and processing a file of ~5000 flows through a very simple dummy pipeline (no real ingest, no real writing...), but it allows us to measure the impact of the sequential vs parallel pipeline mechanism.
I'd suggest (in another PR, so as not to lose the focus of our current task) removing the current benchmark_test.go and preparing a benchmark that:
- starts the FL2M service with a real ingester
- spins up a client that is able to send real IPFIX flows (we did one using a VM library in our goflow-kube)
- measures how many flows it's able to process in a given amount of time, e.g. the last stage of the pipeline could be just a counter.
In the future, this could be improved, e.g. by spinning up parallel clients.
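Purely as an illustration of the "counter as last stage" idea, and assuming a hypothetical Write method rather than the actual FL2M stage interface, such a stage could look roughly like this:

```go
package benchmark

import "sync/atomic"

// countingWriter is a hypothetical terminal pipeline stage that only counts
// the entries it receives, so throughput can be read off after a run.
type countingWriter struct {
	count uint64
}

// Write discards the entries and just increments the counter atomically.
func (c *countingWriter) Write(entries []interface{}) {
	atomic.AddUint64(&c.count, uint64(len(entries)))
}

// Count reports how many entries reached this stage.
func (c *countingWriter) Count() uint64 {
	return atomic.LoadUint64(&c.count)
}
```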
| "github.com/sirupsen/logrus" | ||
|
|
||
| jsoniter "github.com/json-iterator/go" | ||
| "github.com/netobserv/flowlogs2metrics/pkg/config" |
The GoLand editor will re-order these imports to be in alphabetical order.
In netobserv we usually sort the imports with goimports, which is independent of the IDE (many team members use VS Code). It can be configured to be used from GoLand/IDEA too.
But if this is an inconvenience for you, I can adapt to use the GoLand defaults. WDYT @jotak @jpinsonneau @OlivierCazade ?
I'm also using goimports (configured in vscode)
if err != nil {
	t.Fatalf("unexpected error %s", err)
}
b.StartTimer()
Where is the timing information used?
StopTimer and StartTimer are used to exclude from the benchmark the parts of the code that you don't want to measure. In this case we don't want to measure the Pipeline creation time, just how many metrics can be forwarded.
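For reference, here is a minimal, self-contained sketch of that pattern; it is a generic example, not the actual FL2M pipeline code:

```go
package pipeline_test

import (
	"strings"
	"testing"
)

// BenchmarkCount shows the StopTimer/StartTimer pattern: the setup work
// (building a large input) is excluded from the measured section.
func BenchmarkCount(b *testing.B) {
	b.StopTimer() // pause the timer while preparing data we don't want to measure
	input := strings.Repeat("flow-record;", 5000)
	b.StartTimer() // from here on, the work counts toward the benchmark result

	for i := 0; i < b.N; i++ {
		_ = strings.Count(input, ";") // the operation actually being measured
	}
}
```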
Superseded by #105

There were a couple of minor, unnoticed mistakes that prevented the pipeline transformers from being loaded.
This test also checks that the transformation stage has been properly applied.
It also provides a basic pipeline benchmark that allows measuring any improvement or penalty in future modifications of the pipeline architecture.