KAFKA-9088: InternalProcessorContext mock builder [partial] #7933

pierDipi · 2020-01-11T20:42:36Z

I made a simpler version of a builder for mocking InternalProcessorContext.

ref: 7718

Committer Checklist (excluded from commit message)

Verify design and implementation
Verify test coverage and CI build status
Verify documentation (including upgrade notes)

pierDipi · 2020-01-11T20:45:46Z

Call for review: @cadonna @mjsax @guozhangwang @vvcephei

guozhangwang · 2020-01-13T21:38:24Z

@cadonna could you take another look?

cadonna

@pierDipi Thank you very much for this PR. This is how I envisioned the mock.

I did a first pass without looking too much to the unit tests.

streams/src/test/java/org/apache/kafka/test/InternalProcessorContextMock.java

cadonna · 2020-01-15T18:40:28Z

streams/src/test/java/org/apache/kafka/test/InternalProcessorContextMock.java

+                        stateStoreMap.put(storeCapture.getValue().name(), storeCapture.getValue());
+                        return null;


Q: Do you think we need to implement this setter behaviour for this mock? Couldn't we not just pass a map of state stores at during the build specification with a method like stateStores(final Map<String, StateStore> stateStores) and don't specify a behaviour for register()?

Hi @cadonna,
I didn't get the advantage of this.
I would point out that register() is often called here: InMemoryWindowStore.java#L111

Anyway, I made all requested modifications except this one, could you take another look?
Thank you.

@pierDipi Sorry for the late reply.
Actually, during our discussions, I thought we agreed to stop the EasyMock experiment because of the public API issue and we should try to do something like #7979 as proposed by @vvcephei. Sorry for the confusion.
Anyways, I would be really happy, if you could take #7979 and extend it to make it robust. Are you still interested?

@cadonna Yes, I'm still interested, but I cannot guarantee when it will happen. I hope it will be done by the end of the next month.

vvcephei · 2020-01-17T19:15:38Z

Hi @pierDipi @cadonna ,

Thanks for the follow-up, this looks cleaner than the last PR.

I understand that part of the point here is to explore the EasyMock approach. Given what I said on the last PR, I'm sure it's not surprising if I state that I'm still concerned this is not the best approach.

Because such concerns are often too high-level to be useful, I put together a small PR purely to demonstrate the alternative form this work could take: #7979

A couple of things to note:

Just as in this PR, POC: Consolidate testing InternalProcessorContexts #7979 results in an InternalProcessorContext suitable for the remainder of the tests that still depend on the InternalMockProcessorContext
Unlike this PR, POC: Consolidate testing InternalProcessorContexts #7979 converts a test (KStreamSessionWindowAggregateProcessorTest) to demonstrate what the testing usage would look like. Perhaps a good comparative study would be to convert the same test as part of this PR.
Looking just at the support code (not the test) in POC: Consolidate testing InternalProcessorContexts #7979 only requires 17 lines of code to change from the existing trunk, and there are a total of only 124 lines of code in the mocked InternalProcessorContext as a whole. In comparison, this PR's mock is 376 lines. LOC isn't always the most useful metric for comparison, but the fact that the EasyMock approach requires 3x the testing support code is concerning.
POC: Consolidate testing InternalProcessorContexts #7979 allows us to reuse the existing public test-util API MockProcessorContext, which (by dogfooding) increases our confidence and decreases the chance of regression in the public test utility.
Unlike the EasyMock approach, POC: Consolidate testing InternalProcessorContexts #7979 is pure, easy-to-read, debuggable and traceable Java.

I really want to underscore that I appreciate all the work you've both put into exploring the EasyMock approach, and that I don't have a particular beef with EasyMock. Just that EasyMock is highest leverage when used in small and localized ways. When you have to put together a builder for a mock, it indicates right away that the use case is not ideal for EasyMock.

pierDipi · 2020-01-18T09:33:47Z

Hi @cadonna @vvcephei,

Thanks for the reviews.

My first PR merged InternalMockProcessorContext and MockInternalProcessorContext: #7594, which could be a good comparison in terms of LOC, and AFAIU it's the strategy that @vvcephei likes.
Just a note: #7594 changes MockProcessorContext which is a user-facing API, which means the PR requires further improvements.

Unlike this PR, POC: Consolidate testing InternalProcessorContexts #7979 converts a test (KStreamSessionWindowAggregateProcessorTest) to demonstrate what the testing usage would look like. Perhaps a good comparative study would be to convert the same test as part of this PR.

Yes, of course.

Looking just at the support code (not the test) in POC: Consolidate testing InternalProcessorContexts #7979 only requires 17 lines of code to change from the existing trunk, and there are a total of only 124 lines of code in the mocked InternalProcessorContext as a whole. In comparison, this PR's mock is 376 lines. LOC isn't always the most useful metric for comparison, but the fact that the EasyMock approach requires 3x the testing support code is concerning.

If the comparison is based on LOC to change, IMHO, there is no way we will be able to find a solution based on EasyMock that has less LOC to change; at most, we will be able to find a solution that overall has less LOC, given that we can delete InternalMockProcessorContext and MockInternalProcessorContext.

POC: Consolidate testing InternalProcessorContexts #7979 allows us to reuse the existing public test-util API MockProcessorContext, which (by dogfooding) increases our confidence and decreases the chance of regression in the public test utility.

The builder uses it too.

Unlike the EasyMock approach, POC: Consolidate testing InternalProcessorContexts #7979 is pure, easy-to-read, debuggable and traceable Java.

This is a good point.

cadonna · 2020-01-19T08:46:34Z

@vvcephei and @pierDipi Thanks for your comments. I will try to answer to your comments as soon as possible.

cadonna · 2020-01-20T20:55:57Z

First of all thank you very much for the discussion, @pierDipi and @vvcephei. I have now a much clearer picture about the implications of using EasyMock in this case.

My original idea was to try to use EasyMock to mock InternalProcessorContext to let EasyMock handle call verification and get rid of our own call verification code. The goal was to have less code to maintain.

@vvcephei, while I also think we should try to avoid unnecessary complexity, I also find that your LOC comparison is not completely fair for the following reasons:

The solution with EasyMock gives you the possibility to add call verification on any public method you want which PR POC: Consolidate testing InternalProcessorContexts #7979 (nice PR number, BTW) does not give you.
As I wrote above, the goal was to have less code to maintain, i.e., we would need to also include the LOC that could be removed because of this PR in the calculation.

Regarding readability, that would still need a couple of iterations of this PR. I cannot follow your reasoning about debuggability and traceability. While I see why it is important to debug a hand-crafted mock, I do not see why it is important to debug a mock created by an external library. Admittedly, there are for sure cases where this is needed, but they should be rare.

Said that let's now move to my concerns that arose during our discussions. To really accomplish the goal to let EasyMock handle all call verifications and not introducing too much complexity, we would need to refactor also MockProcessorContext and to use an EasyMock also for it, because in that class resided the majority of self-made call verification code that EasyMock should handle. That would also mean that we would make our public API (MockProcessorContext is part of the public test-utils and there are plans also to move a mock for InternalProcessorContext to test-utils) dependent on EasyMock. IMO that is a big issue, because we would limit our freedom in changing the API and make the API in general more brittle. For this reason, I think EasyMock is not the right approach in this case.

What do you think?

@pierDipi
Thank you very much for all your work to get to this point. I am sorry that I overlooked this fact in the beginning. It would have saved you some work. If no public API were involved, I would opt to go ahead with the EasyMock approach. I hope you are not too much disappointed about how this work developed and that you will continue with this ticket.

Again, thank you both for the discussions. I learned a lot.

vvcephei · 2020-01-24T16:01:11Z

Thanks @pierDipi and @cadonna ,

Just to clarify, although the LOC point may be invalid, it sounds like you basically in agreement that (at least for now), we can do something more like #7979 than an EasyMock builder. If so, then I'm (obviously) +1.

As a side note, I do think that we should consider what is right for each test. For example, if the desire is just to provide a "dummy" context and verify the black-box behavior of a component (such as, "does this processor forward the right result for the given inputs?"), then the #7979 approach is preferable. However, if the goal is really to verify some specific interaction (like, "does this method get called exactly twice?"), then a mock can still be defined in situ, which we do in many tests mocking other components. IMHO, this strategy plays to the strengths of both approaches.

Thanks again,
-John

guozhangwang · 2020-02-16T23:11:45Z

test this please

guozhangwang · 2020-02-16T23:12:23Z

@pierDipi @cadonna @vvcephei could you bring me up to date on the current progress of this PR?

pierDipi · 2020-02-16T23:44:04Z

Hi @guozhangwang,

We decided to go ahead with #7979 and therefore this PR can be closed.

pierDipi added 27 commits December 8, 2019 18:02

Mock applicationId() method

38dca1e

Mock taskId() method

5b385e3

Mock keySerde() method

6151c55

Mock valueSerde() method

ad8985e

Mock stateDir() method

1efecff

Mock metrics() method

910ecf1

Mock register() and getStateStore() methods

ab5c07f

mock schedule() method

b02fbfc

Format

720b197

mock commit()

8132005

mock topic()

8bae4be

mock partition()

c33cc41

mock offset()

ecf8759

mock headers()

a1579fe

mock timestamp()

42823b2

mock appConfigs()

61b8216

mock appConfigsWithPrefix()

bab3706

mock recordContext()

5fb8a89

mock setRecordContext()

1ee7294

mock setCurrentNode() and currentNode()

4ee0ada

mock getCache()

4a9b4a7

mock initialize() and uninitialize()

e208722

fix checkstyle

053144f

mock forward()

e7918aa

change default values to recordContext

fb5509c

clean up

c194129

Merge branch 'trunk' into KAFKA-9088-IPCMock-Builder

0549940

Align processorContext state

d10c6ad

fix style issue

e2d50fb

cadonna reviewed Jan 14, 2020

View reviewed changes

cadonna reviewed Jan 15, 2020

View reviewed changes

vvcephei mentioned this pull request Jan 17, 2020

POC: Consolidate testing InternalProcessorContexts #7979

Closed

3 tasks

pierDipi added 2 commits January 20, 2020 22:59

make InternalProcessorContextMockBuilder top level class

99fe4cc

migrate EasyMock expectations to EasyMock stub expectations

ebe6603

pierDipi closed this Feb 16, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KAFKA-9088: InternalProcessorContext mock builder [partial] #7933

KAFKA-9088: InternalProcessorContext mock builder [partial] #7933

pierDipi commented Jan 11, 2020

pierDipi commented Jan 11, 2020

guozhangwang commented Jan 13, 2020

cadonna left a comment

cadonna Jan 15, 2020

pierDipi Jan 30, 2020

cadonna Feb 11, 2020

pierDipi Feb 12, 2020

vvcephei commented Jan 17, 2020

pierDipi commented Jan 18, 2020 •

edited

Loading

cadonna commented Jan 19, 2020

cadonna commented Jan 20, 2020

vvcephei commented Jan 24, 2020

guozhangwang commented Feb 16, 2020

guozhangwang commented Feb 16, 2020

pierDipi commented Feb 16, 2020

		stateStoreMap.put(storeCapture.getValue().name(), storeCapture.getValue());
		return null;

KAFKA-9088: InternalProcessorContext mock builder [partial] #7933

KAFKA-9088: InternalProcessorContext mock builder [partial] #7933

Conversation

pierDipi commented Jan 11, 2020

Committer Checklist (excluded from commit message)

pierDipi commented Jan 11, 2020

guozhangwang commented Jan 13, 2020

cadonna left a comment

Choose a reason for hiding this comment

cadonna Jan 15, 2020

Choose a reason for hiding this comment

pierDipi Jan 30, 2020

Choose a reason for hiding this comment

cadonna Feb 11, 2020

Choose a reason for hiding this comment

pierDipi Feb 12, 2020

Choose a reason for hiding this comment

vvcephei commented Jan 17, 2020

pierDipi commented Jan 18, 2020 • edited Loading

cadonna commented Jan 19, 2020

cadonna commented Jan 20, 2020

vvcephei commented Jan 24, 2020

guozhangwang commented Feb 16, 2020

guozhangwang commented Feb 16, 2020

pierDipi commented Feb 16, 2020

pierDipi commented Jan 18, 2020 •

edited

Loading