Add class for serializing message to bytes #29384

Tim-Brooks · 2018-04-04T23:34:43Z

This is related to #28898. In the tcp transport when data is queued for
writing to a channel it is always bytes. However, for the http transport
this data will be http response objects. These objects will need to be
serialized to bytes on the transport thread prior to flushing.

This commit resolves this by separating flush and write operations. A
write operation can be any object. When queuing a write operation with
a context, the context will use the provider class to serialize the
message and produce a flush operation. Currently there is only a single
class (BytesFlushProducer) which only supports flush-ready write
operations (operations that are already byte buffers).

elasticmachine · 2018-04-04T23:34:44Z

Pinging @elastic/es-core-infra

s1monw · 2018-04-05T13:04:50Z

libs/elasticsearch-nio/src/main/java/org/elasticsearch/nio/SocketChannelContext.java

+        default void close() throws IOException {}
+    }
+
+    public interface FlushProducer extends AutoCloseable {


I really wonder what this buys us. I think it would be enough to have a method on WriteOperation to get a Writeable of some sort? That seems liek much simpler since we safe an object per operation potentially?

Http Responses are not necessarily individually serializable. For example in the pipelining case we need need to know what other responses have been submitted before handling the current response.

If you ‘queue’ http response number 3, but have not received 1 or 2 yet, the ‘poll’ operation will return nothing as we are not ready to write anything.

Additionally we have to fuse this on-top of a netty http serializing pipeline which is a stateful thing that you put a response into and get bytes out of. It is not really a "per-response" thing.

If we want to avoid different flush and write objects we could eventually optimize that with like an internal ByteBuffer setter that can only be accessed by the flush producer. And you could put the write operation into the producer/pipeline and when it comes out it has byte buffers and is flush ready. But I feel like that could be an optimization / follow-up.

i will need to think about this for a bit...

I really wonder if we need to separate this out and make object creation and complexity necessary in all cases. This new abstraction confuses me a lot. I wonder if we should think more in the way of composing messages. ie.

class Writeable { //this may be called with subsequent messages until we have all and then we write it back // optimizations can apply in here and depend on the context. public Writeable compose(Writeable writeable); }

This will all for only adding the complexity when it's really needed in this pipelining edgecase.

WDYT

I mean I don't really follow.

We're sitting here in the SocketChannelContext. Someone has given us a Writeable to queue to flush to the channel. How do we know if this Writeable is ready to flush? How do we know if we are supposed to compose it with a subsequent Writeable? And I guess in this scenario, instead of the http serializing work living in the FlushProducer, we have to pass that around so that it will be available to put in the Writeable at outbound?

I also don't completely understanding why a WriteProducer is considered "complex". It is just a pipeline that aggregates and serializes outbound messages. And is specific to the protocol in question (our protocol or http). Is it a naming issue? OutboundPipeline? OutboundSerializer? WriteSerializer? It could even be a single method if you want that returns ready messages (opposed to write and poll):

public List<FlushOperation> serialize(WriteOperation writeOp);

I just did not do that to avoid creating a new List every call.

ok so I have a couple of issues:

I have a hard time to see how you need to use it in the http case. Here we only have one implementation and nothing shows how it's used in the http case. This also means I have no idea it's sufficient for that usecase. I think we should never introduce an abstraction that has only one implementation.

we can fix that by adding a test that show for example how http use it or we go and simplify it by making it concrete for now and then intoduce the abstaction when we do the http one. I'd opt for the latter and remove the inferface and make BytesFlushProducer a concrete inner class in SocketChannelContext

I don't like that we have to create a new object every time here for no obvious reason. (http might add one) can't we make WriteOperation#getObject() return a FlushOperation and then we call it Flushable getFlushable() our default one would then simply call return this and we don't create any new objects unless needed?

if we do this we can special case the piplining usecase, no?

I hope this helps.

Tim-Brooks · 2018-04-12T03:49:18Z

@s1monw I made some changes based on your last comment.

s1monw

left two comments / quesitons

s1monw · 2018-04-12T11:47:12Z

libs/elasticsearch-nio/src/main/java/org/elasticsearch/nio/WriteOperation.java


-    SocketChannelContext getChannel();
+    public WriteOperation(SocketChannelContext channelContext, Object writeObject, BiConsumer<Void, Throwable> listener) {


why do we have to loose all type safety here. Can't we fix the interface to return FlushOperation and return this?

s1monw · 2018-04-12T11:49:52Z

libs/elasticsearch-nio/src/main/java/org/elasticsearch/nio/SocketChannelContext.java

@@ -108,14 +126,82 @@ public boolean connect() throws IOException {
        return isConnected;
    }

-    public abstract int read() throws IOException;
+    public void sendMessage(Object message, BiConsumer<Void, Throwable> listener) {


this one could accept WriteOperation and then we could just add the context and the listener to it via setters?

Tim-Brooks · 2018-04-13T01:59:22Z

I'll make a decision tomorrow, but I am leaning towards closing this PR. I'm not sure it has simplified the process to open a PR for abstractions without the Http work. I think I will probably just submit a PR that includes both the abstractions and the basic http work (with follow-up PRs for more advanced features like pipeline, cors, etc).

Tim-Brooks · 2018-04-17T15:24:19Z

Closing. Will submit different PR.

Tim-Brooks added 30 commits January 17, 2018 15:39

Add server context

17c904a

WIP

6c8cd44

Do not extend autocloseable

d9d995b

But keep close method

88bca9a

Remove multiple context getters

c4db506

WIP

4c3bf37

Merge remote-tracking branch 'upstream/master' into layer_http

dfd1901

Merge branch 'master' into layer_http

c4e1c50

Do not depend on netty module

b5a374a

Pull over tests

408c87c

Work on bytes producer

0f74aa0

Merge remote-tracking branch 'upstream/master' into layer_http

44be919

Comments

913028e

Merge remote-tracking branch 'upstream/master' into layer_http

df53494

WIP

9072a45

Merge remote-tracking branch 'upstream/master' into layer_http

5c89990

WIP

8bccda3

Merge remote-tracking branch 'upstream/master' into layer_http

26bdadf

Work on refactoring

99ad1fd

Continue op refactor

3540e5c

WIP

bcc776a

Work on fixing tests

1e86182

Work on tests

c6d01b7

get simple tests passing

234e965

WIP

4e9ab9d

Close write producer

6b16068

Move bytes writer

c3c9c4f

Move producer

fe4c961

Remove imports

8f5622d

Merge remote-tracking branch 'upstream/master' into layer_http

e7fc228

Tim-Brooks added 2 commits April 3, 2018 09:55

Remove http stuff

0bd3810

Extract stuff to super

6bd2d91

Tim-Brooks added >enhancement review :Distributed/Network Http and internode communication implementations v7.0.0 labels Apr 4, 2018

Tim-Brooks requested review from s1monw and jasontedor April 4, 2018 23:34

Tim-Brooks added 3 commits April 4, 2018 17:39

Fix tests

c761bce

Fix warning

e196257

Merge remote-tracking branch 'upstream/master' into remove_http

35daf28

s1monw reviewed Apr 5, 2018

View reviewed changes

Tim-Brooks requested a review from s1monw April 5, 2018 17:38

Changes based on review

ac7ba90

Tim-Brooks changed the title ~~Add FlushProducer for producing outbound bytes~~ Add class for serializing message to bytes Apr 11, 2018

s1monw reviewed Apr 12, 2018

View reviewed changes

Tim-Brooks closed this Apr 17, 2018

Tim-Brooks deleted the remove_http branch December 10, 2018 16:20

colings86 added v7.0.0-beta1 and removed v7.0.0 labels Feb 7, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add class for serializing message to bytes #29384

Add class for serializing message to bytes #29384

Tim-Brooks commented Apr 4, 2018 •

edited

Loading

elasticmachine commented Apr 4, 2018

s1monw Apr 5, 2018

Tim-Brooks Apr 5, 2018

Tim-Brooks Apr 5, 2018

s1monw Apr 9, 2018

s1monw Apr 10, 2018

Tim-Brooks Apr 10, 2018 •

edited

Loading

s1monw Apr 11, 2018

Tim-Brooks commented Apr 12, 2018

s1monw left a comment

s1monw Apr 12, 2018

s1monw Apr 12, 2018

Tim-Brooks commented Apr 13, 2018

Tim-Brooks commented Apr 17, 2018


		SocketChannelContext getChannel();
		public WriteOperation(SocketChannelContext channelContext, Object writeObject, BiConsumer<Void, Throwable> listener) {

Add class for serializing message to bytes #29384

Add class for serializing message to bytes #29384

Conversation

Tim-Brooks commented Apr 4, 2018 • edited Loading

elasticmachine commented Apr 4, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Tim-Brooks Apr 10, 2018 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Tim-Brooks commented Apr 12, 2018

s1monw left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Tim-Brooks commented Apr 13, 2018

Tim-Brooks commented Apr 17, 2018

Tim-Brooks commented Apr 4, 2018 •

edited

Loading

Tim-Brooks Apr 10, 2018 •

edited

Loading