Transport: allow to de-serialize arbitrary objects given their name #12393

javanna · 2015-07-22T10:33:41Z

This commit makes it possible to serialize arbitrary objects by having them extend Writeable. When reading them though, we need to be able to identify which object we have to create, based on its name. This is useful for queries once we move to parsing on the coordinating node, as well as with aggregations and so on.

Introduced a new abstraction called NamedWriteable, which is supported by StreamOutput and StreamInput through writeNamedWriteable and readNamedWriteable methods. A new NamedWriteableRegistry is introduced also where named writeable prototypes need to be registered so that we are able to retrieve the proper instance of the writeable given its name and then de-serialize it calling readFrom against it.

Note that this same change was previously reviewed and pushed to the query-refactoring branch (#11553), the goal of this PR is to backport the change to master.

The main question is whether we should use this mechanism for exceptions or not. It seems like it would require a class per exception, maybe a bit too verbose compared to the switch that we have in StreamInput#readThrowable and StreamOutput#writeThrowable. Looking for a second opinion there.

This commit makes it possible to serialize arbitrary objects by having them extend Writeable. When reading them though, we need to be able to identify which object we have to create, based on its name. This is useful for queries once we move to parsing on the coordinating node, as well as with aggregations and so on. Introduced a new abstraction called NamedWriteable, which is supported by StreamOutput and StreamInput through writeNamedWriteable and readNamedWriteable methods. A new NamedWriteableRegistry is introduced also where named writeable prototypes need to be registered so that we are able to retrieve the proper instance of the writeable given its name and then de-serialize it calling readFrom against it.

colings86 · 2015-07-22T13:13:03Z

LGTM but I'm not too familiar with the Netty stuff so might be worth getting someone to look at that bit. I don't have a strong opinion on the Throwable stuff but it might be nice if everything uses the same mechanism.

javanna · 2015-07-22T14:01:16Z

maybe @jpountz can have a look?

jpountz · 2015-07-22T17:18:08Z

The main question is whether we should use this mechanism for exceptions or not. It seems like it would require a class per exception, maybe a bit too verbose compared to the switch that we have in StreamInput#readThrowable and StreamOutput#writeThrowable. Looking for a second opinion there.

I like the current logic as it prevents from being tempted to make exception types pluggable?

jpountz · 2015-07-22T17:20:26Z

core/src/main/java/org/elasticsearch/common/io/stream/NamedWriteable.java

+    /**
+     * Returns the name of the writeable object
+     */
+    String getWriteableName();


Or just getName()?

it is getName on the query-refactoring branch, but then classes that implement this interface might already have a getName method (e.g. aggs), which is why I went for this more specific method name.

jpountz · 2015-07-22T17:30:31Z

I'm a bit concerned that this pull request adds coupling between StreamInput and NamedWriteableRegistry on one hand, and transport modules and NamedWriteableRegistry on the other hand.

Could we only have this NamedWriteableRegistry handling in a subclass of StreamInput so that StreamInput would remain unaware of NamedWriteableRegistry? Also I think it would be cleaner if every component maintained its own registry (could be eg. one for queries, one for aggs, etc.) instead of sharing a single one for everything?

javanna · 2015-07-31T08:41:34Z

Replaced by #12571 .

This commit makes it possible to serialize arbitrary objects by having them extend Writeable. When reading them though, we need to be able to identify which object we have to create, based on its name. This is useful for queries once we move to parsing on the coordinating node, as well as with aggregations and so on. Introduced a new abstraction called NamedWriteable, which is supported by StreamOutput and StreamInput through writeNamedWriteable and readNamedWriteable methods. A new NamedWriteableRegistry is introduced also where named writeable prototypes need to be registered so that we are able to retrieve the proper instance of the writeable given its name and then de-serialize it calling readFrom against it. Closes elastic#12393

javanna added >enhancement v2.0.0-beta1 review labels Jul 22, 2015

jpountz reviewed Jul 22, 2015
View reviewed changes

javanna mentioned this pull request Jul 31, 2015

Transport: allow to de-serialize arbitrary objects given their name #12571

Merged

javanna removed v2.0.0-beta1 review labels Jul 31, 2015

javanna closed this Jul 31, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Transport: allow to de-serialize arbitrary objects given their name #12393

Transport: allow to de-serialize arbitrary objects given their name #12393

javanna commented Jul 22, 2015

colings86 commented Jul 22, 2015

javanna commented Jul 22, 2015

jpountz commented Jul 22, 2015

jpountz Jul 22, 2015

javanna Jul 23, 2015

jpountz commented Jul 22, 2015

javanna commented Jul 31, 2015

Transport: allow to de-serialize arbitrary objects given their name #12393

Transport: allow to de-serialize arbitrary objects given their name #12393

Conversation

javanna commented Jul 22, 2015

colings86 commented Jul 22, 2015

javanna commented Jul 22, 2015

jpountz commented Jul 22, 2015

jpountz Jul 22, 2015

Choose a reason for hiding this comment

javanna Jul 23, 2015

Choose a reason for hiding this comment

jpountz commented Jul 22, 2015

javanna commented Jul 31, 2015