DRAFT: DBZ-121 Added options for how change events are mapped to topics #129

rhauch · 2016-11-01T22:19:23Z

The default is to map all change events in each table to a separate topic, but this commit adds the ability to map all change events for all tables in a database to a database-specific topic, or to use a custom mapping via a TopicMappingProvider implementation.

NOTE: This is still incomplete and not thoroughly tested.

hchiorean · 2016-11-04T07:26:21Z

@rhauch: I've also added a TOPIC_SELECTION_STRATEGY config option for the PG connector, with the options PER_TABLE (default) and PER_SCHEMA. This PR contains the missing pieces in TableSchemaBuilder to support this feature (i.e. the correct event key schema and value generator), so I will not attempt to make the same or similar changes to TableSchemaBuilder, but will rather rely on this once it's merged.

rhauch · 2016-11-04T12:00:58Z

@hchiorean Awesome. Any preferences or suggested changes to names of anything? I do like TOPIC_SELECTION_STRATEGY better than TOPIC_MAPPING.

hchiorean · 2016-11-04T13:00:05Z

The only comment I would have, based on the currently proposed PR changes, is that I'm not convinced the custom TopicMappingProvider option is worth the added complexity. For PG, there are only two options that make sense: per table and per schema.

As far as integrating with the rest of the code, it's hard to say until I actually get around to doing that, so I'm fine with it in its current form.

rhauch · 2016-11-04T13:46:19Z

@hchiorean, agreed. Ideally we can completely hide how this is implemented, and instead use a configuration option that specifies something like per-table, per-database, per-schema, or whatever reads well. We can expand these if/when we add new built-in strategies.

DBZ-121 not only expands how the connector maps events to topics, from just topic-per-table to topic-per-database and topic-per-schema as well (whichever makes sense for the DBMS), it also calls for a way to deal with sharded tables. Originally, I was thinking that dealing with shards is really just a different topic mapping strategy, since you may want to annotate the key to do this. There are so many different approaches to sharding that I'm not sure we can have a built-in approach that works for everyone. Thus the need for specifying a custom mapping strategy.

But just because we could use a custom mapping strategy doesn't mean the configuration should require an implementation class name. This still needs to be changed/refined.
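
A minimal sketch of what a name-based option could look like, assuming a hypothetical enum with the two strategies discussed for PG. The enum name, the `parse` helper, and the `prefix.schema.table` topic layout are assumptions for illustration, not the connector's actual API.

```java
import java.util.Locale;

/** Hypothetical sketch: built-in strategies selected by name, not by class name. */
enum TopicSelectionStrategy {
    PER_TABLE {
        @Override
        String topicFor(String prefix, String schema, String table) {
            // Assumed layout: one topic per table.
            return prefix + "." + schema + "." + table;
        }
    },
    PER_SCHEMA {
        @Override
        String topicFor(String prefix, String schema, String table) {
            // Assumed layout: all tables in a schema share one topic.
            return prefix + "." + schema;
        }
    };

    abstract String topicFor(String prefix, String schema, String table);

    /** Parses a config value such as "per-table" or "PER_SCHEMA"; blank means the default. */
    static TopicSelectionStrategy parse(String value) {
        if (value == null || value.trim().isEmpty()) {
            return PER_TABLE;
        }
        String normalized = value.trim().toUpperCase(Locale.ROOT).replace('-', '_');
        // Throws IllegalArgumentException for unknown strategy names.
        return TopicSelectionStrategy.valueOf(normalized);
    }
}

final class TopicSelectionDemo {
    public static void main(String[] args) {
        TopicSelectionStrategy strategy = TopicSelectionStrategy.parse("per-schema");
        System.out.println(strategy.topicFor("server1", "public", "orders"));
    }
}
```

New built-in strategies would then be added as further enum constants, without changing how the option is configured.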

dasl- · 2017-02-17T22:33:32Z

debezium-core/src/main/java/io/debezium/relational/topic/TopicMappingProvider.java

+        }
+
+        /**
+         * Get the schema of the keys for all messages produced from the table.


docblock seems a bit weird... "Get" implies to me something other than a void return.

dasl- · 2017-02-17T22:34:20Z

debezium-core/src/main/java/io/debezium/relational/topic/TopicMappingProvider.java

+        void enhanceKeySchema(SchemaBuilder keySchemaBuilder);
+
+        /**
+         * Get the key for the row defined by the specified


docblock seems incomplete here.

dasl- · 2017-02-17T22:46:24Z

debezium-core/src/main/java/io/debezium/relational/topic/ByTableTopicMapping.java

+
+    @Override
+    public TopicMapping getMapper(String prefix, Table table) {
+        String topicName = prefix + table.id().toString();


This is supposed to be equivalent to the current dbz approach, right? AFAICT, the topic name in current logic ends with tableName, which is populated via tableId.table(), which is equivalent to table.id().table().

But in this new logic, the topic name ends with table.id().toString(). Is that equivalent?
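
Whether the two forms are equivalent depends on what `toString()` returns. A minimal sketch, assuming a hypothetical identifier class whose `toString()` includes the catalog name; `SketchTableId` is a stand-in for illustration and may not match the real `TableId` behavior.

```java
/** Hypothetical stand-in for a table identifier; not Debezium's actual TableId. */
final class SketchTableId {
    private final String catalog;
    private final String table;

    SketchTableId(String catalog, String table) {
        this.catalog = catalog;
        this.table = table;
    }

    /** Returns only the unqualified table name. */
    String table() {
        return table;
    }

    /** Assumption: the string form is the qualified name. */
    @Override
    public String toString() {
        return catalog + "." + table;
    }
}

final class TopicNameComparison {
    static String endingWithTableName(String prefix, SketchTableId id) {
        return prefix + id.table();
    }

    static String endingWithFullId(String prefix, SketchTableId id) {
        return prefix + id.toString();
    }

    public static void main(String[] args) {
        SketchTableId id = new SketchTableId("inventory", "orders");
        System.out.println(endingWithTableName("server1.inventory.", id));
        System.out.println(endingWithFullId("server1.", id));
        System.out.println(endingWithFullId("server1.inventory.", id));
    }
}
```

Under that assumption the two only agree if the prefix passed to `getMapper` omits whatever qualifier `toString()` already contributes.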

dasl- · 2017-02-17T22:55:28Z

debezium-core/src/main/java/io/debezium/relational/topic/TopicMappingProvider.java

+ * @author Randall Hauch
+ *
+ */
+public interface TopicMappingProvider {


Why is the Provider abstraction necessary? Compared to the old approach, it seems like it just adds another layer needlessly to the abstraction. Could users specify in the config a TopicMapping rather than a TopicMappingProvider?

dasl- · 2017-02-17T23:07:06Z

debezium-core/src/main/java/io/debezium/relational/topic/ByDatabaseTopicMapping.java

+        }
+
+        @Override
+        public void enhanceKeySchema(SchemaBuilder keySchemaBuilder) {


might be nice to add docs about why this enhancement is necessary.

dasl- · 2017-02-17T23:10:37Z

debezium-core/src/main/java/io/debezium/relational/topic/TopicMappingProvider.java

+         * 
+         * @return the key's schema name; may not be null
+         */
+        default String getKeySchemaName() {


Why are we providing a way to customize the KeySchemaName and ValueSchemaName? As I understand it, the motivation for this PR was custom DML topic mappings. That doesn't seem to require customizing the KeySchemaName or ValueSchemaName...

That said, I guess it doesn't hurt to add that customization feature.

dasl- · 2017-02-17T23:11:44Z

I'm excited about these changes. Let me know if I can help!

rhauch · 2017-02-18T15:05:39Z

@dasl- I hope to get back to this soon. It is possible to limit the changes so that someone can customize only the names of the topics, but not the table-to-topic mapping. This proposal tries to be even more flexible by allowing multiple tables to be mapped to a single topic, and to do this we'll likely need to augment the message keys, since a table's primary key is no longer sufficient to distinguish a row in table A from a row in table B with the same primary key.

Having said all this, though, I've not really looked at this PR for some time, and I may want to do things differently. Any chance you want to take a whack at it, even if it's just a start/concept?
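
The key collision described above can be sketched with plain maps instead of Kafka Connect schemas. The `__table` field name and the `augment` helper are hypothetical; the PR's `enhanceKeySchema` hook presumably plays a similar role, but this is not its implementation.

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical sketch: distinguishing rows from different tables in one shared topic. */
final class SharedTopicKeys {
    /** Assumed name for the extra key field; not taken from the PR. */
    static final String TABLE_FIELD = "__table";

    /** Returns a new key holding the table name followed by the primary key columns. */
    static Map<String, Object> augment(String tableName, Map<String, Object> primaryKey) {
        Map<String, Object> key = new LinkedHashMap<>();
        key.put(TABLE_FIELD, tableName);
        key.putAll(primaryKey);
        return key;
    }

    public static void main(String[] args) {
        Map<String, Object> rowInA = Collections.<String, Object>singletonMap("id", 1);
        Map<String, Object> rowInB = Collections.<String, Object>singletonMap("id", 1);

        // The bare primary keys are indistinguishable once both tables share a topic.
        System.out.println(rowInA.equals(rowInB));
        // With the table name added, the keys differ.
        System.out.println(augment("A", rowInA).equals(augment("B", rowInB)));
    }
}
```

Distinct keys matter here because consumers and log compaction treat records with equal keys as versions of the same row.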

dasl- · 2017-02-22T17:47:45Z

@rhauch My approach would probably be very similar to the one you took here. I would like to allow multiple tables to map to a single topic also.

I will try applying this patch to see how it works. Perhaps I will follow up with a code review that includes some additions.

rhauch · 2017-04-04T17:57:50Z

Closed without merging. Instead, we've gone with using Single Message Transforms per #211.

DBZ-121 Added support for mapping change events for all tables in a database to a database-specific topic, or using a custom mapping.

189b6a1

rhauch mentioned this pull request Nov 3, 2016

DRAFT: DBZ-121 Add flexible topic naming strategy to MySQL connector (concept 1) #124

Closed

rhauch changed the title ~~DBZ-121 Added options for how change events are mapped to topics~~ DRAFT: DBZ-121 Added options for how change events are mapped to topics Feb 10, 2017

dasl- reviewed Feb 17, 2017

This was referenced Mar 1, 2017

WORK IN PROGRESS: Dbz-121 #193

Closed

DBZ-121 take2 #194

Closed

rhauch closed this Apr 4, 2017

ARostov mentioned this pull request Mar 12, 2019

Enable TOPIC_SELECTION_STRATEGY for postgres connector #807

Closed
