[Do not merge] First cut at dynamic sharding support #392

emmanuelbernard · 2013-04-06T21:06:32Z

I need to write down tests to finish that up but @Sanne could you give me your first impression on the proposal. I did not have to break any existing public interface. I even reuse the IndexShardingStrategy.

If you have some advice on making the structure more scalable concurrency wise, I'm interested too.

…vider

Sanne · 2013-04-08T23:16:46Z

+1 very clever approach :)

Generally speaking, don't you think the ShardIdentifierProvider should superseed the IndexShardingStrategy ?
It's great we have a backwards compatible migration path with the dual-design but I'm wondering if we shouldn't deprecate IndexShardingStrategy or if it makes sense to keep both around.

I add some more details in the code as comments.

Sanne · 2013-04-08T23:21:32Z

...-search-engine/src/main/java/org/hibernate/search/engine/impl/EntityIndexBindingFactory.java

+																			  IndexManagerHolder indexManagerHolder,
+																			  IndexManagerFactory indexManagerFactory) {
+		if ( !isDynamicSharding && providers.length == 0 ) {
+			throw log.entityWithNoShard( type );


Why is a zero-shards entity invalid?
I'm wondering about the multitenancy use case, if the application should not be able to deploy with an initial state of zero tenants.

Queries would return zero elements; while on an add operation the ShardIdentifierProvider has to return an id anyway.. which would trigger creation of an indexmanager.

0 shard is not legal for static shards as ti will always stay 0 :). In case of dynamic sharding (value set to dynamic), we don't throw the exception.

right :)
misinterpreted the condition.

Sanne · 2013-04-08T23:46:05Z

I've added many more comments on the code - this is just to make sure you see them as I'm not sure it will notify you as I closed the issue initially.

emmanuelbernard · 2013-04-09T08:48:39Z

I did not think about deprecating IndexShardingStrategy but looking at it now, I could not find a good reason to keep it except maybe to eagerly initialize the IndexManagers but if the lazy code is good enough that might not be necessary.

Sanne · 2013-04-09T09:13:41Z

We could use ShardIdentifierProvider#getAllShardIdentifiers for that?

Since we validate IndexManager configuration options only when it's started it would be good to be eager on initializing those for which it's possible.

Sanne · 2013-04-09T09:16:12Z

Different thought: looks like the user code should be able to interact directly with the ShardIdentifierProvider instance, right?

I mean in the multi-tenant use case you want to be able to let it know you are creating a new tenant.

another nice use case coming to mind is per-language independent indexes.. would it make sense to combine two levels of dynamic sharding?

emmanuelbernard · 2013-04-09T13:02:36Z

Different thought: looks like the user code should be able to interact directly with the ShardIdentifierProvider instance, right?
I mean in the multi-tenant use case you want to be able to let it know you are creating a new tenant.

Yes, in an ideal world ShardIdentifierProvider could have CDI injection points but worse case, threadlocals could be used. You're saying we should provide access to the ShardIdentifierProvider instance so that a user could call specific methods to it

MyImpl provider = (MyImpl) searchFactory.getMetadata().getEntityMetadata(User.class).getShardIdentifierProvider();

Not entirely satisfactory, would be nice to enlist a callback.

another nice use case coming to mind is per-language independent indexes.. would it make sense to combine two levels of dynamic sharding?

Dow we need to have built-in layers of sharding or leave that to the ShardIdentifierProvider implementor? It seems to be the later.

emmanuelbernard · 2013-04-09T13:34:39Z

I could not find a good reason to keep it except maybe to eagerly initialize the IndexManagers

It turns out, when building the SF, we call EIB.getIndexManagers() and thus eagerly initialize the indexes. so we are good.

Sanne · 2013-04-09T16:35:03Z

what you called

searchFactory.getMetadata().getEntityMetadata(User.class)

is ~ available today as

org.hibernate.search.spi.SearchFactoryIntegrator.getIndexBindingForEntity(Class<?>)

but I agree the getMetada() is looking better.. just we don't have that yet.

Sanne · 2013-04-10T09:33:05Z

From today's forum posts, I'm convinced it would be awesome to have a dynamic sharding option working out of the box for multiple languages, extending the use case we documented for dynamic analyzers:

http://docs.jboss.org/hibernate/search/4.2/reference/en-US/html_single/#d0e3840

As the tricky part for the above link is actually running queries on the appropriate index: needs to use a shard sensitive filter.
http://docs.jboss.org/hibernate/search/4.2/reference/en-US/html_single/#query-filter-shard

emmanuelbernard added 6 commits April 6, 2013 16:32

Isolate EntityIndexBinder creation in a factory

921837c

Prepare IndexManagerHolder for dynamic sharding

5a4aba1

Split MutableEntityIndexBinding into interface / implementation

c7d4ce9

Implement dynamic sharding strategy and introduce ShardIndentifierPro…

4564aab

…vider

Import reordering / cleanup

6d430a2

Make IndexManagerHolder offer the ability to add IndexManagers lazily

796a3fe

ghost assigned Sanne Apr 6, 2013

Sanne closed this Apr 8, 2013

Sanne reviewed Apr 8, 2013
View reviewed changes

Sanne reopened this Apr 8, 2013

Sanne closed this Apr 8, 2013

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Do not merge] First cut at dynamic sharding support #392

[Do not merge] First cut at dynamic sharding support #392

emmanuelbernard commented Apr 6, 2013

Sanne commented Apr 8, 2013

Sanne Apr 8, 2013

emmanuelbernard Apr 9, 2013

Sanne Apr 9, 2013

Sanne commented Apr 8, 2013

emmanuelbernard commented Apr 9, 2013

Sanne commented Apr 9, 2013

Sanne commented Apr 9, 2013

emmanuelbernard commented Apr 9, 2013

emmanuelbernard commented Apr 9, 2013

Sanne commented Apr 9, 2013

Sanne commented Apr 10, 2013

[Do not merge] First cut at dynamic sharding support #392

[Do not merge] First cut at dynamic sharding support #392

Conversation

emmanuelbernard commented Apr 6, 2013

Sanne commented Apr 8, 2013

Sanne Apr 8, 2013

Choose a reason for hiding this comment

emmanuelbernard Apr 9, 2013

Choose a reason for hiding this comment

Sanne Apr 9, 2013

Choose a reason for hiding this comment

Sanne commented Apr 8, 2013

emmanuelbernard commented Apr 9, 2013

Sanne commented Apr 9, 2013

Sanne commented Apr 9, 2013

emmanuelbernard commented Apr 9, 2013

emmanuelbernard commented Apr 9, 2013

Sanne commented Apr 9, 2013

Sanne commented Apr 10, 2013