Introducing slave groups, can be used concurrently with sharding #239

mumoshu · 2014-04-18T08:56:29Z

Hi, @tchandy and commiters

First of all, I would like to thank you for sharing this great library 😄
At CrowdWorks, we are using Octopus in production and it is working like a charm 👍

In this pull request, I'm introducing "slave groups" to make Octopus distribute queries to a specific group of slaves.
They can be used concurrently with shards, e.g. merging this pull request also adds "Replication + Sharding" in your TODO list.
This change is backward-compatible.

A slave group is a group of slaves to which queries are distributed.
In other words, all the slaves in the same slave group are load-balanced.
Octopus does not load-balance between slaves across multiple slave groups.

An usecase of slave groups is to distribute/completely seperate loads of SELECT queries from each part of an application to seperate databases.
For example, consider that you have a Web application consists of the A part which serves Web pages for users, and the B part which serves Web pages for admins.
If you just send heavy SELECT queries while serving for admins, they will degrade performance serving Web pages for other users.
This is where slave groups come in.
You can configure seperate slave groups for A and B to make them not affect each other's performance.

You can configure any number of slave groups in shards.yml.
To configure two slave groups slave_group_a and slave_group_b for the development environment, you need shards.yml like:

octopus:
  replicated: true
  environments:
    - development
  development:
    slave_group_a:
      slave1:
        # ...
      slave2:
        # ...
    slave_group_b:
      slave3:
        # ...
      slave4:
        # ...

Then:

In Octopus.using(slave_group: :slave_group_a) blocks, all the SELECT queries are distributed between slave1 and slave2
In Octopus.using(slave_group: :slave_group_b) blocks, all the SELECT queries are distribuetd between slave3 and slave4
Without using, all the SELECT queries go to the development database configured in database.yml

To use with sharding, you need shards.yml like:

octopus:
  replicated: true
  environments:
    - development
  development:
    shard_1:
      slave_group_a:
        slave1:
          # ...
        slave2:
          # ...
      slave_group_b:
        slave3:
          # ...
        slave4:
          # ...
    shard_2:
      slave_group_a:
        slave5:
          # ...

Then:

In Octopus.using(shard: :shard_1, slave_group: :slave_group_a) blocks, all the SELECT queries are distributed between slave1 and slave2
In Octopus.using(shard: :shard_1, slave_group: :slave_group_b) blocks, all the SELECT queries are distribuetd between slave3 and slave4
Without using, all the SELECT queries go to the development database configured in database.yml

…tion + Sharding support

…ific group of slaves. They can be used concurrently with shards. This change is backward-compatible. A slave group is a group of slaves to which queries are distributed. In other words, all the slaves in the same slave group are load-balanced. Octopus does not load-balance between slaves across multiple slave groups. An usecase of slave groups is to distribute/completely seperate loads of `SELECT` queries from each part of an application to seperate databases. For example, consider that you have a Web application consists of the A part which serves Web pages for users, and the B part which serves Web pages for admins. If you just send heavy `SELECT` queries while serving for admins, they will degrade performance serving Web pages for other users. This is where slave groups come in. You can configure seperate slave groups for A and B to make them not affect each other's performance. You can configure any number of slave groups in `shards.yml`. To configure two slave groups `slave_group_a` and `slave_group_b` for the `development` environment, you need something like: octopus: replicated: true environments: - development development: slave_group_a: slave1: # ... slave2: # ... slave_group_b: slave3: # ... slave4: # ... Then: * In `Octopus.using(slave_group: :slave_group_a)` blocks, all the `SELECT` queries are distributed between `slave1` and `slave2` * In `Octopus.using(slave_group: :slave_group_b)` blocks, all the `SELECT` queries are distribuetd between `slave3` and `slave4` * Without `using`, all the `SELECT` queries go to the development database configured in `database.yml`

thiagopradi · 2014-04-18T19:47:56Z

hey @mumoshu, awesome feature!

I'll ask the other maintainers to review it. Could you add some documentation about this feature to the README file? Maybe overwriting the section about sharding + replication.

Thanks

nickmarden · 2014-04-19T00:26:18Z

@tchandy I can spend some time reviewing this over the weekend. Please assign me to the PR?

…r the Sharding + Replication scenario

mumoshu · 2014-04-19T09:24:48Z

Hi, @tchandy and @nickmarden 😄

Could you add some documentation about this feature to the README file?

Sure!
And I have just added a commit to update README with a small description of slave groups which also mentions a Wiki page for further information.
May I have comments for it? I'm glad to update it so that everyone can try this feature with less hassle.

Thanks

… of doing so by matching keys

… more DRY

mumoshu · 2014-04-22T13:19:29Z

@nickmarden

Thank you for reviewing!
I have just added commits in answer to your reviews.
Would you mind reviewing once again?

nickmarden · 2014-04-22T18:33:55Z

@mumoshu: I have some things to do today and tomorrow, but I will look again tomorrow night or Thursday morning.

nickmarden · 2014-04-23T20:36:01Z

Looks good, merging.

Introducing slave groups, can be used concurrently with sharding

mumoshu · 2014-04-24T03:13:24Z

@nickmarden

Thanks for merging!

mumoshu · 2014-04-24T03:25:59Z

Hi @tchandy

Would you mind releasing it as 0.8.x?

As I have experiences in managing gems, I think I can do it for you if you make me as an additional owner for the gem.

Thanks

mumoshu added 2 commits April 15, 2014 15:02

Extract the round-robin algorithm out of Proxy to prepare for Replica…

9787a41

…tion + Sharding support

thiagopradi assigned nickmarden Apr 19, 2014

Update README with description of Slave Groups and its utilization fo…

5bbc8dc

…r the Sharding + Replication scenario

mumoshu mentioned this pull request Apr 22, 2014

The "using" method doesn't cycle through an array of followers #235

Closed

Yusuke KUOKA added 5 commits April 22, 2014 22:15

Integrate an unnecessary local variable into an instance variable

f4e21d7

When analyzing shards.yml, structurally detect slave groups instead…

0cd1d28

… of doing so by matching keys

Make send_queries_to_* methods a bit more DRY

64b015f

Make run_queries_on_shard and send_queries_to_slave methods a bit…

c5e2d4d

… more DRY

Make should_send_queries_to_* methods a bit more DRY

6288e4e

nickmarden added a commit that referenced this pull request Apr 23, 2014

Merge pull request #239 from mumoshu/slave-groups-with-optional-sharding

c8a01d8

Introducing slave groups, can be used concurrently with sharding

nickmarden merged commit c8a01d8 into thiagopradi:master Apr 23, 2014

sobrinho mentioned this pull request Aug 10, 2015

replicated_model is not working (on Heroku at least) #317

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introducing slave groups, can be used concurrently with sharding #239

Introducing slave groups, can be used concurrently with sharding #239

mumoshu commented Apr 18, 2014

thiagopradi commented Apr 18, 2014

nickmarden commented Apr 19, 2014

mumoshu commented Apr 19, 2014

mumoshu commented Apr 22, 2014

nickmarden commented Apr 22, 2014

nickmarden commented Apr 23, 2014

mumoshu commented Apr 24, 2014

mumoshu commented Apr 24, 2014

Introducing slave groups, can be used concurrently with sharding #239

Introducing slave groups, can be used concurrently with sharding #239

Conversation

mumoshu commented Apr 18, 2014

thiagopradi commented Apr 18, 2014

nickmarden commented Apr 19, 2014

mumoshu commented Apr 19, 2014

mumoshu commented Apr 22, 2014

nickmarden commented Apr 22, 2014

nickmarden commented Apr 23, 2014

mumoshu commented Apr 24, 2014

mumoshu commented Apr 24, 2014