Per-vhost message store #567

Closed
michaelklishin opened this Issue Jan 20, 2016 · 6 comments

@michaelklishin
Member

michaelklishin commented Jan 20, 2016

This idea has been floating around since at least late 2012: we should use separate message stores for different vhosts, e.g. two per vhost (one for persistent and one for transient messages).

Some of the benefits are:

  • Better isolation of physical resources between virtual hosts (e.g. in the spirit of #498, #500, #501)
  • Improved availability should a message store terminate or significantly fall behind
  • (At least potentially) Less contention, more parallelism during normal operation of a multi-tenant node
  • Opens the door to moving different vhosts to different disks

Downsides:

  • Virtual host initialisation (or restart) will be more expensive. Most users don't seem to use a lot of virtual hosts, so this is a reasonable trade-off.
@michaelklishin
Member

michaelklishin commented Jan 20, 2016

effort-high because migration of existing data can take a while to cover well with tests. Otherwise this is effort-medium.

@hairyhum
Contributor

hairyhum commented Mar 9, 2016

What if we enable per-vhost message stores for new vhosts only, and recommend shovelling messages for migration?
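
For context, such a migration could be expressed with a static Shovel definition along these lines. This is a sketch only: the vhost names, queue name, and credentials are made up, and the exact keys depend on the Shovel plugin version in use.

%% rabbitmq.config excerpt (sketch): shovel messages from a queue in the
%% old vhost into the same-named queue in a vhost that uses the new store.
{rabbitmq_shovel,
 [{shovels,
   [{migrate_orders,
     [{sources,      [{broker, "amqp://guest:guest@localhost:5672/old-vhost"}]},
      {destinations, [{broker, "amqp://guest:guest@localhost:5672/new-vhost"}]},
      {queue, <<"orders">>},
      {prefetch_count, 100},
      {ack_mode, on_confirm}]}]}]}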

@michaelklishin
Member

michaelklishin commented Mar 9, 2016

I'm afraid that's not gonna fly with hosted RabbitMQ providers, and at least some Cloud Foundry users. It would be great to automatically migrate messages on boot (if we can).

@hairyhum hairyhum self-assigned this Mar 10, 2016

@hairyhum
Contributor

hairyhum commented Mar 11, 2016

Initial research:
Thanks to the rabbit_msg_store architecture, enabling per-vhost grouping is fairly easy: start the message stores in per-vhost subdirectories and control them with a supervisor (see the sketch below).
Migration of old data can still be a problem, since there is no vhost information in the message records.
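
A rough sketch of what such a supervision layout might look like. The module, function, and directory names here are hypothetical illustrations, not the actual RabbitMQ code; it assumes vhost names are binaries and a msg_store:start_link/1 that takes a directory.

%% Hypothetical sketch: one supervisor per vhost, starting a persistent and
%% a transient message store, each rooted in its own subdirectory.
-module(vhost_msg_store_sup).
-behaviour(supervisor).

-export([start_link/1, init/1]).

start_link(VHost) ->
    supervisor:start_link(?MODULE, [VHost]).

init([VHost]) ->
    %% Assumes vhost names are binaries and maps them straight to a directory.
    Dir = filename:join(["msg_stores", "vhosts", binary_to_list(VHost)]),
    Children =
        [#{id      => {msg_store, VHost, Type},
           start   => {msg_store, start_link, [filename:join(Dir, atom_to_list(Type))]},
           restart => transient,
           type    => worker}
         || Type <- [msg_store_persistent, msg_store_transient]],
    {ok, {#{strategy => one_for_one, intensity => 5, period => 10}, Children}}.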

@hairyhum
Contributor

hairyhum commented Apr 6, 2016

Migration mechanism proposal:

%% Pseudocode: start the old shared persistent store, then migrate each
%% durable queue's messages into its vhost's new store, in parallel.
move_messages_to_vhost_store() ->
    Queues = get_durable_queues(),
    OldStore = run_old_persistent_store(),
    Migrations = spawn_for_each(fun(Queue) ->
                                    migrate_queue(Queue, OldStore)
                                end, Queues),
    wait_for_completion(Migrations),
    delete_old_store(OldStore).

%% Walk the queue index and copy every message referenced from the old
%% shared store into the per-vhost store of the queue's vhost.
migrate_queue(Queue, OldStore) ->
    OldStoreClient = get_client(OldStore),
    NewStore       = ensure_new_store_exists(get_vhost(Queue)),
    NewStoreClient = get_client(NewStore),
    walk_queue_index(
        fun(MessageIdInStore) ->
            Msg = get_msg_from_store(MessageIdInStore, OldStoreClient),
            put_message_to_store(Msg, NewStoreClient)
        end,
        Queue).

@michaelklishin
Member

michaelklishin commented Jun 2, 2016

@hairyhum given that upgrades perform a backup of the node data dir, that sounds reasonable.

@michaelklishin michaelklishin added this to the 3.7.0 milestone Nov 12, 2016

michaelklishin added a commit that referenced this issue Jun 23, 2017

Don't use export_all
It emits a warning and breaks compilation on OTP 20.
Originally introduced in 492e23b,
likely unintentionally.

Fixes #1272, references #567, #766.

hairyhum added a commit that referenced this issue Jul 24, 2018

Prepare bindings table in a separate recover function.
Before #567 all binding recover functions were executed once on
node restart.
Executing table_filter can be expensive with a high number of vhosts
and bindings, and it effectively should be called only once.
Moved to a separate function, which should be called from the global
recovery function (`rabbit_vhost:recover/0`).

michaelklishin added a commit that referenced this issue Jul 25, 2018

Prepare bindings table in a separate recover function.
Before #567 all binding recover functions were executed once on
node restart.
Executing table_filter can be expensive with a high number of vhosts
and bindings, and it effectively should be called only once.
Moved to a separate function, which should be called from the global
recovery function (`rabbit_vhost:recover/0`).

(cherry picked from commit e5f4cbd)
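
A minimal sketch of the shape of the change described in these commit messages, assuming hypothetical function names (only rabbit_vhost:recover/0 is taken from the commit message; the rest is illustrative):

%% Sketch only: the expensive bindings table preparation runs once per node
%% start from the global recovery function, instead of once per vhost.
recover() ->
    prepare_bindings_table(),                      %% runs table_filter once
    [recover_vhost(V) || V <- list_vhost_names()], %% per-vhost recovery
    ok.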