[NEW] Support slot-based data migration #412

ChrisZMF · 2021-12-01T13:15:09Z

1 Background

Data online migration is an essential feature for database servers if they are deployed in the cluster. Since kvrocks already supports redis cluster mode #219, it is more necessary to support data online migration. Different from the key-based data migration method of the Redis community, we proposed a slot-based data migration method for kvrocks.

2 Implementation

2.1 Data encoding format

To support slot-based migration, we need to encode slotid onto every key to improve the efficiency of iterating data. The string key and hash key are adopted to explain the slotid encoding for simple key and complex key respectively as follows.

String key:

    +--------+----+---------+----------+
    | ns_len | ns | slot_id | user_key |
    +--------+----+---------+----------+

Hash key:

    hash metakey
    +--------+----+---------+----------+
    | ns_len | ns | slot_id | user_key |
    +--------+----+---------+----------+
    hash subkey
    +--------+----+---------+--------------+----------+---------+-------+
    | ns_len | ns | slot_id | user_key_len | user_key | Version | field |
    +--------+----+---------+--------------+----------+---------+-------+

As shown above, slotid is encoded onto the prefix of every key. Keys of the same slot will have the same prefix, they will be stored in an adjacent location in rocksdb which will improve data iterating efficiency. Encoding slotid onto keys has been supported at #291 for kvrocks.

2.2 Slot-based migration brief design

Slot-based migration process mainly includes the following stages.

Start migrating
Migrating existing data
Migrating incremental data
End migrating

The main process of slot-based migration can be described in the following diagram.

Figure 1. Slot-based migration process diagram

2.3 Detail implementation

2.3.1 Details of migrating process

As shown in the process diagram (Figure 1), the data migration will be triggered by sending a request to the source server. The source server will create a migration task after it got the data migration request. The main processes of slot-based migration are processed by this migration task. The details can be described as the following stages.

1) Start migrating stage

At this stage, the source server will notify the destination server to prepare to import data. If the destination server is ready, the source server will go to the next stage to migrate data. Otherwise, the source server will stop the migration task.

2) Migrating existing data stage

At this stage, the existing data will be migrated. Existing data of kvrocks is described by rocksdb snapshot at the migration beginning moment. Then, the source server will iterate all data of the snapshot, and construct data into Redis commands to send to the destination server. Constructed Redis commands will be sent by pipeline to improve efficiency.

3) Migrating incremental data stage

While migrating existing data, the migrating slot can keep writing. In other words, new data will be written during migrating existing data. These new data must be migrated to the destination server too. Before migrating incremental data, the migrating slot will be forbidden to write to maintain consistency. New data of the target slot cannot be written to the source server again.
The amount of the incremental data may be very large, because it may take a long time to migrate the existing data. It will cause the slot to forbid writing for a long time. To reduce the forbidden writing time duration, the incremental data migration will be processed in the two-step.

First step: Slot will not be forbidden writing while migrating incremental data. This step will be repeated until the amount of new data is less than a threshold, or the repetition times reach a threshold.
Second step: Slot will be forbidden from writing before migrating the rest new data.
The incremental data will be gotten via iterating WAL of rocksdb.

4) End migrating stage

The previous stages may succeed or fail. In this stage, the source server will notify the destination server that the migration task succeeded or failed. If the migration is successful, both source server and the destination server will change the cluster topology maintained by themslves, and source server will clear data belonging to migreted slot. If the migration fails, only destination server will clear imported data of the target slot.

2.3.2 Support commands

CLUSTERX MIGRATE $slot $dst_nodeid
$dst_nodeid is the node id of destination server in the cluster. See [NEW] Support redis cluster mode #219 for more details.
CLUSTER IMPORT $slot $state
It is an internal command which will be sent by the source server to notify the destination server to prepare for data importing. This command cannot be used directly by clients.

3 Advantages and Disadvantages

3.1 Advantages

Efficiency
Compare with the key-based data migration method, the slot-based method supports data transmission with the pipeline, it is more efficient.
More convenient failure rollback
Data in the source server can be deleted only when all data is migrated successfully. If any failure happens during data migrating, the migration task will be stopped, data won't be deleted.
Consistency
Writing to the migrating slot is forbidden while transmitting the last piece of incremental data. It can guarantee data consistency.
asynchronous migration
Data migrating is processed in an independent thread, and does not affect the main threads to process requests.

3.2 Disadvantages

Currently, it only supports to migrate slot one by one.

4 Extra work

Support concurrent data migration.

The text was updated successfully, but these errors were encountered:

ShooterIT · 2021-12-02T11:51:10Z

Since we encode slot id into key only when enabling cluster mode, so i think we should only support slot migration in cluster mode, and the migrate command should be consistent with cluster command.

i prefer to use CLUSTER subcommand to implement slot migration instead of separate commands , see also redis issue redis/redis#2807 and in redis cluster v2 project redis/redis#8948, they also want to support this.

ChrisZMF · 2021-12-17T02:29:12Z

Thanks for your suggestion, it really makes a lot of sense. The implementation of slot-based migration will be modified to adapt to the cluster mode. @ShooterIT

A new command CLUSTERX MIGRATE is used for migrate slot data, slot-based migration process mainly includes the following stages: migrating existing data and migrating incremental data. Command format: CLUSTERX MIGRATE $slot $dst_nodeid - $slot is the slot which is to migrate - $dst_nodeid is the node id of destination server in the cluster. We also introduce an internal command CLUSTER IMPORT for importing the migrating slot data into destination server. Migration status are shown into the output of CLUSTER INFO command. After migration slot, you also should use CLUSTERX SETSLOT command to change cluster slot distribution. For more details, please see #412 and #430

A new command CLUSTERX MIGRATE is used for migrate slot data, slot-based migration process mainly includes the following stages: migrating existing data and migrating incremental data. Command format: CLUSTERX MIGRATE $slot $dst_nodeid - $slot is the slot which is to migrate - $dst_nodeid is the node id of destination server in the cluster. We also introduce an internal command CLUSTER IMPORT for importing the migrating slot data into destination server. Migration status are shown into the output of CLUSTER INFO command. After migration slot, you also should use CLUSTERX SETSLOT command to change cluster slot distribution. For more details, please see apache#412 and apache#430

A new command CLUSTERX MIGRATE is used for migrate slot data, slot-based migration process mainly includes the following stages: migrating existing data and migrating incremental data. Command format: CLUSTERX MIGRATE $slot $dst_nodeid - $slot is the slot which is to migrate - $dst_nodeid is the node id of destination server in the cluster. We also introduce an internal command CLUSTER IMPORT for importing the migrating slot data into destination server. Migration status are shown into the output of CLUSTER INFO command. After migration slot, you also should use CLUSTERX SETSLOT command to change cluster slot distribution. For more details, please see #412 and #430

git-hulk added A-cluster area cluster major decision Requires project management committee consensus feature type new feature labels Dec 1, 2021

ShooterIT added the release notes label Dec 2, 2021

ChrisZMF mentioned this issue Dec 17, 2021

Support slot-based data migration #430

Merged

ShooterIT closed this as completed in #430 Jan 27, 2022

ChrisZMF mentioned this issue Feb 7, 2022

Avoid accessing slot_migrate_ before it is created #472

Merged

PokIsemaine mentioned this issue Jul 1, 2024

feat(cluster): support migrate slot range #2389

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[NEW] Support slot-based data migration #412

[NEW] Support slot-based data migration #412

ChrisZMF commented Dec 1, 2021 •

edited

Loading

ShooterIT commented Dec 2, 2021

ChrisZMF commented Dec 17, 2021

[NEW] Support slot-based data migration #412

[NEW] Support slot-based data migration #412

Comments

ChrisZMF commented Dec 1, 2021 • edited Loading

1 Background

2 Implementation

2.1 Data encoding format

2.2 Slot-based migration brief design

2.3 Detail implementation

2.3.1 Details of migrating process

1) Start migrating stage

2) Migrating existing data stage

3) Migrating incremental data stage

4) End migrating stage

2.3.2 Support commands

3 Advantages and Disadvantages

3.1 Advantages

3.2 Disadvantages

4 Extra work

ShooterIT commented Dec 2, 2021

ChrisZMF commented Dec 17, 2021

ChrisZMF commented Dec 1, 2021 •

edited

Loading