File system offload #4403

Merged: 20 commits merged into apache:master from the file_system_offload branch on Jul 3, 2019

Conversation

@congbobo184 (Contributor) commented May 29, 2019

Fixes #3216
Implementation of offload to HDFS

Motivation

Implementation of offload to HDFS

Verifying this change

Added tests for this change.

Does this pull request potentially affect one of the following parts?
If yes was chosen, please highlight the changes.

Dependencies (does it add or upgrade a dependency): (no)
The public API: (no)
The schema: (no)
The default values of configurations: (no)
The wire protocol: (no)
The REST endpoints: (no)
The admin CLI options: (no)
Anything that affects deployment: (yes)

Documentation

Does this pull request introduce a new feature? (yes / no)
If yes, how is the feature documented? (not applicable / docs / JavaDocs / not documented)
If a feature is not applicable for documentation, explain why.
If a feature is not yet documented in this PR, please create a follow-up issue for adding the documentation.

@jiazhai changed the title from "File system offload" to "(WIP) File system offload" on May 29, 2019
@merlimat added this to the 2.5.0 milestone on May 29, 2019
@sijie (Member) left a comment

@congbobo184

I think the approach in this pull request follows what we have done for cloud storage. It is an okay approach.

However, I would suggest using what the Hadoop ecosystem already provides; there are many file formats in the Hadoop ecosystem. In particular, I would suggest SequenceFile, the most common file format in the Hadoop ecosystem. SequenceFile already supports multiple types of compression, which we don't need to reinvent.

https://hadoop.apache.org/docs/r2.7.5/api/org/apache/hadoop/io/SequenceFile.html

So you could have two SequenceFiles per offloaded ledger: one for data, the other for the index.
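
A minimal sketch of that two-file layout (hypothetical paths and helper names, not code from this PR; entry ids are used as LongWritable keys):

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.BytesWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.SequenceFile;
import org.apache.hadoop.io.SequenceFile.CompressionType;

public class SequenceFileOffloadSketch {
    // Writes a ledger's entries into a per-ledger "data" SequenceFile keyed by
    // entry id; an "index" SequenceFile (entry id -> offset) would be written
    // the same way. Block compression comes built in, nothing to reinvent.
    static void writeData(Configuration conf, Path dataFile,
                          long[] entryIds, byte[][] payloads) throws Exception {
        try (SequenceFile.Writer writer = SequenceFile.createWriter(conf,
                SequenceFile.Writer.file(dataFile),
                SequenceFile.Writer.keyClass(LongWritable.class),
                SequenceFile.Writer.valueClass(BytesWritable.class),
                SequenceFile.Writer.compression(CompressionType.BLOCK))) {
            for (int i = 0; i < entryIds.length; i++) {
                writer.append(new LongWritable(entryIds[i]), new BytesWritable(payloads[i]));
            }
        }
    }
}
```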

@congbobo184 (Contributor, Author) replied, quoting @sijie's SequenceFile suggestion above:

OK, I'm going to look at the API and update my PR.

@codelipenghui changed the title from "(WIP) File system offload" to "File system offload" on Jun 13, 2019
@sijie (Member) left a comment

Great job! @congbobo184

It is much simpler after using MapFile. Well done!

Overall looks pretty good. I left a few comments. PTAL!
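
For context on why MapFile simplifies things: a Hadoop MapFile is a directory containing a sorted "data" SequenceFile plus an "index" file that Hadoop maintains itself, so the offloader no longer needs its own index format. A minimal sketch (hypothetical names, not this PR's code):

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.BytesWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.MapFile;
import org.apache.hadoop.io.SequenceFile;

public class MapFileOffloadSketch {
    // MapFile requires keys to be appended in increasing order, which ledger
    // entry ids already satisfy; the index file is maintained automatically.
    static void offloadEntries(Configuration conf, Path mapFileDir,
                               long[] entryIds, byte[][] payloads) throws Exception {
        try (MapFile.Writer writer = new MapFile.Writer(conf, mapFileDir,
                MapFile.Writer.keyClass(LongWritable.class),
                SequenceFile.Writer.valueClass(BytesWritable.class))) {
            for (int i = 0; i < entryIds.length; i++) {
                writer.append(new LongWritable(entryIds[i]), new BytesWritable(payloads[i]));
            }
        }
    }
}
```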

<groupId>org.apache.pulsar</groupId>
<artifactId>tiered-storage-parent</artifactId>
<version>2.4.0-SNAPSHOT</version>
<relativePath>../../../../congbo184/pulsar/tiered-storage</relativePath>
@sijie (Member):

:) the path seems to be wrong?

<dependency>
<groupId>org.apache.hadoop</groupId>
<artifactId>hadoop-hdfs-client</artifactId>
<version>3.2.0</version>
@sijie (Member):

@congbobo184 can you define the dependencies in the root pom file?

</dependency>
<dependency>
<groupId>org.eclipse.jetty</groupId>
<artifactId>jetty-server</artifactId>
@sijie (Member):

Define the dependencies in the root pom file.

* Configuration for tiered storage.
*/
@Data
public class TieredStorageConfigurationData implements Serializable, Cloneable {
@sijie (Member):

How about calling it FileSystemConfigurationData? This class is only about the FileSystem offloader, right?

reader.get(key, value);
this.ledgerMetadata = parseLedgerMetadata(value.copyBytes());
} catch (IOException e) {
log.error("Fail to read LedgerMetadata for key {}",
@sijie (Member):

Since this code path is entirely about reading metadata, I don't think you need to add "for key -1" to the log statement and the exception error message. Adding "for key -1" is actually a bit confusing.

@jiazhai (Member) commented Jun 13, 2019:

Maybe we could define a constant for -1?
e.g.
public static final long METADATA_KEY_INDEX = -1 ?

@congbobo184 (Contributor, Author):

Yes, I will fix it.
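
A minimal sketch of that constant in use, against the Hadoop MapFile.Reader API (only METADATA_KEY_INDEX comes from the review; the surrounding names are illustrative):

```java
import org.apache.hadoop.io.BytesWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.MapFile;

public class MetadataKeySketch {
    // Reserved key under which the serialized ledger metadata is stored;
    // it cannot collide with real entry ids, which are >= 0.
    public static final long METADATA_KEY_INDEX = -1L;

    static byte[] readLedgerMetadata(MapFile.Reader reader) throws Exception {
        BytesWritable value = new BytesWritable();
        reader.get(new LongWritable(METADATA_KEY_INDEX), value);
        return value.copyBytes();
    }
}
```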


private static final Logger log = LoggerFactory.getLogger(FileSystemManagedLedgerOffloader.class);
private static final String STORAGE_BASE_PATH = "storageBasePath";
private static final String[] DRIVER_NAMES = {"filesystem"};
@sijie (Member):

Do you think we will have multiple drivers?

}
this.configuration.setClassLoader(FileSystemLedgerOffloaderFactory.class.getClassLoader());
this.driverName = conf.getManagedLedgerOffloadDriver();
this.storageBasePath = configuration.get("hadoop.tmp.dir");
@sijie (Member):

Should we use a different configuration key, "pulsar.offloaded.segments.dir"?

@congbobo184 (Contributor, Author) replied Jun 13, 2019:

If we used a different key, we would have to read it and then write it back, like
configuration.set("hadoop.tmp.dir", configuration.get("pulsar.offloaded.segments.dir"));
which feels a little strange.
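
A small sketch of how such an alias could resolve with a fallback (the Pulsar-specific key name is the reviewer's hypothetical, not an actual configuration key):

```java
import org.apache.hadoop.conf.Configuration;

public class OffloadDirSketch {
    static final String PULSAR_KEY = "pulsar.offloaded.segments.dir"; // hypothetical
    static final String HADOOP_KEY = "hadoop.tmp.dir";

    // Prefer the Pulsar-specific key, but fall back to hadoop.tmp.dir so that
    // plain Hadoop configurations keep working unchanged.
    static String resolveStorageBasePath(Configuration conf) {
        String dir = conf.get(PULSAR_KEY);
        return dir != null ? dir : conf.get(HADOOP_KEY);
    }
}
```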

return;
}
long ledgerId = readHandle.getId();
String storagePath = getStoragePath(storageBasePath, extraMetadata.get("ManagedLedgerName"));
@sijie (Member):

Can you define a constant for ManagedLedgerName?

IOUtils.closeStream(dataWriter);
promise.complete(null);
} catch (InterruptedException | ExecutionException | IOException e) {
log.error("Exception when get CompletableFuture<LedgerEntries>. ", e);
@sijie (Member):

Can you improve the logging statement to provide more meaningful messages?
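
For example (a sketch only; ledgerId and storagePath are assumed to be in scope in the offload loop):

```java
import java.util.concurrent.CompletableFuture;
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;

public class OffloadLoggingSketch {
    private static final Logger log = LoggerFactory.getLogger(OffloadLoggingSketch.class);

    // Instead of the generic "Exception when get CompletableFuture<LedgerEntries>",
    // say what failed, for which ledger, and where the data was being written.
    static void failOffload(CompletableFuture<Void> promise, long ledgerId,
                            String storagePath, Throwable e) {
        log.error("Failed to read entries of ledger {} while offloading to {}",
                ledgerId, storagePath, e);
        promise.completeExceptionally(e);
    }
}
```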

try {
MapFile.Reader reader = new MapFile.Reader(new Path(dataFilePath),
configuration);
promise.complete(FileStoreBackedReadHandleImpl.open(scheduler.chooseThread(ledgerId), reader, ledgerId));
@sijie (Member):

It seems that we are sharing a scheduler for both reading and writing. If a ledger is offloading, it will block the reading operations. Can we use different executors for reading and writing?
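
A minimal sketch of that separation using BookKeeper's OrderedScheduler (thread counts and names are illustrative, not this PR's final code):

```java
import org.apache.bookkeeper.common.util.OrderedScheduler;

public class OffloaderSchedulersSketch {
    // Dedicated pools, so long-running offload writes cannot starve reads.
    private final OrderedScheduler writeScheduler = OrderedScheduler.newSchedulerBuilder()
            .numThreads(4).name("offload-write").build();
    private final OrderedScheduler readScheduler = OrderedScheduler.newSchedulerBuilder()
            .numThreads(4).name("offload-read").build();

    // chooseThread(ledgerId) keeps per-ledger ordering within each pool.
    public void submitWrite(long ledgerId, Runnable task) {
        writeScheduler.chooseThread(ledgerId).submit(task);
    }

    public void submitRead(long ledgerId, Runnable task) {
        readScheduler.chooseThread(ledgerId).submit(task);
    }
}
```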

@congbobo184 (Contributor, Author):

I'll think it over :)

@congbobo184 commented: run cpp tests

@congbobo184 commented: run cpp tests

@congbobo184 commented: run cpp tests

@congbobo184 commented: run java8 tests

1 similar comment

@congbobo184 commented: run java8 tests

@sijie (Member) left a comment

@congbobo184 well done!

@sijie merged commit 0e807f1 into apache:master on Jul 3, 2019
@congbobo184 deleted the file_system_offload branch on July 16, 2019
easyfan pushed a commit to easyfan/pulsar that referenced this pull request on Jul 26, 2019 (the commit message mirrors the PR description above).