[FLINK-8862] [HBase] Support HBase snapshot read #5639
Closed
What is the purpose of the change
The flink-hbase connector currently supports reading HBase only through region server scanners. HBase can also be scanned from a snapshot: Hadoop provides both approaches, TableInputFormat for region server scans and TableSnapshotInputFormat for snapshot scans. Supporting both in Flink would broaden the connector's use cases and give users an alternative.
Brief change log
TableInputSplitStrategy interface and its implementations as abstraction logic for AbstractTableInputFormat
HBaseRowInputFormat and TableInputFormat
HBaseSnapshotRowInputFormat and TableSnapshotInputFormat
HBaseTableScannerAware and ResultToTupleMapper
HBaseSnapshotReadExample
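To illustrate the idea behind the split-strategy abstraction listed above, here is a minimal, dependency-free sketch. It is not the PR's actual code: every type and method name below (`SplitSketch`, `SplitStrategySketch`, `createSplits`, and so on) is hypothetical, and the real `TableInputSplitStrategy` interface may look quite different. The sketch only shows how an input format could delegate split planning to a pluggable strategy, so that a region server scan and a snapshot scan share the same surrounding logic.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for an input split: a row-key range plus where it is read from.
final class SplitSketch {
    final String startKey;
    final String endKey;
    final String location;

    SplitSketch(String startKey, String endKey, String location) {
        this.startKey = startKey;
        this.endKey = endKey;
        this.location = location;
    }
}

// Hypothetical strategy interface; each region is given as {startKey, endKey, host}.
interface SplitStrategySketch {
    List<SplitSketch> createSplits(List<String[]> regions);
}

// Region server scan: each split is read through the server hosting the region.
final class RegionServerStrategySketch implements SplitStrategySketch {
    @Override
    public List<SplitSketch> createSplits(List<String[]> regions) {
        List<SplitSketch> splits = new ArrayList<>();
        for (String[] r : regions) {
            splits.add(new SplitSketch(r[0], r[1], r[2]));
        }
        return splits;
    }
}

// Snapshot scan: each split is read from files under a restore directory (assumed path).
final class SnapshotStrategySketch implements SplitStrategySketch {
    private final String restoreDir;

    SnapshotStrategySketch(String restoreDir) {
        this.restoreDir = restoreDir;
    }

    @Override
    public List<SplitSketch> createSplits(List<String[]> regions) {
        List<SplitSketch> splits = new ArrayList<>();
        for (String[] r : regions) {
            splits.add(new SplitSketch(r[0], r[1], restoreDir));
        }
        return splits;
    }
}

public class SplitStrategyDemo {
    public static void main(String[] args) {
        List<String[]> regions = Arrays.asList(
                new String[] {"a", "m", "rs1"},
                new String[] {"m", "z", "rs2"});
        SplitStrategySketch[] strategies = {
            new RegionServerStrategySketch(),
            new SnapshotStrategySketch("/tmp/restore")
        };
        // The caller plans splits the same way regardless of the strategy in use.
        for (SplitStrategySketch strategy : strategies) {
            for (SplitSketch s : strategy.createSplits(regions)) {
                System.out.println(s.startKey + ".." + s.endKey + " @ " + s.location);
            }
        }
    }
}
```

With this shape, the shared input format logic depends only on the strategy interface, and supporting a new read path means adding one more implementation.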
Verifying this change
This change is covered by the following existing tests, and new test cases have been added as well.
org.apache.flink.addons.hbase.HBaseConnectorITCase
This change added tests and can be verified as follows:
Does this pull request potentially affect one of the following parts:
@Public(Evolving): (yes / no)
Documentation