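HBase and Elasticsearch batch read/write/update tests, plus tests of batch-updating Elasticsearch from HBase table scans
Data scale: 10+ million rows, 1000+ columns, multiple data formats
HBase insert, update, and query (batch)
For the meaning of each parameter, see HBaseInit.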
> nohup java -jar hbase-project.jar \
-itn ${insertThreadNum} \
-utn ${updateThreadNum} \
-stn ${selectThreadNum} \
-table ${tableName} \
-cf ${cf} \
-region ${region} \
-offset ${offset} \
-is ${insertSize} \
-us ${updateSize} \
-ss ${selectSize} \
-batch ${batch} > hbase-project.log 2>&1 &
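The flags above are documented in HBaseInit, which is not reproduced here. As a rough illustration only, the sketch below assumes that `-offset` is the first row index, `-is` is the total number of rows to insert, `-itn` is the number of insert threads, and `-batch` is the number of rows per batch request. These readings are assumptions, and `BatchPlanner` is a hypothetical class that is not part of hbase-project.jar.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper for illustration; not part of hbase-project.jar. */
final class BatchPlanner {

    /** Half-open range of row indexes: [start, end). */
    static final class Range {
        final long start;
        final long end;

        Range(long start, long end) {
            this.start = start;
            this.end = end;
        }

        @Override
        public String toString() {
            return "[" + start + ", " + end + ")";
        }
    }

    /** Splits [offset, offset + total) into threadNum contiguous ranges whose sizes differ by at most one. */
    static List<Range> splitAcrossThreads(long offset, long total, int threadNum) {
        if (total < 0 || threadNum <= 0) {
            throw new IllegalArgumentException("total must be >= 0 and threadNum must be > 0");
        }
        List<Range> ranges = new ArrayList<>();
        long base = total / threadNum;
        long remainder = total % threadNum;
        long start = offset;
        for (int i = 0; i < threadNum; i++) {
            // The first `remainder` threads each take one extra row.
            long size = base + (i < remainder ? 1 : 0);
            ranges.add(new Range(start, start + size));
            start += size;
        }
        return ranges;
    }

    /** Cuts one thread's range into batches of at most batchSize rows. */
    static List<Range> splitIntoBatches(Range range, int batchSize) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be > 0");
        }
        List<Range> batches = new ArrayList<>();
        for (long s = range.start; s < range.end; s += batchSize) {
            batches.add(new Range(s, Math.min(s + batchSize, range.end)));
        }
        return batches;
    }

    public static void main(String[] args) {
        // Assumed example values: offset 0, 10 rows, 3 threads, batches of 2 rows.
        for (Range perThread : splitAcrossThreads(0, 10, 3)) {
            System.out.println(perThread + " -> " + splitIntoBatches(perThread, 2));
        }
    }
}
```

Under these assumptions each thread owns a disjoint row range, so threads never write the same row, and every batch request carries at most `batch` rows.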
# HBase connection
hbase.zookeeper.quorum=${zk.quorum}
hbase.zookeeper.property.clientPort=${zk.port}
hbase.rootdir=${hbase.rootdir}
hbase.fs.defaultFS=${hbase.fs.defaultFS}
hbase.client.scanner.timeout.period=${hbase.client.scanner.timeout.period}
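The `${...}` values above are placeholders to be filled in before running. The sketch below shows one way such a file could be loaded and checked with `java.util.Properties` from the JDK. The class `HBaseConnectionProps` and the rule that unresolved placeholders are reported as problems are assumptions for illustration; how the project itself reads these settings is not shown here.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;
import java.util.Properties;

/** Hypothetical validator for illustration; not part of the project. */
final class HBaseConnectionProps {

    /** The five keys listed in the connection settings above. */
    static final String[] REQUIRED = {
        "hbase.zookeeper.quorum",
        "hbase.zookeeper.property.clientPort",
        "hbase.rootdir",
        "hbase.fs.defaultFS",
        "hbase.client.scanner.timeout.period"
    };

    /** Parses properties text in the standard key=value format. */
    static Properties parse(String text) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(text));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return props;
    }

    /** Returns one message per missing key or unresolved ${...} placeholder. */
    static List<String> problems(Properties props) {
        List<String> problems = new ArrayList<>();
        for (String key : REQUIRED) {
            String value = props.getProperty(key);
            if (value == null || value.trim().isEmpty()) {
                problems.add("missing: " + key);
            } else if (value.contains("${")) {
                problems.add("unresolved placeholder: " + key);
            }
        }
        return problems;
    }

    public static void main(String[] args) {
        // Example values are made up for the demonstration.
        Properties props = parse(
            "hbase.zookeeper.quorum=zk1,zk2,zk3\n"
            + "hbase.zookeeper.property.clientPort=${zk.port}\n");
        problems(props).forEach(System.out::println);
    }
}
```

Failing fast on a missing or unfilled value is cheaper than discovering it as a connection timeout after a long-running job has already started.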
HBase insert: HBaseInsert
HBase update: HBaseUpdate
HBase query: HBaseSelect
HBase scan: HBaseStatistics
- HBase Scan
HBaseUtils hBaseUtils = HBaseUtils.getInstance();
hBaseUtils.printScan(tableName, cf);
- HBase ClientSideScan
ClientSideScanner clientSideScanner = new ClientSideScanner();
clientSideScanner.tableScan(tableName, cf);
Elasticsearch insert, update, and query (batch)
For the meaning of each parameter, see ElasticInit.
> nohup java -jar elasticsearch-project.jar \
-itn ${insertThreadNum} \
-utn ${updateThreadNum} \
-stn ${selectThreadNum} \
-index ${index} \
-type ${type} \
-offset ${offset} \
-is ${insertSize} \
-us ${updateSize} \
-ss ${selectSize} \
-batch ${batch} \
-delete ${deleteExist} > elasticsearch-project.log 2>&1 &
# elastic connection info
es.cluster.name=${es.cluster.name}
es.node.ip=${es.node.ip}
es.node.port=${es.node.port}
Elasticsearch insert: ElasticInsert
Elasticsearch update: ElasticUpdate
Elasticsearch query: ElasticQuery
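The batch insert and update classes themselves are not shown here. For illustration, the sketch below builds the newline-delimited JSON body accepted by Elasticsearch's REST `_bulk` endpoint, using only the JDK. This is a swapped-in technique: the connection settings above (cluster name, node IP, port) suggest the project may use a native client instead. `BulkBody` and its methods are hypothetical, and the `_type` field is included only because the command line takes `-type`; recent Elasticsearch versions no longer accept mapping types.

```java
/** Hypothetical builder for illustration; not part of elasticsearch-project.jar. */
final class BulkBody {
    private final StringBuilder body = new StringBuilder();
    private final String index;
    private final String type;

    BulkBody(String index, String type) {
        this.index = index;
        this.type = type;
    }

    /** Adds an "index" action. sourceJson must be a single-line JSON object. */
    BulkBody index(String id, String sourceJson) {
        appendAction("index", id);
        body.append(sourceJson).append('\n');
        return this;
    }

    /** Adds a partial "update" action. partialDocJson must be a single-line JSON object. */
    BulkBody update(String id, String partialDocJson) {
        appendAction("update", id);
        body.append("{\"doc\":").append(partialDocJson).append("}\n");
        return this;
    }

    /** Writes the action metadata line that precedes each document line. */
    private void appendAction(String action, String id) {
        body.append("{\"").append(action).append("\":{")
            .append("\"_index\":\"").append(escape(index)).append("\",")
            .append("\"_type\":\"").append(escape(type)).append("\",")
            .append("\"_id\":\"").append(escape(id)).append("\"}}\n");
    }

    /** Escapes quotes, backslashes, and control characters for a JSON string. */
    static String escape(String s) {
        StringBuilder out = new StringBuilder();
        for (char c : s.toCharArray()) {
            if (c == '"' || c == '\\') {
                out.append('\\').append(c);
            } else if (c < 0x20) {
                out.append(String.format("\\u%04x", (int) c));
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }

    /** Returns the body; every line, including the last, ends with a newline. */
    String build() {
        return body.toString();
    }

    public static void main(String[] args) {
        // Index name, type, ids, and documents are made-up example values.
        System.out.print(new BulkBody("test_index", "doc")
            .index("1", "{\"name\":\"a\"}")
            .update("1", "{\"name\":\"b\"}")
            .build());
    }
}
```

Grouping many actions into one request is what the `-batch` size would control under this reading: one HTTP round trip per batch rather than one per document.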
Flink scans HBase and writes to Elasticsearch
Flink Elasticsearch connector
Spark scans HBase and writes to Elasticsearch