ParaFlow is an interactive analysis system for OLAP developed at DBIIR Lab @ RUC.
A Hadoop file system (HDFS) is required.
ZooKeeper is required by Kafka. To deploy it, simply configure the cluster IP and port.
-
MetaServer (one node)
-
Loader [cn.edu.ruc.iir.paraflow.example.loader.BasicLoader]
Configure ./paraflow-loader.sh, then run:
./sbin/paraflow-loader.sh deploy
-
Collector [cn.edu.ruc.iir.paraflow.example.loader.BasicCollector]
Configure ./paraflow-collector.sh, then run:
./sbin/paraflow-collector.sh deploy
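
The two deploy steps above can be chained so that a failure in the first stops the second. The following is a minimal sketch, not part of ParaFlow: the function name `deploy_paraflow` and the `DRY_RUN` switch are hypothetical names introduced for this example, and it assumes both scripts live under `./sbin` as in the commands above and have already been configured.

```shell
#!/bin/sh
# Sketch: deploy the Loader, then the Collector.
# DRY_RUN (hypothetical, defaults to 1) prints the commands instead of running them.
DRY_RUN="${DRY_RUN:-1}"

deploy_paraflow() {
    for component in loader collector; do
        script="./sbin/paraflow-$component.sh"
        if [ "$DRY_RUN" = "1" ]; then
            # Only show what would be executed.
            echo "$script deploy"
        else
            # Stop at the first deploy step that fails.
            "$script" deploy || return 1
        fi
    done
}

deploy_paraflow
```

Run it with `DRY_RUN=0` to execute the deploy scripts for real.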
-
Presto connector
- Create a user and a database in PostgreSQL for metadata:
CREATE USER paraflow WITH PASSWORD 'paraflow';
CREATE DATABASE paraflowmeta;
GRANT ALL ON DATABASE paraflowmeta TO paraflow;
- Start the ZooKeeper cluster
- Start Kafka
- Start PostgreSQL
- Start the ParaFlow MetaServer
./bin/paraflow-metaserver-start.sh [-daemon]
- Start the ParaFlow Loader
./sbin/paraflow-loader.sh start
- Start the ParaFlow Collector
./sbin/paraflow-collector.sh start
- Start a Presto cluster or a single Presto node to execute queries.
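
The metadata setup and the ParaFlow start commands listed above can be collected in one script. This is a minimal sketch under stated assumptions, not a script shipped with ParaFlow: the names `paraflow_meta_sql`, `run_step`, `start_paraflow`, and `DRY_RUN` are hypothetical, the PostgreSQL superuser is assumed to be called `postgres`, and ZooKeeper, Kafka, and PostgreSQL are assumed to be running already because their start commands depend on the installation.

```shell
#!/bin/sh
# DRY_RUN (hypothetical, defaults to 1) prints the commands instead of running them.
DRY_RUN="${DRY_RUN:-1}"

# Print the SQL that creates the metadata user and database.
# Usage (assumes a superuser named "postgres"):
#   paraflow_meta_sql | psql -U postgres
paraflow_meta_sql() {
    cat <<'SQL'
CREATE USER paraflow WITH PASSWORD 'paraflow';
CREATE DATABASE paraflowmeta;
GRANT ALL ON DATABASE paraflowmeta TO paraflow;
SQL
}

# Run one command, or only print it when DRY_RUN=1.
run_step() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@" || { echo "failed: $*" >&2; return 1; }
    fi
}

# Start the ParaFlow services in the documented order:
# MetaServer first, then Loader, then Collector.
start_paraflow() {
    run_step ./bin/paraflow-metaserver-start.sh -daemon &&
    run_step ./sbin/paraflow-loader.sh start &&
    run_step ./sbin/paraflow-collector.sh start
}

start_paraflow
```

With the default `DRY_RUN=1` the script only lists the three start commands; set `DRY_RUN=0` to execute them. Chaining with `&&` means the Loader and Collector are not started if the MetaServer script reports a failure.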