ParaFlow is an interactive analysis system for OLAP developed at DBIIR Lab @ RUC.
Hadoop file system is required.
This is required by Kafka. what need to deploy is simply config the cluster ip and port;
-
MetaServer(one node)
-
Loader [cn.edu.ruc.iir.paraflow.example.loader.BasicLoader]
config the ./paraflow-loader.sh then:
./sbin/paraflow-loader.sh deploy -
Collector [cn.edu.ruc.iir.paraflow.example.loader.BasicCollector]
config the ./paraflow-collector.sh then:
./sbin/paraflow-collector.sh deploy -
Presto connector
- Create user and database in pg for metadata.
CREATE USER paraflow WITH PASSWORD 'paraflow';
CREATE DATABASE paraflowmeta;
GRANT ALL ON DATABASE paraflowmeta TO paraflow.
- Start Zookeeper cluster
- Start Kafka
- Start PostgreSql
- Start Paraflow MetaServer
./bin/paraflow-metaserver-start.sh [-daemon] - Start Paraflow Loader
./sbin/paraflow-loader.sh start - Start Paraflow Collector
./sbin/paraflow-collector.sh start - Start Presto cluster or single node to execute queries;