Jae Hyeon Bae edited this page Sep 1, 2014 · 9 revisions

Suro is a distributed data pipeline that provides services for moving, aggregating, routing, and storing data. Its design focuses on easy configuration and operation across multiple data sources. It supports the following features:

  • The client can send any type of Java object to remote servers using a customizable SerDe, through either asynchronous or synchronous communication.
  • A Suro server can act as a Kafka consumer routing application when configured with a Kafka consumer input.
  • Message routing and sink behavior are determined by the routing key contained in each message and can be reconfigured dynamically without restarting the server.
  • The following sinks are currently provided:
    • Local file sink: Stores data on the local file system. It supports text files and Hadoop sequence files.
    • HDFS file sink: Uploads files generated by the local file sink to the Hadoop Distributed File System (HDFS).
    • S3 file sink: Uploads files generated by local file sink to AWS S3.
    • Kafka sink: Sends messages to a Kafka 0.8 broker.
    • ElasticSearch sink: Sends messages to an ElasticSearch v1.1.0 server.
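
The routing behavior described in the feature list can be illustrated with a minimal sketch. It is not Suro's actual API: `RoutingSketch`, `Message`, `Sink`, `MemorySink`, and `Router` are hypothetical names introduced only for illustration, and the in-memory sink stands in for the real file, Kafka, or ElasticSearch sinks. The sketch shows one plausible way to look up sinks by routing key and to swap the routing table at runtime without restarting anything.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical illustration only; these are not Suro's classes. */
final class RoutingSketch {

    /** A message tagged with the routing key that decides where it goes. */
    static final class Message {
        final String routingKey;
        final String payload;

        Message(String routingKey, String payload) {
            this.routingKey = routingKey;
            this.payload = payload;
        }
    }

    /** Anything that can accept a message (assumed interface). */
    interface Sink {
        void write(Message message);
    }

    /** Stand-in sink that just collects messages in memory. */
    static final class MemorySink implements Sink {
        final List<Message> received = new ArrayList<>();

        @Override
        public synchronized void write(Message message) {
            received.add(message);
        }
    }

    /** Dispatches each message to the sinks registered for its routing key. */
    static final class Router {
        // Replaced as a whole on reconfiguration, so readers never see a half-updated table.
        private volatile Map<String, List<Sink>> routes = Collections.emptyMap();

        /** Installs a new routing table; takes a defensive copy of the input. */
        void updateRoutes(Map<String, List<Sink>> newRoutes) {
            Map<String, List<Sink>> copy = new HashMap<>();
            for (Map.Entry<String, List<Sink>> e : newRoutes.entrySet()) {
                copy.put(e.getKey(), Collections.unmodifiableList(new ArrayList<>(e.getValue())));
            }
            routes = Collections.unmodifiableMap(copy);
        }

        /** Returns the number of sinks the message was delivered to. */
        int route(Message message) {
            List<Sink> sinks = routes.get(message.routingKey);
            if (sinks == null) {
                return 0; // no route configured for this key
            }
            for (Sink sink : sinks) {
                sink.write(message);
            }
            return sinks.size();
        }
    }

    public static void main(String[] args) {
        Router router = new Router();
        MemorySink archive = new MemorySink();
        MemorySink search = new MemorySink();

        Map<String, List<Sink>> routes = new HashMap<>();
        routes.put("request_trace", Arrays.<Sink>asList(archive, search));
        router.updateRoutes(routes);
        System.out.println("delivered to " + router.route(new Message("request_trace", "GET /")) + " sinks");

        // Reconfigure while running: the same key now goes to a single sink.
        routes.put("request_trace", Arrays.<Sink>asList(search));
        router.updateRoutes(routes);
        System.out.println("delivered to " + router.route(new Message("request_trace", "GET /home")) + " sinks");
    }
}
```

Publishing an immutable copy of the table through a `volatile` field is one simple way to make reconfiguration safe for concurrent readers; how Suro itself applies configuration changes is not specified here.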