Flink


Website	http://flink.apache.org/
Supported versions	1.5.1 for Hadoop 2.8 and Scala 2.11
	1.5.1 for Hadoop 2.7 and Scala 2.11
	1.5.0 for Hadoop 2.8 and Scala 2.11
	1.5.0 for Hadoop 2.7 and Scala 2.11
	1.4.2 for Hadoop 2.8 and Scala 2.11
	1.4.2 for Hadoop 2.7 and Scala 2.11
	1.4.1 for Hadoop 2.8 and Scala 2.11
	1.4.1 for Hadoop 2.7 and Scala 2.11
	1.4.0 for Hadoop 2.8 and Scala 2.11
	1.4.0 for Hadoop 2.7 and Scala 2.11
	1.3.2 for Hadoop 2.7 and Scala 2.11
	1.3.1 for Hadoop 2.7 and Scala 2.11
	1.3.0 for Hadoop 2.7 and Scala 2.11
	1.2.1 for Hadoop 2.7 and Scala 2.11
	1.2.0 for Hadoop 2.7 and Scala 2.11
	1.1.4 for Hadoop 2.7 and Scala 2.11
	1.1.3 for Hadoop 2.7 and Scala 2.11
	0.10.2 for Hadoop 2.7 and Scala 2.11
	0.10.1 for Hadoop 2.7 and Scala 2.11
Current responsible(s)	Gezim Sejdiu @ UBO -- g.sejdiu@gmail.com
	Hajira Jabeen @ UBO -- hajirajabeen@gmail.com
	Ivan Ermilov @ InfAI -- ivan.s.ermilov@gmail.com
Docker image(s)	bde2020/flink-master:latest
	bde2020/flink-worker:latest
	bde2020/flink-submit:latest
	bde2020/flink-maven-template:latest
	bde2020/flink-sbt-template:latest
More info	https://ci.apache.org/projects/flink/flink-docs-release-0.10/apis/programming_guide.html

Short description

Apache Flink is an open source platform for distributed stream and batch data processing.

Flink’s core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams.

Flink includes several APIs for creating applications that use the Flink engine:

DataSet API for static data embedded in Java, Scala, and Python,
DataStream API for unbounded streams embedded in Java and Scala, and
Table API with a SQL-like expression language embedded in Java and Scala.

Flink also bundles libraries for domain-specific use cases:

CEP, a complex event processing library,
Machine Learning library, and
Gelly, a graph processing API and library.

You can integrate Flink easily with other well-known open source systems both for data input and output as well as deployment.

Example usage

Building and running your Flink application on top of the Flink cluster can be done by extending a template Docker image. Check the template's README for further documentation.

Scaling

DataSet and DataStream are fault-tolerant collections of elements that can be operated on in parallel. As a consequence, Flink applications scale automatically when augmenting the number of Flink worker nodes in the cluster.

Computational frameworks
- Flink
- Spark
- Storm
Data storage
- Hadoop
- Hue HDFS File Browser
- Cassandra
- Hive
- Redis
- Virtuoso
- 4store
- PostGIS
- Zeppelin
Data acquisition
- Flume
Message passing
- Kafka
Search engines
- Elasticsearch
- Solr
Semantic components
- DEER
- EDCAT
- FOX
- GeoTriples
- Silk
- Limes
- SEMAGROW engine
- Sextant
- Strabon
- UnifiedViews

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Flink

Short description

Example usage

Scaling

Home

BDE stack

Implementing pilot on BDE stack

Implementing pilot on BDI platform

Installation

Components

Clone this wiki locally