i-maintenance/DB-Adapter

Kafka to Logstash Adapter with an integrated status webserver, part of the Data Analytics Stack

This component subscribes to topics of the Apache Kafka message broker and forwards the messages to the Logstash instance of a running ELK Stack. Optionally, the adapter maps IDs with the SensorThings server in order to provide plain and intuitive data channel management.

The Kafka Adapter builds on Apache Kafka, the Logstash instance of the ELK Stack, and, optionally, the SensorThings server.

Contents

  1. Requirements
  2. Usage
  3. Configuration
  4. Troubleshooting

Requirements

  1. Install Docker version 1.10.0+
  2. Install Docker Compose version 1.6.0+
  3. Clone this repository
  4. To deploy this adapter on a cluster, a Docker swarm is required.

Usage

Make sure the Elastic Stack is running on the same host before you start the DB-Adapter. You can test the Elastic Stack in your browser at http://hostname:5601/status.
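
Alternatively, check the same endpoint from the command line:

curl -s http://hostname:5601/status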

Testing

Using docker-compose:

cd /DB-Adapter
sudo docker-compose up --build -d

The flag -d runs the stack in the background (detached mode).

Watch the logs with:

sudo docker-compose logs -f

By default, the stack exposes the following ports:

  • 3030: DB-Adapter HTTP
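
To verify that the status webserver is reachable, query it from the host (assuming it serves its status page on the root path; the exact path may differ):

curl http://localhost:3030/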

Deployment in a Docker swarm

Using docker stack:

If not already done, create a registry instance to which the image can be pushed:

cd /DB-Adapter
sudo docker service create --name registry --publish published=5001,target=5000 registry:2
curl 127.0.0.1:5001/v2/

This should output {}.

If everything works with docker-compose, run the following script to push the image and deploy the services in the swarm:

cd ../DB-Adapter
./start_adapter.sh
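
The script roughly corresponds to the following steps (a sketch only; consult start_adapter.sh in the repository for the authoritative commands):

sudo docker-compose build
sudo docker-compose push
sudo docker stack deploy --compose-file docker-compose.yml db-adapter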

Check that everything is working with:

./show_adapter.sh
sudo docker service ls
sudo docker stack ps db-adapter
sudo docker service logs db-adapter_kafka -f

By default, the stack exposes the following ports:

  • 3030: DB-Adapter HTTP

Configuration

The basic configuration of the adapter is done in the .env file. Set the Kafka topics the adapter should listen on, along with the other settings shown below.

# Versions of needed packages
LIBRDKAFKA_VERSION=0.11.1
CONFLUENT_KAFKA_VERSION=0.9.1.2

# Kafka parameters:
# Separate entries with ","
KAFKA_TOPICS=SensorData,Malfunctions
BOOTSTRAP_SERVERS=192.168.48.61,192.168.48.62,192.168.48.63
# Should be a string and unique in the kafka cluster
KAFKA_GROUP_ID=il060

# Logstash parameters
LOGSTASH_HOST=192.168.48.60
LOGSTASH_PORT=5000

# Enable the Kafka adapter and the mapping via SensorThings, and set the SensorThings server URL
enable_kafka_adapter=true
enable_sensorthings=true
ST_SERVER=http://192.168.48.60:8082/v1.0/

It is recommended to use the hostname of the instance as KAFKA_GROUP_ID, so that it is unique within the Kafka cluster. The Kafka broker tracks, per group ID, the offset of the data that has already been consumed. If messages are not consumed while streaming, Kafka retains them for up to 6 weeks, and you can consume the missed data by temporarily changing KAFKA_GROUP_ID to another value.
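
For example, you could replay retained messages by temporarily switching to a fresh group ID in .env and restarting the adapter (il060-replay is just an illustrative value):

sed -i 's/^KAFKA_GROUP_ID=.*/KAFKA_GROUP_ID=il060-replay/' .env
sudo docker-compose up --build -d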

Troubleshooting

  • Can't apt-get update in Dockerfile:

Restart the Docker service:

sudo service docker restart

or create the file /etc/docker/daemon.json with the following content:

{
    "dns": ["your_dns", "8.8.8.8"]
}

where your_dns can be found with the command:

nmcli device show <interfacename> | grep IP4.DNS
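
For example, if nmcli reports 192.168.48.1 (a hypothetical value), the file would look like:

{
    "dns": ["192.168.48.1", "8.8.8.8"]
}

Restart Docker afterwards with sudo service docker restart.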

  • Traceback with non-zero code 4 or 128:

Restart the Docker service with sudo service docker restart, or add your DNS address as described above.

  • Elasticsearch crashes instantly:

Check the permissions of elasticsearch/data:

sudo chown -R USER:USER .
sudo chmod -R 777 .

or remove redundant Docker installations, or reinstall Docker.

  • Error starting userland proxy: listen tcp 0.0.0.0:3030: bind: address already in use

Bring down the other services, or change the host's port number in docker-compose.yml.

Find all running containers with:

sudo docker ps
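
If the port is held by a non-Docker process, you can identify it on most Linux systems with:

sudo ss -tlnp | grep 3030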

  • Errors while removing Docker containers:

Remove redundant Docker installations.

  • "entire heap max virtual memory areas vm.max_map_count [...] likely too low, increase to at least [262144]"

Run on host machine:

sudo sysctl -w vm.max_map_count=262144

  • Redis warning about vm.overcommit_memory:

Run on the host:

sysctl vm.overcommit_memory=1
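
To make these kernel settings survive a reboot, persist them in /etc/sysctl.conf (a standard Linux mechanism, independent of this repository):

echo 'vm.max_map_count=262144' | sudo tee -a /etc/sysctl.conf
echo 'vm.overcommit_memory=1' | sudo tee -a /etc/sysctl.conf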

  • Redis warning: "WARNING you have Transparent Huge Pages (THP) support enabled in your kernel."

This warning can be safely ignored.