Deploy a custom 3-node Apache NiFi cluster on Kubernetes with the following added features:
- Logstash/Fluentd JSON logging support
- Prometheus reporting task that publishes NiFi JVM and flow metrics to a Prometheus Pushgateway
Cluster coordination is handled by an external ZooKeeper deployment in the same namespace as NiFi. Make sure ZooKeeper is already running before proceeding to the next step.
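To verify ZooKeeper is up before continuing, a quick check along these lines can help (the `app=zookeeper` label, the `<namespace>` placeholder, and the `zookeeper-0` pod name are assumptions about your ZooKeeper deployment):

```shell
# List ZooKeeper pods and confirm they are Running (label is an assumption)
kubectl get pods -l app=zookeeper -n <namespace>

# Optionally probe ZooKeeper's "ruok" four-letter word; a healthy server answers "imok"
# (requires nc to be available inside the container)
kubectl exec -n <namespace> zookeeper-0 -- sh -c "echo ruok | nc localhost 2181"
```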
- Build the anair/nifi-base:1.9.2-1 base image locally
- Decide which namespace to deploy to
- Log in to the OpenShift docker registry
- Tag the image with the OpenShift namespace
- Push the image to the OpenShift project
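The image steps above can be sketched as follows; the `<registry-host>` and `<namespace>` placeholders and the Dockerfile location are assumptions you will need to adjust:

```shell
# Build the custom base image locally (assumes a Dockerfile in the current directory)
docker build -t anair/nifi-base:1.9.2-1 .

# Log in to the OpenShift docker registry using your OpenShift token
docker login -u "$(oc whoami)" -p "$(oc whoami -t)" <registry-host>

# Tag the image with the OpenShift namespace and push it to the project
docker tag anair/nifi-base:1.9.2-1 <registry-host>/<namespace>/nifi-base:1.9.2-1
docker push <registry-host>/<namespace>/nifi-base:1.9.2-1
```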
- Create a ConfigMap named nifi-conf with a logback.xml entry. Copy the contents of the custom conf/logback.xml into the ConfigMap under the key "logback.xml"
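Assuming the custom file lives at conf/logback.xml, the ConfigMap can be created in one command:

```shell
# Create the nifi-conf ConfigMap with the file content stored under the key "logback.xml"
kubectl create configmap nifi-conf \
  --from-file=logback.xml=conf/logback.xml \
  -n <namespace>
```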
- Create a Route/Ingress mapped to port 8080. This will be the external-facing URL.
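On OpenShift this can be a one-liner (the service name `nifi` is an assumption about your deployment):

```shell
# Expose the NiFi service on port 8080 as the external-facing URL
oc expose service nifi --port=8080
```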
- Open and review the 3-node statefulset:
a. Update the image name
b. Update the namespace
c. Review mounts and PVCs
d. Review and update the ZooKeeper cluster nodes
e. Update the cluster name
f. Review the anti-affinity rule to make sure the NiFi nodes run on different K8s worker nodes
- Deploy the statefulset
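The anti-affinity rule from step f typically looks like the snippet below in the statefulset's pod template (the `app: nifi` label is an assumption about the pod labels):

```yaml
affinity:
  podAntiAffinity:
    requiredDuringSchedulingIgnoredDuringExecution:
      - labelSelector:
          matchLabels:
            app: nifi
        topologyKey: kubernetes.io/hostname
```

With `requiredDuringSchedulingIgnoredDuringExecution`, the scheduler will refuse to co-locate two NiFi pods on the same worker node rather than merely preferring to spread them.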
- Verify logs and pod health
Logs will be streamed to STDOUT in JSON format that is easily consumable by Logstash/Fluentd. Every log entry will carry a custom attribute "logtype" which can be used as a filter and alert criterion in Kibana.
- NiFi bootstrap activity will set logtype to "bootstrap". This happens on startup
- User activity, such as accessing flows or stopping/starting flows, will set logtype to "user"
- Regular flow activity will set logtype to "app". This will be the most common logtype, driven by business activity
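As an example of using logtype as a filter criterion, a minimal Logstash filter sketch (the tag names and the `level` field are assumptions, not part of the shipped configuration):

```
filter {
  if [logtype] == "app" {
    mutate { add_tag => ["nifi-flow"] }
  }
  if [logtype] == "bootstrap" and [level] == "ERROR" {
    mutate { add_tag => ["nifi-startup-failure"] }
  }
}
```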
- Import the custom NiFi Kibana dashboard at logging/kibana/visualization/nifi-error-by-pod.json into Kibana to monitor errors by pod.
- Click on "Controller Settings"
- Click on "Reporting Tasks" tab
- Add new reporting task
- Type "prom" in the UI filter
- Select "PrometheusReportingTask" and click Add button
- Edit "PrometheusReportingTask" properties a. Add Prometheus PushGateway url b. Update "The job name" is the job name that will be displayed in Prometheus
- Monitor CPU, memory and filesystem usage through an external monitoring system like Nagios or Zabbix
- Cluster health:
curl -ks https://NIFI_HOST/nifi-api/flow/cluster/summary | grep -c "{\"clusterSummary\":{\"connectedNodes\":\"3 \/ 3\",\"connectedNodeCount\":3,\"totalNodeCount\":3,\"connectedToCluster\":true,\"clustered\":true}}"
If the response is 0, the cluster is unhealthy.
- Cluster JVM diagnostics:
curl -ks "http://NIFI_HOST/nifi-api/system-diagnostics"
- With the Prometheus Reporting Task, NiFi JVM diagnostics are pushed to Prometheus and can be visualized in Grafana. Refer to the attached NiFi Grafana dashboard in monitoring/grafana/dashboard/nifi.json