Skip to content

jveverka/data-lab

Repository files navigation

License Java11 Gradle Build Status

Data Lab Project

Data Lab Project provides advanced analytics and query services on various document sources like images, video streams, text documents, file system. This project is work in progress. datalab

Features

  • File system indexing - queries on file system meta-data
  • Image meta-data indexing - queries on exif and geo-location meta data.
  • Video meta-data indexing - queries on exif and geo-location meta data.
  • Image content object recognition - queries on objects contained in images.

Microservices

Components

Architecture

architecture

Technology stack

  • Microservices - REST, Message Broker integrations, K8s, WIP
  • ElasticSearch 7.15.x - main meta-data database
  • Kibana 7.15.x - basic data visualizations
  • RabbitMQ 3.8 - message broker
  • Java 11 - microservice implementations
  • Python 3.8.x - microservice implementations
  • TensorFlow 2.0 / Keras - ML related tasks
  • Gradle 7.2.x - build system or later
  • Ubuntu 20.04 LTS - default target environment

Build, Test and Run

gradle clean installDist distZip test

Follow this user guide to run microservices locally.