Arango ElasticSearch Syncer

Description

NodeJs app to sync data from arangodb to elasticsearch. The app works as a service and will continously run on the server. dockerfile is included if you want to deploy it as a container.

In most of the cases easiest way to steup ETL pipeline between your database and elasticsearch is, using logstash. Many popular databases like mongodb, couchbase etc. provides a logstash input plugin to listen db changes. However there is no official logstash input plugin for arangodb which you can use to setup ETL pipeline to push db changes to es.

This project provides a good starting point to write down your own solution to sync db between elasticsearch and arangodb. It uses latest WAL access API to listen database changes. The project is not very much polished but can act as a good starting point.

Configurations

Configuration are passed via environment variables. During development you can use .env file to pass configurations. During development configs are loaded with dotenv package. However on production environment it expects configs via Environment variables.

Files

Index.js - Starting point. It creates global http agent and elasticsearch client which are re-used in every run.
syncer-state.service.ts - Used to save syncer state. fromTick and lastScannedTick, in case process stops. You can implement graceful shutodown and save syncer state before shutting down the process.
wal-tail.reader.ts - WAL tail reader which reads WAL tail logs incrementally and emit them for further consuming (transformation and es indexing).
tail.transformer.ts - Tail logs comes in form of ndjson. You can filter only document update, delete and create changes based on type. Based on cuid (collection id) you can figure out document type.
es.indexer.ts - Used to index changes via Bulk API. Bulk API also accepts NDJSON.

Installation

$ npm install

Running the app

# development
$ npm run start:local

# watch mode
$ npm run start:dev

# debug mode
$ npm run start:debug

# production mode
$ npm run start:prod

Stay in touch

Author - Neeraj Kumar

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.vscode		.vscode
src		src
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
arango.ca.pem		arango.ca.pem
nodemon-debug.json		nodemon-debug.json
nodemon.json		nodemon.json
package-lock.json		package-lock.json
package.json		package.json
tsconfig.build.json		tsconfig.build.json
tsconfig.json		tsconfig.json
tslint.json		tslint.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Arango ElasticSearch Syncer

Description

Configurations

Files

Installation

Running the app

Stay in touch

About

Releases

Packages

Languages

neerajyadav/arango-es-syncer

Folders and files

Latest commit

History

Repository files navigation

Arango ElasticSearch Syncer

Description

Configurations

Files

Installation

Running the app

Stay in touch

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages