Kafka Connect Mongodb

The connector is used to load data both from Kafka to MongoDB and from MongoDB to Kafka.

Building

You can build the connector with Maven using the standard lifecycle phases:

mvn clean
mvn package

Source Connector

When the connector is run as a Source Connector, it reads data from the MongoDB oplog and publishes it to Kafka. Three types of messages are read from the oplog:

  • Insert
  • Update
  • Delete

For every message, a SourceRecord is created with the following schema (a sketch of how such a record could be built follows the field descriptions below):

{
  "type": "record",
  "name": "schemaname",
  "fields": [
    {
      "name": "timestamp",
      "type": [
        "null",
        "int"
      ]
    },
    {
      "name": "order",
      "type": [
        "null",
        "int"
      ]
    },
    {
      "name": "operation",
      "type": [
        "null",
        "string"
      ]
    },
    {
      "name": "database",
      "type": [
        "null",
        "string"
      ]
    },
    {
      "name": "object",
      "type": [
        "null",
        "string"
      ]
    }
  ],
  "connect.name": "stillmongotesting"
}
  • timestamp: timestamp in seconds at which the event happened
  • order: order of the event among events with the same timestamp
  • operation: type of operation the message represents: i for insert, u for update, d for delete
  • database: database in which the operation took place
  • object: the inserted, updated, or deleted object
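As an illustration only (not the connector's actual code), a record with this schema could be assembled with the Kafka Connect data API roughly as follows; the topic name and the partition/offset keys used here are assumptions:

import java.util.Collections;
import java.util.Map;

import org.apache.kafka.connect.data.Schema;
import org.apache.kafka.connect.data.SchemaBuilder;
import org.apache.kafka.connect.data.Struct;
import org.apache.kafka.connect.source.SourceRecord;

public class OplogRecordSketch {

    // Connect equivalent of the schema above: every field is optional (union with null).
    static final Schema VALUE_SCHEMA = SchemaBuilder.struct()
            .name("mongodbschema_mydb_test1")
            .field("timestamp", Schema.OPTIONAL_INT32_SCHEMA)
            .field("order", Schema.OPTIONAL_INT32_SCHEMA)
            .field("operation", Schema.OPTIONAL_STRING_SCHEMA)
            .field("database", Schema.OPTIONAL_STRING_SCHEMA)
            .field("object", Schema.OPTIONAL_STRING_SCHEMA)
            .build();

    static SourceRecord toSourceRecord(String topic, int timestamp, int order,
                                       String operation, String database, String objectJson) {
        Struct value = new Struct(VALUE_SCHEMA)
                .put("timestamp", timestamp)
                .put("order", order)
                .put("operation", operation)   // "i", "u" or "d"
                .put("database", database)     // e.g. "mydb.test1"
                .put("object", objectJson);    // the affected document as a JSON string

        // Partition and offset keys here are illustrative only.
        Map<String, ?> sourcePartition = Collections.singletonMap("database", database);
        Map<String, ?> sourceOffset = Collections.singletonMap("timestamp", timestamp);

        return new SourceRecord(sourcePartition, sourceOffset, topic, VALUE_SCHEMA, value);
    }
}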

Sample Configuration

name=mongodb-source-connector
connector.class=org.apache.kafka.connect.mongodb.MongodbSourceConnector
tasks.max=1
host=127.0.0.1
port=27017
batch.size=100
schema.name=mongodbschema
topic.prefix=optionalprefix
databases=mydb.test1,mydb.test2,mydb.test3
  • name: name of the connector
  • connector.class: class of the implementation of the connector
  • tasks.max: maximum number of tasks to create
  • host: MongoDB host
  • port: MongoDB port
  • batch.size: maximum number of messages to write to Kafka at every poll() call
  • schema.name: name to use for the schema; it will be formatted as {schema.name}_{database}_{collection}
  • topic.prefix: optional prefix to prepend to the topic names; each topic name is formatted as {topic.prefix}_{database}_{collection} (see the sketch below)
  • databases: comma-separated list of collections (as database.collection) from which to import data
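For example, with the sample values above, the topic and schema names could be composed roughly like this (a sketch of the naming convention only, not the connector's code):

public class TopicNameSketch {
    public static void main(String[] args) {
        // Values from the sample configuration above.
        String topicPrefix = "optionalprefix";
        String schemaName = "mongodbschema";
        String database = "mydb.test1";                   // one database.collection entry

        String suffix = database.replace('.', '_');       // mydb_test1
        System.out.println(topicPrefix + "_" + suffix);   // optionalprefix_mydb_test1
        System.out.println(schemaName + "_" + suffix);    // mongodbschema_mydb_test1
    }
}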

Sink Connector

When the connector is run as a Sink Connector, it retrieves messages from Kafka and writes them to MongoDB collections. The structure of the written documents is derived from the schema of the messages.
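A minimal sketch of that derivation, assuming flat record values and assuming the MongoDB Java driver is on the classpath (the connector's actual mapping may handle nested or schemaless values differently):

import org.apache.kafka.connect.data.Field;
import org.apache.kafka.connect.data.Struct;
import org.apache.kafka.connect.sink.SinkRecord;
import org.bson.Document;

public class SinkConversionSketch {

    // Flatten a record's Struct value field-by-field into a BSON Document.
    static Document toDocument(SinkRecord record) {
        Struct value = (Struct) record.value();
        Document document = new Document();
        for (Field field : value.schema().fields()) {
            document.append(field.name(), value.get(field));
        }
        return document;
    }
}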

Sample Configuration

name=mongodb-sink-connector
connector.class=org.apache.kafka.connect.mongodb.MongodbSinkConnector
tasks.max=1
host=127.0.0.1
port=27017
bulk.size=100
mongodb.database=databasetest
mongodb.collections=mydb_test1,mydb_test2,mydb_test3
topics=optionalprefix_mydb_test1,optionalprefix_mydb_test2,optionalprefix_mydb_test3
  • name: name of the connector
  • connector.class: class of the implementation of the connector
  • tasks.max: maximum number of tasks to create
  • host: MongoDB host
  • port: MongoDB port
  • bulk.size: maximum number of documents to write to MongoDB at every put() call
  • mongodb.database: database to use
  • mongodb.collections: comma-separated list of collections to which the documents are written
  • topics: comma-separated list of topics whose messages are written to MongoDB

The number of collections and the number of topics should be the same (see the sketch below).
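Assuming the two lists are paired by position, as the matching names in the sample configuration suggest, the mapping amounts to:

import java.util.HashMap;
import java.util.Map;

public class TopicCollectionMappingSketch {
    public static void main(String[] args) {
        // Values from the sample configuration above; the lists are assumed to be
        // paired by position, which is why they must have the same length.
        String[] topics = "optionalprefix_mydb_test1,optionalprefix_mydb_test2,optionalprefix_mydb_test3".split(",");
        String[] collections = "mydb_test1,mydb_test2,mydb_test3".split(",");

        Map<String, String> topicToCollection = new HashMap<>();
        for (int i = 0; i < topics.length; i++) {
            topicToCollection.put(topics[i], collections[i]);
        }
        System.out.println(topicToCollection);
    }
}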
