
Bullet DSL


A DSL that lets users plug their data sources into Bullet (Spark, Storm, etc.)


Background

Bullet is a streaming query engine that can be plugged into any single data stream using a stream-processing framework like Apache Storm, Spark, or Flink. It lets you run queries on this data stream, including hard queries like Count Distinct and Top K. The main project is available here.

Install

Bullet DSL is a library written in Java and published to Maven Central.

Versions 1.1.3 and below can be found on Bintray and are mirrored to JCenter. To see the various versions and to set up your project for your package manager (Maven, Gradle, etc.), see here.

Usage

Bullet DSL consists of two major components: the BulletConnector and the BulletRecordConverter. The BulletConnector reads data (objects) from a pluggable data source, while the BulletRecordConverter converts those objects into BulletRecords. There is also an optional component, the BulletDeserializer, that translates the BulletConnector output into the appropriate BulletRecordConverter input.

Bullet Storm, Bullet Spark, and other backends provide a reading component that uses BulletConnector and BulletRecordConverter, so users only need to supply configuration rather than write code themselves.

Below are some examples for users who wish to use BulletConnector or BulletRecordConverter directly.

BulletConnector

The currently implemented BulletConnectors are KafkaConnector and PulsarConnector, which support Apache Kafka and Apache Pulsar, respectively.

Example usage:

BulletDSLConfig config = new BulletDSLConfig();
BulletConnector connector = BulletConnector.from(config);
// Initialize the connector before reading any data
try {
    connector.initialize();
} catch (BulletDSLException e) {
    // handle exception
}
...
// Read a batch of messages from the data source
List<Object> messages;
try {
    messages = connector.read();
} catch (BulletDSLException e) {
    // handle exception
}
...
// Release any resources held by the connector
try {
    connector.close();
} catch (Exception e) {
    // handle exception
}

BulletDeserializer

This is an optional layer that can be configured in the Bullet backends that support it. If one is not needed, the IdentityDeserializer can be used. A BulletDeserializer is primarily needed when the connector reads serialized bytes or some other custom format that must be converted into an input format the converter supports. For example, suppose you were reading POJOs as Kafka messages and needed to convert them to BulletRecords, but the POJOs were serialized to raw bytes before being ingested into Kafka, and Kafka itself is not aware that they are POJOs. The KafkaConnector would produce the raw serialized bytes of each POJO, which cannot be fed directly into the POJOBulletRecordConverter. You could then use the JavaDeserializer to reify those bytes back into POJOs that the POJOBulletRecordConverter can convert.
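To make the POJO scenario above concrete, here is a minimal, self-contained sketch of the round trip that Java serialization performs (the mechanism JavaDeserializer relies on). The Event class is a hypothetical POJO for illustration and is not part of Bullet DSL:

```java
import java.io.*;

public class JavaDeserializerSketch {
    // A simple Serializable POJO standing in for a record ingested into Kafka.
    public static class Event implements Serializable {
        private static final long serialVersionUID = 1L;
        public final String name;
        public final int count;
        public Event(String name, int count) { this.name = name; this.count = count; }
    }

    // Serialize as a producer might before writing raw bytes to Kafka.
    public static byte[] serialize(Object o) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(o);
        }
        return bytes.toByteArray();
    }

    // Reify the bytes back into the POJO, so a POJOBulletRecordConverter
    // could then convert the restored object.
    public static Object deserialize(byte[] data) throws IOException, ClassNotFoundException {
        try (ObjectInputStream in = new ObjectInputStream(new ByteArrayInputStream(data))) {
            return in.readObject();
        }
    }

    public static void main(String[] args) throws Exception {
        byte[] raw = serialize(new Event("click", 3));
        Event restored = (Event) deserialize(raw);
        System.out.println(restored.name + " " + restored.count); // prints "click 3"
    }
}
```

The connector would hand you the equivalent of raw here; the deserializer's job is the deserialize step in the middle.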

BulletRecordConverter

The currently implemented BulletRecordConverters are AvroBulletRecordConverter, MapBulletRecordConverter, and POJOBulletRecordConverter. These converters support converting Apache Avro records, maps, and POJOs to BulletRecords.

Note that BulletRecordConverter can be used with or without a BulletRecordSchema; the schema can be specified in the configuration as a JSON file. If a schema is provided, the types it specifies are used to convert your source data records into BulletRecords without any type discovery (although you can enable type discovery even when you provide a schema).
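For instance, pointing the converter at a schema file might look like the following configuration fragment. The property key shown is an assumption based on the library's naming conventions; check the defaults file shipped with your version of Bullet DSL for the exact name:

```yaml
# Hypothetical configuration fragment; the exact property key may differ.
bullet.dsl.converter.schema.file: "my_schema.json"
```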

Example usage:

BulletDSLConfig config = new BulletDSLConfig();
BulletRecordConverter converter = BulletRecordConverter.from(config);
try {
    BulletRecord record = converter.convert(object);
} catch (BulletDSLException e) {
    // handle exception
}

BulletRecordSchema

A BulletRecordSchema is an array of objects, where each object is a BulletRecordField consisting of a name, a reference, and a type.

When an object is converted, the names of the fields in the resulting BulletRecord are specified by the schema, and the corresponding values are extracted using the corresponding references. If a reference is null, the name is used as the reference instead.

The values for the possible types are the same as the valid types defined in Bullet Record:

  • BOOLEAN
  • INTEGER
  • LONG
  • FLOAT
  • DOUBLE
  • STRING
  • BOOLEAN_MAP
  • INTEGER_MAP
  • LONG_MAP
  • FLOAT_MAP
  • DOUBLE_MAP
  • STRING_MAP
  • BOOLEAN_MAP_MAP
  • INTEGER_MAP_MAP
  • LONG_MAP_MAP
  • FLOAT_MAP_MAP
  • DOUBLE_MAP_MAP
  • STRING_MAP_MAP
  • BOOLEAN_LIST
  • INTEGER_LIST
  • LONG_LIST
  • FLOAT_LIST
  • DOUBLE_LIST
  • STRING_LIST
  • BOOLEAN_MAP_LIST
  • INTEGER_MAP_LIST
  • LONG_MAP_LIST
  • FLOAT_MAP_LIST
  • DOUBLE_MAP_LIST
  • STRING_MAP_LIST

Note that there is a special case: if you omit both the type and the name for an entry in the schema, the reference is assumed to point to a map containing arbitrary fields whose types are in the list above. You can use this if you have a map field containing objects of one or more of those types and want to flatten that map into the target record, preserving the type of each field. The names of the fields in the map are used as the top-level names in the resulting record.

Example schema and fields:

[
  {
    "name": "myBool",
    "type": "BOOLEAN"
  },
  {
    "name": "myBoolMap",
    "type": "BOOLEAN_MAP"
  },
  {
    "name": "myLongMapMap",
    "type": "LONG_MAP_MAP"
  },
  {
    "name": "myIntFromSomeMap",
    "reference": "someMap.myInt",
    "type": "INTEGER"
  },
  {
    "name": "myIntFromSomeIntList",
    "reference": "someIntList.0",
    "type": "INTEGER"
  },
  {
    "name": "myIntFromSomeNestedMapsAndLists",
    "reference": "someMap.nestedMap.nestedList.0",
    "type": "INTEGER"
  },
  {
    "reference" : "someMap"
  }
]
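The dotted references above (e.g. someMap.myInt or someIntList.0) index into nested maps and lists. As a simplified illustration of that lookup behavior, and not the library's actual implementation, the following sketch resolves such a reference against plain Java collections:

```java
import java.util.List;
import java.util.Map;

public class ReferenceDemo {
    // Resolve a dot-separated reference like "someMap.nestedList.0" against
    // nested Maps and Lists: map tokens are keys, list tokens are indices.
    @SuppressWarnings("unchecked")
    public static Object resolve(Object source, String reference) {
        Object current = source;
        for (String token : reference.split("\\.")) {
            if (current instanceof Map) {
                current = ((Map<String, Object>) current).get(token);
            } else if (current instanceof List) {
                current = ((List<Object>) current).get(Integer.parseInt(token));
            } else {
                return null; // cannot descend any further
            }
        }
        return current;
    }

    public static void main(String[] args) {
        Map<String, Object> record = Map.of(
            "someMap", Map.of("myInt", 42,
                              "nestedMap", Map.of("nestedList", List.of(7, 8))),
            "someIntList", List.of(1, 2, 3)
        );
        System.out.println(resolve(record, "someMap.myInt"));                  // prints 42
        System.out.println(resolve(record, "someIntList.0"));                  // prints 1
        System.out.println(resolve(record, "someMap.nestedMap.nestedList.0")); // prints 7
    }
}
```

Each of these lookups corresponds to one of the reference entries in the example schema above.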

Documentation

All documentation is available at GitHub Pages here.

Links

  • Bullet DSL to see the complete DSL documentation.

Contributing

All contributions are welcome! Feel free to submit PRs for bug fixes, improvements, or anything else you like! Submit issues and ask questions using GitHub issues as usual, and we will classify them accordingly. See Contributing for a more in-depth policy. We just ask you to respect our Code of Conduct while you're here.

License

Code licensed under the Apache 2 license. See the LICENSE for terms.