
Xalgorithms/services-il-jobs


License: AGPL v3

This is a collection of Spark jobs that implement the document pipeline. The jobs use Cassandra, Kafka streams, and MongoDB as sources of information; rule output is written to several different Kafka topics.

Executing jobs

There are a few scripts in the project that help execute Spark jobs against a Kubernetes or local Spark master. These scripts take care of specifying the dependent JARs required by the jobs. They read from configuration files (plain bash files that are sourced into the executing script) that provide additional configuration per job and per deployment target; a sketch of one such file appears after the examples below. Each script merely takes the name of the job to run:

# execute locally and show the log
$ ./local-spark-submit.sh <name>

# deploy to Kubernetes Spark master
$ ./deploy-kube.sh <name>
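
For reference, a per-job configuration file might look like the following. This is a minimal sketch only; the file path, variable names, and package versions are illustrative assumptions, not the actual names the submit scripts read:

# cfg/EffectiveRules.sh -- sourced by the submit scripts (names are assumptions)
# fully-qualified main class of the job
JOB_CLASS="org.xalgorithms.jobs.EffectiveRules"
# dependent packages resolved at submit time (Kafka source, Cassandra connector)
SPARK_PACKAGES="org.apache.spark:spark-sql-kafka-0-10_2.11:2.2.0,datastax:spark-cassandra-connector:2.0.1-s_2.11"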

Available jobs

  • ApplicableRules: Runs a job that receives JSON input from Kafka that includes a reference to a document and a rule id. If the job determines that the rule is applicable to the document, it pushes the input data to the next topic in the pipeline.

  • ValidateApplicableRules: Same as ApplicableRules, except that it uses the Kafka topics that the CLI uses for validation.

  • EffectiveRules: Runs a job that receives JSON input from Kafka that includes a reference to a document. The job determines all of the rules that are effective for the document using the metadata stored in Cassandra. The document reference and rule id are pushed to a Kafka topic that the ApplicableRules job subscribes to. (A sample input message appears after this list.)

  • ValidateEffectiveRules: Same as EffectiveRules, except that it uses the Kafka topics that the CLI uses for validation.
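
To exercise the head of the pipeline by hand, a document reference can be published to the topic that EffectiveRules consumes, using Kafka's stock console producer. This is a sketch under stated assumptions: the topic name and the JSON field are guesses based on the descriptions above, not values confirmed by this project:

# topic name and message field below are illustrative assumptions
$ echo '{"document_id": "5a0c7e43"}' | \
    kafka-console-producer.sh --broker-list localhost:9092 --topic il.documents

If accepted, EffectiveRules would look up the effective rules in Cassandra and emit one (document reference, rule id) pair per rule for ApplicableRules to consume.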

About

Interlibr uses Spark to handle compute-intensive tasks. All of our Spark jobs are kept in this project.
