something to help you spark
Pull request Compare This branch is 27 commits behind radanalyticsio:develop.
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
extras
project
src
.gitignore
.travis.yml
LICENSE
README.md
build.sbt

README.md

silex

something to help you spark

This is a library of reusable code for Spark applications, factored out of applications we've built at Red Hat. It will grow in the future but for now we have an application skeleton, some useful extensions to data frames and RDDs, utility functions for handling time and stemming text, and helpers for feature extraction.

Using silex

Add the following resolver to your project:

resolvers += "Will's bintray" at "https://dl.bintray.com/willb/maven/"

and then add Silex as a dependency:

libraryDependencies += "com.redhat.et" %% "silex" % "0.1.2"

Since version 0.0.9, Silex is built for both Scala 2.10 and Scala 2.11. Since version 0.1.0, Silex depends on Spark 2.0.

Documentation

The Silex web site includes some examples of Silex functionality in use and API docs.

Notes for developers

To cut a new release, use the git flow release workflow.

  1. Start a new release branch with git flow release start x.y.z
  2. Incorporate any release-specific patches that do not belong on the develop branch
  3. Bump version numbers in the README, build definition, and Jekyll configuration.
  4. Run tests for every cross build: sbt +test
  5. Publish binary artifacts to bintray for each cross-build: sbt +publish
  6. Publish an updated site for the project: sbt ghpages-push-site

CI Status

Build Status Coverage Status