ccri/accumulo-scalding

accumulo-scalding

Cascading bindings that let Accumulo be used as a source/sink in Cascading and Scalding jobs.

Notional example usage:

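import java.nio.ByteBuffer
import com.twitter.scalding._
import org.apache.accumulo.core.data.{Key, Value}
import org.apache.hadoop.io.Text

class TestScaldingJob(args: Args) extends Job(args) {
  lazy val out = new AccumuloSource(args("instance"), args("table"), args("outputDir"))

  // The "input" argument name is an assumption; the original left the input path undefined.
  TextLine(args("input")).flatMap('line -> ('key, 'value)) { line: String =>
    // Emit one (Key, Value) pair per distinct token, with its count stored as an 8-byte long.
    line.split(",").groupBy(identity).map { case (k, v) =>
      (new Key(new Text(k)), new Value(ByteBuffer.allocate(8).putLong(v.length).array()))
    }
  }.groupBy('key) { _.sortBy('value).reducers(32) }.write(out)
}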
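
The per-line transformation in the job above can be tried without a Hadoop or Accumulo setup. The following is a minimal sketch in plain Scala, not part of this library: the object name `TokenCountSketch` and its helper functions are hypothetical, and it assumes the intent is to count comma-separated tokens and store each count as an 8-byte big-endian long (the bytes a `Value` would wrap).

```scala
import java.nio.ByteBuffer

// Hypothetical helper object for illustration only; not part of accumulo-scalding.
object TokenCountSketch {
  // Count how often each comma-separated token occurs in a single line.
  def tokenCounts(line: String): Map[String, Long] =
    line.split(",").groupBy(identity).map { case (token, hits) =>
      token -> hits.length.toLong
    }

  // Encode a count as 8 big-endian bytes, the payload an Accumulo Value could wrap.
  def encode(count: Long): Array[Byte] =
    ByteBuffer.allocate(8).putLong(count).array()

  // Decode the 8-byte payload back into a count.
  def decode(bytes: Array[Byte]): Long =
    ByteBuffer.wrap(bytes).getLong
}
```

For instance, `TokenCountSketch.tokenCounts("a,b,a")` yields a map with `a` counted twice and `b` once, and `decode(encode(n))` returns `n`. Keeping the encoding in small pure functions like these would make it possible to unit test the byte layout separately from the job wiring.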
