P. Oscar Boykin edited this page Jul 19, 2016 · 94 revisions

Scalding is a Scala library that makes it easy to write MapReduce jobs in Hadoop. It's similar to other MapReduce platforms like Pig and Hive, but offers a higher level of abstraction by leveraging the full power of Scala and the JVM.

Scalding is built on top of Cascading, a Java library that abstracts away much of the complexity of Hadoop (such as the need to write raw map and reduce functions).
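To give a feel for that level of abstraction, here is a minimal word-count sketch using Scalding's Typed API. The job name `WordCountJob`, the `tokenize` helper, and the `input`/`output` argument names are illustrative assumptions rather than anything defined on this page, and the snippet assumes the Scalding library is on the classpath.

```scala
import com.twitter.scalding._

// Hypothetical job: counts how often each word occurs in a text file.
// "input" and "output" are assumed argument names chosen for this sketch.
class WordCountJob(args: Args) extends Job(args) {
  TypedPipe.from(TextLine(args("input")))          // one String per line of input
    .flatMap(line => WordCountJob.tokenize(line))  // split each line into words
    .groupBy(word => word)                         // group identical words together
    .size                                          // count the members of each group
    .toTypedPipe                                   // back to a pipe of (word, count) pairs
    .write(TypedTsv[(String, Long)](args("output")))
}

object WordCountJob {
  // Lowercase the text, drop punctuation, and split on whitespace.
  def tokenize(text: String): List[String] =
    text.toLowerCase
      .replaceAll("[^a-z0-9\\s]", "")
      .split("\\s+")
      .filter(_.nonEmpty)
      .toList
}
```

No mapper or reducer class is written by hand here: the grouping and counting steps read like ordinary Scala collection operations, and Scalding and Cascading plan the underlying MapReduce work. The job would be launched with `--input` and `--output` arguments pointing at the source text and the destination path.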

Need a suggestion for where to start? Try the Alice in Wonderland walkthrough, which shows step by step how to use Scalding to explore the book's text.

Getting help

Documentation

Third Party Modules

Videos

How-tos

Tutorials

NOTE: All of the following tutorials use the Fields API, which is deprecated.

Articles and presentations from around the web

Other

Documentation Todo
