The examples here are basic and intended for newcomers to Scala and Spark.
Many developers who work with Spark want custom APIs on top of it: for example, an integration between Spark and other software, or simple customized APIs that can be modularized and reused as a third-party library. This library collects some examples of such custom APIs; it can be built as a JAR and imported into Spark as a third-party dependency. A sketch of how these extensions can be implemented follows the list below.
- `customPipe(..)`, equivalent to `pipe(..)` in `RDD`:

  ```scala
  import com.company.spark.custom._

  val rdd = sc.parallelize(Seq(1, 2, 3))
  rdd.customPipe("cat").collect()
  ```
- `customCount()`, equivalent to `count()` in `DataFrame`:

  ```scala
  import com.company.spark.custom._
  import sqlContext.implicits._ // needed for toDF outside spark-shell

  val data = Seq(1, 2, 3, 4, 5)
  val rdd = sc.parallelize(data)
  val df = rdd.toDF
  df.customCount()
  ```
- `customTextFile(..)`, equivalent to `textFile(..)` in `SparkContext`:

  ```scala
  import com.company.spark.custom._

  val path = "path-to-file"
  sc.customTextFile(path)
  ```
- `customLoadJsonRDD(..)`, equivalent to `jsonRDD(..)` in `SQLContext`:

  ```scala
  import com.company.spark.custom._

  val jsonRDD = sc.parallelize(
    """{"a": 1}""" :: """{"a": 2}""" :: Nil)
  sqlContext.customLoadJsonRDD(jsonRDD)
  ```
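The library's actual source is not shown here, but extension methods like the ones above are commonly added with Scala's implicit-class (enrichment) pattern. The following is a minimal sketch, not the library's real implementation; the package-object layout and the name `CustomRDDOps` are assumptions chosen to match the `import com.company.spark.custom._` line used in the examples:

```scala
package com.company.spark

import org.apache.spark.rdd.RDD

// Extension methods become available to callers via
// `import com.company.spark.custom._`.
package object custom {

  // Hypothetical sketch: enriches RDD with customPipe(..). Here it simply
  // delegates to the built-in pipe(..); a real implementation could add
  // logging, metrics, or validation around the call.
  implicit class CustomRDDOps[T](rdd: RDD[T]) {
    def customPipe(command: String): RDD[String] = rdd.pipe(command)
  }
}
```

Once the implicit class is in scope, `rdd.customPipe("cat")` compiles as if `customPipe` were defined on `RDD` itself.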
To run the tests locally, simply run:

```
./dev/run-tests
```
This library is built with SBT, which is automatically downloaded by the included shell script. To build a JAR file, run the following from the project root:

```
sbt/sbt package
```

The build configuration includes support for both Scala 2.10 and 2.11.
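Once packaged, the JAR can be attached to a Spark shell session with the standard `--jars` flag. The artifact name below is hypothetical; the actual name depends on the project name, Scala version, and version set in the build:

```
# Hypothetical artifact name; adjust to match the actual `sbt/sbt package` output.
spark-shell --jars target/scala-2.10/spark-custom_2.10-0.1.0.jar
```

After that, `import com.company.spark.custom._` works in the shell just as in the examples above.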