Skip to content

Latest commit

 

History

History
14 lines (10 loc) · 1.12 KB

building_streaming_recommendation_engines_on_spark.adoc

File metadata and controls

14 lines (10 loc) · 1.12 KB

Building Streaming Recommendation Engines on Spark

Collaborative filtering is a well known method to implement recommendation engines. Although modern techniques, such as Alternating Least Squares (ALS), allow us to perform rating predictions with large amounts of observations, typically ALS is implemented as a distributed batch algorithm where retraining must be performed with the entirety of the data. However, when dealing with large amounts of data as a stream, batch retraining might be problematic.

In this talk Rui will guide us in building a streaming ALS implementation using Apache Spark and based on Stochastic Gradient Descent, where training can be performed using observations as they arrive.

The advantages of real-time streaming collaborative filtering will be discussed as well as the scenarios where batch ALS might be preferable.