Optimized Analytics Package for Spark* Platform (OAP)
* LEGAL NOTICE: Your use of this software and any required dependent software (the "Software Package") is subject to the terms and conditions of the software license agreements for the Software Package, which may also include notices, disclaimers, or license terms for third party or open source software included in or with the Software Package, and your use indicates your acceptance of all such terms. Please refer to the "TPP.txt" or other similarly-named text file included with the Software Package for additional details.
OAP is a project that optimizes Spark by providing optimized implementations of packages covering several areas, including cache, shuffle, native SQL engine, and MLlib, among others. This version of OAP contains optimized implementations of SQL Index and Data Source Cache (supporting DRAM and PMem), RDD Cache PMem Extension, Shuffle Remote PMem Extension, Remote Shuffle, Intel MLlib, Unified Arrow Data Source, and Native SQL Engine.
Please follow the link below for the guide to compiling and installing OAP on your system.
Please refer to the corresponding documents below for an introduction to each feature and how to use it.
- SQL Index and Data Source Cache
- RDD Cache PMem Extension
- Shuffle Remote PMem Extension
- Remote Shuffle
- Intel MLlib
- Unified Arrow Data Source
- Native SQL Engine
Please follow the link below for the developer guide.