RAPIDS Accelerator For Apache Spark

NOTE: For the latest stable README.md ensure you are on the main branch. The RAPIDS Accelerator for Apache Spark provides a set of plugins for Apache Spark that leverage GPUs to accelerate processing via the RAPIDS libraries and UCX. Documentation on the current release can be found here.

The RAPIDS Accelerator for Apache Spark provides a set of plugins for Apache Spark that leverage GPUs to accelerate processing via the RAPIDS libraries and UCX.

To get started and try the plugin out use the getting started guide.

Compatibility

The SQL plugin tries to produce results that are bit for bit identical with Apache Spark. Operator compatibility is documented here

Tuning

To get started tuning your job and get the most performance out of it please start with the tuning guide.

Configuration

The plugin has a set of Spark configs that control its behavior and are documented here.

Issues

We use github issues to track bugs, feature requests, and to try and answer questions. You may file one here.

Download

The jar files for the most recent release can be retrieved from the download page.

Building From Source

See the build instructions in the contributing guide.

Testing

Tests are described here.

Integration

The RAPIDS Accelerator For Apache Spark does provide some APIs for doing zero copy data transfer into other GPU enabled applications. It is described here.

Currently, we are working with XGBoost to try to provide this integration out of the box.

You may need to disable RMM caching when exporting data to an ML library as that library will likely want to use all of the GPU's memory and if it is not aware of RMM it will not have access to any of the memory that RMM is holding.

Name		Name	Last commit message	Last commit date
Latest commit History 2,409 Commits
.github		.github
api_validation		api_validation
build		build
dist		dist
docs		docs
integration_tests		integration_tests
jenkins		jenkins
python/rapids		python/rapids
scripts		scripts
shims		shims
shuffle-plugin		shuffle-plugin
sql-plugin		sql-plugin
tests-spark310+		tests-spark310+
tests		tests
udf-compiler		udf-compiler
udf-examples		udf-examples
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
NOTICE		NOTICE
NOTICE-binary		NOTICE-binary
README.md		README.md
pom.xml		pom.xml
scalastyle-config.xml		scalastyle-config.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAPIDS Accelerator For Apache Spark

Compatibility

Tuning

Configuration

Issues

Download

Building From Source

Testing

Integration

About

Releases

Packages

Languages

License

jlowe/spark-rapids

Folders and files

Latest commit

History

Repository files navigation

RAPIDS Accelerator For Apache Spark

Compatibility

Tuning

Configuration

Issues

Download

Building From Source

Testing

Integration

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages