Skip to content
cuGraph
Branch: branch-0.8
Clone or download
Latest commit d488887 May 22, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
.github/ISSUE_TEMPLATE Update issue templates Jan 7, 2019
ci Merge pull request #271 from raydouglass/fix-remove-nvgraph May 6, 2019
conda Merge pull request #289 from rapidsai/branch-0.7 May 9, 2019
cpp Changed static_cast to reinterpret_cast where compiler complained May 22, 2019
datasets preparing for hibench test May 9, 2019
docs DOC Update version Apr 29, 2019
img images for README Feb 7, 2019
python comment out imports used only by commented out test May 8, 2019
thirdparty/mmio Removed rmm submodule Mar 6, 2019
.dockerignore Reorganized directory structure to match cuDf, added in files to matc… Feb 4, 2019
.gitattributes Reorganized directory structure to match cuDf, added in files to matc… Feb 4, 2019
.gitignore Quick and Dirty CMake fix and .gitignore Jan 25, 2019
.gitmodules removed cnmem module May 10, 2019
CHANGELOG.md Updated Change Log May 21, 2019
CONTRIBUTING.md Update CONTRIBUTING.md May 3, 2019
Dockerfile Dockerfile comment about source of "cudf" image Dec 6, 2018
LICENSE Added License file Dec 6, 2018
MANIFEST.in Updated repo files Feb 5, 2019
README.md readme fix May 8, 2019
conda_build.sh BLD Update conda recipes Mar 11, 2019
print_env.sh Reorganized directory structure to match cuDf, added in files to matc… Feb 4, 2019
readthedocs.yml Reorganized directory structure to match cuDf, added in files to matc… Feb 4, 2019
requirements.txt BLD Initial gpuCI scripts Feb 12, 2019
setup_pip.py BLD Update setup_pip.py Mar 11, 2019

README.md

 cuGraph - GPU Graph Analytics

The RAPIDS cuGraph library is a collection of graph analytics that process data found in GPU Dataframes - see cuDF. cuGraph aims to provide a NetworkX-like API that will be familiar to data scientists, so they can now build GPU-accelerated workflows more easily.

For more project details, see rapids.ai.

NOTE: For the latest stable README.md ensure you are on the master branch.

import cugraph

# assuming that data has been loaded into a cuDF (using read_csv) Dataframe
# create a Graph using the source (src) and destination (dst) vertex pairs the GDF  
G = cugraph.Graph()
G.add_edge_list(gdf["src"], gdf["dst"])

# Call cugraph.pagerank to get the pagerank scores
gdf_page = cugraph.pagerank(G)

for i in range(len(gdf_page)):
	print("vertex " + str(gdf_page['vertex'][i]) + 
		" PageRank is " + str(gdf_page['pagerank'][i]))  

Supported Algorithms:

Algorithm Scale Notes
PageRank Single-GPU
Jaccard Similarity Single-GPU
Weighted Jaccard Single-GPU
Overlap Similarity Single-GPU
SSSP Single-GPU
BSF Single-GPU
Triangle Counting Single-GPU
Subgraph Extraction Single-GPU
Spectral Clustering - Balanced-Cut Single-GPU
Spectral Clustering - Modularity Maximization Single-GPU
Louvain Single-GPU
Renumbering Single-GPU
Basic Graph Statistics Single-GPU

cuGraph 0.7 Notice

cuGraph version 0.7 has some limitations:

  • Only Int32 Vertex ID are supported
  • Only float (FP32) edge data is supported
  • Vertex numbering is assumed to start at zero

These limitations are being addressed and will be fixed future versions.

Getting cuGraph

Intro

There are 3 ways to get cuGraph :

  1. Quick start with Docker Demo Repo
  2. Conda Installation
  3. Build from Source

Quick Start

Please see the Demo Docker Repository, choosing a tag based on the NVIDIA CUDA version you’re running. This provides a ready to run Docker container with example notebooks and data, showcasing how you can utilize all of the RAPIDS libraries: cuDF, cuML, and cuGraph.

Conda

It is easy to install cuGraph using conda. You can get a minimal conda installation with Miniconda or get the full installation with Anaconda.

Install and update cuGraph using the conda command:

# CUDA 9.2
conda install -c nvidia -c rapidsai -c numba -c conda-forge -c defaults cugraph cudatoolkit=9.2

# CUDA 10.0
conda install -c nvidia -c rapidsai -c numba -c conda-forge -c defaults cugraph cudatoolkit=10.0

Note: This conda installation only applies to Linux and Python versions 3.6/3.7.

Build from Source and Contributing

Please see our guide for building and contributing to cuGraph.

Documentation

Python API documentation can be generated from docs directory.


Open GPU Data Science

The RAPIDS suite of open source software libraries aim to enable execution of end-to-end data science and analytics pipelines entirely on GPUs. It relies on NVIDIA® CUDA® primitives for low-level compute optimization, but exposing that GPU parallelism and high-bandwidth memory speed through user-friendly Python interfaces.

Apache Arrow on GPU

The GPU version of Apache Arrow is a common API that enables efficient interchange of tabular data between processes running on the GPU. End-to-end computation on the GPU avoids unnecessary copying and converting of data off the GPU, reducing compute time and cost for high-performance analytics common in artificial intelligence workloads. As the name implies, cuDF uses the Apache Arrow columnar data format on the GPU. Currently, a subset of the features in Apache Arrow are supported.

You can’t perform that action at this time.