Skip to content

Commit

Permalink
Add low_memory option to FAQ
Browse files Browse the repository at this point in the history
  • Loading branch information
lmcinnes committed Sep 9, 2019
1 parent ded60d0 commit 439db74
Showing 1 changed file with 22 additions and 0 deletions.
22 changes: 22 additions & 0 deletions doc/faq.rst
Original file line number Diff line number Diff line change
Expand Up @@ -79,6 +79,19 @@ show the underlying detail that can otherwise be lost. We
highly recommend investing the time to learn datashader for
UMAP plot particularly for larger datasets.

I ran out of memory. Help!
--------------------------

For some datasets the default options for approximate
nearest neighbor search can result in excessive memory use.
If your dataset is not especially large but you have found
that UMAP runs out of memory when operating on it consider
using the ``low_memory=True`` option, which will switch
to a slower but less memory intensive approach to computing
the approximate nearest neighbors. This may alleviate your
issues.


Is there GPU or multicore-CPU support?
--------------------------------------

Expand All @@ -99,6 +112,10 @@ been deferred for now. If you have expertise in GPU
programming with Numba and would be interested in
adding GPU support we would welcome your contributions.

There is a UMAP implementation for GPU available in
the NVIDIA RAPIDS cuML library, so if you need GPU
support that is currently the best palce to go.

Can I add a custom loss function?
---------------------------------

Expand Down Expand Up @@ -230,5 +247,10 @@ Successful use-cases

UMAP can be / has been Successfully applied to the following domains:

- Single cell data visualization in biology;
- Mapping malware based on behavioural data;
- Pre-processing phrase vectors for clustering;
- Pre-processing image embeddings (Inception) for clustering;

and many more -- if you have a successful use-case please submit
a pull request adding it to this list!

0 comments on commit 439db74

Please sign in to comment.