Makes prediction work on GPUs #149

dirkgr · 2019-04-09T20:13:14Z

When you use numpy.zeros to create the embed_arr, you can't later add it to embed_vector, because embed_vector might not be a numpy array. This re-works the code such that the array type of embed_vector is preserved all the way through.

I stole this approach from explosion/spaCy#3362. Thanks, @danielkingai2!

dirkgr · 2019-04-09T20:14:37Z

To help those who Google, the error message you get ends with

  File "neuralcoref.pyx", line 596, in neuralcoref.neuralcoref.NeuralCoref.__call__
  File "neuralcoref.pyx", line 723, in neuralcoref.neuralcoref.NeuralCoref.predict
  File "neuralcoref.pyx", line 913, in neuralcoref.neuralcoref.NeuralCoref.get_mention_embeddings
  File "neuralcoref.pyx", line 904, in neuralcoref.neuralcoref.NeuralCoref.get_average_embedding
  File "cupy/core/core.pyx", line 1238, in cupy.core.core.ndarray.__array_ufunc__
  File "cupy/core/_kernel.pyx", line 816, in cupy.core._kernel.ufunc.__call__
  File "cupy/core/_kernel.pyx", line 99, in cupy.core._kernel._preprocess_args
TypeError: Unsupported type <class 'numpy.ndarray'>

Though it does seem there are other problems too when you run on GPU.

dirkgr · 2019-04-09T22:29:29Z

I fixed all the other problems I am aware of at this point. On my machine, it runs about 8x faster on one GPU.

thomwolf · 2019-04-11T13:34:13Z

neuralcoref/neuralcoref.pyx

@@ -20,6 +20,13 @@ import array
 from libc.stdint cimport uint16_t, uint32_t, uint64_t, uintptr_t, int32_t

 import numpy
+try:
+    import cupy
+    to_numpy = cupy.asnumpy


Maybe we can avoid this dependency check by relying on Thinc for that.

How do you feel about this?

def to_numpy(a): if thinc.neural.util.is_cupy_array(a): import cupy return cupy.asnumpy(a) else: return a

That way there isn't a conditional import, but we still have to import cupy. I'm not that familiar with thinc, but the thinc source does not use cupy.asnumpy() anywhere, so there probably isn't a good wrapper.

thomwolf · 2019-04-11T13:36:10Z

neuralcoref/neuralcoref.pyx

-        cdef int n = 0
-        embed_arr = numpy.zeros(self.static_vectors.shape[1], dtype='float32')
-        for token in span:
-            if token.lower not in PUNCTS:


Why did you remove these?

This is all about removing the call to numpy.zeros(). Once I had replaced it with sum(), the code collapsed into just those two lines. Other than the location of the output vector, it should perform exactly the same way.

HarshTrivedi · 2019-04-21T01:23:39Z

@thomwolf Can you take a look at this PR? I am trying to use neuralcoref on a large corpus but w/o GPU it's taking too much of time.

HarshTrivedi · 2019-04-26T05:57:35Z

I could get it to work on GPU using this fix, thanks @dirkgr!

jqueguiner · 2019-06-21T04:14:16Z

Hi guys this is awesome work is it possible to merge the PR ?

stale · 2019-08-31T20:37:05Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

dirkgr · 2019-09-14T19:41:47Z

Hi guys this is awesome work is it possible to merge the PR ?

Looks like the answer is "no".

svlandeg · 2019-10-15T19:21:06Z

Sorry for the late response to this. The PR closed automatically but I think this is valuable work so I reopened. We're probably going to work on a closer integration with spaCy & thinc too.

stale · 2019-12-14T19:22:20Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

jkhalsa-arabesque · 2020-05-22T16:40:01Z

Any updates on this?

m1 · 2020-09-02T13:23:24Z

Would love some updates on this!

svlandeg · 2020-09-07T13:20:06Z

I want to keep this PR open as the code may be useful for those who want to build from source and try this out.

However moving forward, when spaCy v.3 will be released, we'll update this code significantly to be compatible with Thinc 8. At that point, GPU support will be automatic...

sreyemnayr · 2020-12-31T18:31:03Z

For those trying to make the pipeline work with GPU support on spaCy > 2.1, here's the additional step for patching prior to installing from source (after activating your venv and pip installing spaCy):

git clone https://github.com/huggingface/neuralcoref.git
cd neuralcoref
git fetch origin pull/149/head:gpufix
git checkout gpufix
pip install -r requirements.txt
pip install -e .

c0stya · 2021-02-02T14:14:00Z

Any progress on spaCy v.3 integration?

svlandeg · 2021-02-02T15:03:43Z

You mean the v3 that was released yesterday? ;-)

It's definitely on our roadmap, but it's not the only thing we're working on ;-)

LifeIsStrange · 2022-02-07T22:10:33Z

@svlandeg Friendly ping :)

svlandeg · 2022-02-08T14:23:56Z

Hi! Please refer to #295 (comment) for more info :-)

LifeIsStrange · 2022-02-08T14:51:56Z

@svlandeg Thx for the update, I'm curious wether the updated implementation target the latest state of the art (cf #334 ) AKA 81% accuracy on Ontonotes (or at least 79% since the second paper is quite old already)

Makes this work on GPUs

4f9c84c

Further fixes for GPU prediction

a305b6d

dirkgr changed the title ~~Makes this work on GPUs~~ Makes prediction work on GPUs Apr 9, 2019

Makes the cupy dependency optional

de5f93b

thomwolf reviewed Apr 11, 2019

View reviewed changes

One more fix. This should have been part of the original set of fixes.

457c10d

stale bot added the wontfix label Aug 31, 2019

stale bot closed this Sep 7, 2019

svlandeg added gpu and removed wontfix labels Oct 14, 2019

svlandeg reopened this Oct 14, 2019

stale bot added the wontfix label Dec 14, 2019

svlandeg removed the wontfix label Dec 17, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Makes prediction work on GPUs #149

Makes prediction work on GPUs #149

dirkgr commented Apr 9, 2019

dirkgr commented Apr 9, 2019

dirkgr commented Apr 9, 2019

thomwolf Apr 11, 2019

dirkgr Apr 12, 2019

thomwolf Apr 11, 2019

dirkgr Apr 11, 2019

HarshTrivedi commented Apr 21, 2019

HarshTrivedi commented Apr 26, 2019

jqueguiner commented Jun 21, 2019 •

edited

stale bot commented Aug 31, 2019

dirkgr commented Sep 14, 2019

svlandeg commented Oct 15, 2019 •

edited

stale bot commented Dec 14, 2019

jkhalsa-arabesque commented May 22, 2020

m1 commented Sep 2, 2020

svlandeg commented Sep 7, 2020

sreyemnayr commented Dec 31, 2020

c0stya commented Feb 2, 2021

svlandeg commented Feb 2, 2021

LifeIsStrange commented Feb 7, 2022

svlandeg commented Feb 8, 2022

LifeIsStrange commented Feb 8, 2022

Makes prediction work on GPUs #149

Are you sure you want to change the base?

Makes prediction work on GPUs #149

Conversation

dirkgr commented Apr 9, 2019

dirkgr commented Apr 9, 2019

dirkgr commented Apr 9, 2019

thomwolf Apr 11, 2019

Choose a reason for hiding this comment

dirkgr Apr 12, 2019

Choose a reason for hiding this comment

thomwolf Apr 11, 2019

Choose a reason for hiding this comment

dirkgr Apr 11, 2019

Choose a reason for hiding this comment

HarshTrivedi commented Apr 21, 2019

HarshTrivedi commented Apr 26, 2019

jqueguiner commented Jun 21, 2019 • edited

stale bot commented Aug 31, 2019

dirkgr commented Sep 14, 2019

svlandeg commented Oct 15, 2019 • edited

stale bot commented Dec 14, 2019

jkhalsa-arabesque commented May 22, 2020

m1 commented Sep 2, 2020

svlandeg commented Sep 7, 2020

sreyemnayr commented Dec 31, 2020

c0stya commented Feb 2, 2021

svlandeg commented Feb 2, 2021

LifeIsStrange commented Feb 7, 2022

svlandeg commented Feb 8, 2022

LifeIsStrange commented Feb 8, 2022

jqueguiner commented Jun 21, 2019 •

edited

svlandeg commented Oct 15, 2019 •

edited