First attempt to use more gpus #339

tfarago · 2014-10-21T12:17:25Z

A collaborative PR for making UFO work on more GPUs from concert.

tfarago · 2014-10-21T12:18:00Z

This is a deep WIP so far...

tfarago · 2014-10-24T12:16:42Z

@matze so I tried this:

shape = 2048, 4096
image = np.random.normal(1000, 10, size=shape).astype(np.float32)


def generate(num_images=10):
    image = np.random.normal(1000, 10, size=shape).astype(np.float32)

    for i in range(num_images):
        print 'yielding', id(image), i + 1
        yield image
        print 'yielded', id(image), i + 1


@async
def process(backproject, arch, gpu, result, num_images=10):
    try:
        inject(generate(num_images=num_images), backproject(result(), arch=arch, gpu=gpu)) 
    finally:
        backproject.wait()


def run(num_images=10):
    arch = Ufo.ArchGraph()
    gpus = arch.get_gpu_nodes()
    resources = {}
    for i, gpu in enumerate(gpus):
        resources[gpu] = (Backproject(image.shape[1] / 2), Result())

    futures = []
    st = time.time()
    for gpu, (bp, result) in resources.iteritems():
        futures.append(process(bp, arch, gpu, result, num_images=num_images))

    wait(futures)
    print 'duration: {} s'.format(time.time() - st)

    return zip(*resources.values())[1]

But unfortunately I got no speedup plus the memory was not released.

tfarago · 2014-10-24T12:20:51Z

Even though I call wait the resources are not released. But when I type exit in the session I see the release messages from the framework, so it actually does the job but a little late. Maybe we need to release resources manually with this scheduler?

tfarago · 2014-10-24T12:22:01Z

And funnily, those yielding and yielded printouts look like the stuff gets parallelized... weird.

matze · 2014-10-27T08:06:44Z

There is a nasty race condition for me in this example. Waiting for the futures begins before the processes started which results in an AttributeError because there is no Backproject.thread object yet created.

matze · 2014-10-27T08:40:31Z

Actually I have problems to get this running at all. Do you made any changes to either ufo-core or Concert?

tfarago · 2014-10-27T08:51:09Z

Did you install this branch when you tried?

matze · 2014-10-27T08:51:58Z

Yes, of course.

tfarago · 2014-10-27T08:53:09Z

Can you re-check? I tried with master and got the same error you mentioned above

AttributeError: 'Backproject' object has no attribute 'thread'

tfarago · 2014-10-27T08:54:16Z

I just logged in, did no changes to ufo-* and it works.

matze · 2014-10-27T09:11:17Z

As I said, it's a race condition. If it starts executing before waiting on the futures begins it should be fine. But I am not sure how to solve this generally at the moment.

tfarago · 2014-10-27T09:30:29Z

It is not a race condition because when you wait for the futures (if you mean the ones just before the duration printout) you are waiting for the process function which executes these commands serially (pseudo):

try:
    backproject.start() # inside the __call__, backproject.thread exists from now on
    while True:
        # crunch
finally:
    backproject.wait()

Try this process function instead:

@async
def process(backproject, arch, gpu, result, num_images=10):
    inject(generate(num_images=num_images), backproject(result(), arch=arch, gpu=gpu)) 
    backproject.wait()

Does it by any chance say:

TypeError: __call__() got an unexpected keyword argument 'gpu'

matze · 2014-10-27T09:35:03Z

Does it by any chance say:

No, it never did so because I told you already that I am using the correct branch. After removing the try/finally blocks it kind of works now.

tfarago · 2014-10-27T09:41:52Z

Hm, but then I have no idea what the problem is. Especially since for me it works every time. Actually I experienced some random hangings and segfaults but that was very rare and before your final python multithreading fix, now I have no idea if it still happens.

matze · 2014-10-27T11:26:58Z

I release the GIL in the output task now which let's all schedulers run fully in parallel. I haven't checked the memory consumption yet though.

tfarago · 2014-10-27T12:34:12Z

Should I try? Do I need to update/do smth. special on the ufo server?

matze · 2014-10-27T12:53:45Z

Yeah, try what you wanted to try. I see the speed up and I guess you have to update ufo-core if you don't use a local installation.

tfarago · 2014-10-27T13:51:51Z

After a fresh pull it doesn't build...

EDIT: All OK, make clean rules.

tfarago · 2014-10-27T13:53:18Z

Nope, it builds on my machine but not on the server...

matze · 2014-10-27T13:59:50Z

Nope, it builds on my machine but not on the server...

I ran it on the server ...

tfarago · 2014-10-27T14:24:18Z

In /opt/ufo-core:

[ 47%] Building C object ufo/CMakeFiles/ufo.dir/ufo-output-task.c.o
/opt/ufo-core/ufo/ufo-output-task.c: In function ‘ufo_output_task_get_output_buffer’:
/opt/ufo-core/ufo/ufo-output-task.c:116:5: error: ‘buffer’ may be used uninitialized in this function [-Werror=uninitialized]
cc1: all warnings being treated as errors

make[2]: *** [ufo/CMakeFiles/ufo.dir/ufo-output-task.c.o] Error 1
make[1]: *** [ufo/CMakeFiles/ufo.dir/all] Error 2
make: *** [all] Error 2

How did you manage locally?

matze · 2014-10-27T14:37:42Z

I have no idea why it didn't complain in my case. Anyway, I pushed a change that removes that warning, compiled and installed it. Have fun.

tfarago · 2014-10-27T22:10:37Z

Thanks, I will!

tfarago · 2015-03-03T15:05:58Z

I am trying to make this work with CopyTask but no luck. The code from here crashes with the following awfully:

f = FooProcess()
im = np.empty((512, 512), dtype=np.float32)
result = Result()
inject((im,), f(result()))

Can @matze take a look please?

to UniversalBackprojectArgs for user convenience.

to unify naming with ufo-filters.

i.e. there are no slice consumers.

tfarago · 2020-01-08T11:34:30Z

This has been around for long enough.

tfarago added in progress enhancement labels Oct 21, 2014

tfarago added this to the Concert 1.0 milestone Oct 21, 2014

This was referenced Oct 21, 2014

Refactor resource usage and allow setting used GPUs ufo-kit/ufo-core#44

Merged

GIL needs to be released when we are in C (UFO, libuca) #341

Closed

tfarago force-pushed the ufo-multi-gpus branch from f757a30 to 6a2ce76 Compare March 3, 2015 08:10

tfarago added 25 commits February 13, 2019 16:31

find_parameter: don't force splittin policy 'one'

af40e3a

find_parameter: generalize setting found parameter

cc05850

find_parameter: Enable maximization

cd4f314

Document find_parameter

8ebed28

Add z_parameters and slice_metrics attributes

4e29e52

to UniversalBackprojectArgs for user convenience.

Rename Universal to General

c7d18af

to unify naming with ufo-filters.

find_parameter: make value storing optional

88b54ae

manager: enable gpus specification

25e2909

Enable finding of more parameters at once

38d1dd2

Don't flip the metric by default

6cbd4d5

Use straightforward sag instead of msag

640c9c4

imageprocessing: add filter_low_frequencies

a8c705b

Add fwhm to find_parameters

f90379d

Enable input specification in find_parameter

a181273

reco addon: Use first normalization image

ccf5d3d

Enable OutputTask to return NULL

42bc524

Make manager abortable

bcd9686

Fix flake8

3544207

Add dummy ImagingExperiment

e7326bd

Simplify normalization computation in online reco

c8b09ba

ufo: Add FlatCorrect class

ec90dcd

Online reco: don't broadcast result if unnecessary

8784ec1

i.e. there are no slice consumers.

Don't require width and height by reco args

d414b3e

Allow projection crop disabling

4c4a048

Add number of projections info to reco manager

acad4df

tfarago force-pushed the ufo-multi-gpus branch from 3f48902 to acad4df Compare February 13, 2019 15:31

Simplify source creation by general reco graph

9ea89fb

tfarago merged commit fa18e84 into master Jan 8, 2020

tfarago deleted the ufo-multi-gpus branch January 8, 2020 11:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

First attempt to use more gpus #339

First attempt to use more gpus #339

tfarago commented Oct 21, 2014

tfarago commented Oct 21, 2014

tfarago commented Oct 24, 2014

tfarago commented Oct 24, 2014

tfarago commented Oct 24, 2014

matze commented Oct 27, 2014

matze commented Oct 27, 2014

tfarago commented Oct 27, 2014

matze commented Oct 27, 2014

tfarago commented Oct 27, 2014

tfarago commented Oct 27, 2014

matze commented Oct 27, 2014

tfarago commented Oct 27, 2014

matze commented Oct 27, 2014

tfarago commented Oct 27, 2014

matze commented Oct 27, 2014

tfarago commented Oct 27, 2014

matze commented Oct 27, 2014

tfarago commented Oct 27, 2014

tfarago commented Oct 27, 2014

matze commented Oct 27, 2014

tfarago commented Oct 27, 2014

matze commented Oct 27, 2014

tfarago commented Oct 27, 2014

tfarago commented Mar 3, 2015

tfarago commented Jan 8, 2020

First attempt to use more gpus #339

First attempt to use more gpus #339

Conversation

tfarago commented Oct 21, 2014

tfarago commented Oct 21, 2014

tfarago commented Oct 24, 2014

tfarago commented Oct 24, 2014

tfarago commented Oct 24, 2014

matze commented Oct 27, 2014

matze commented Oct 27, 2014

tfarago commented Oct 27, 2014

matze commented Oct 27, 2014

tfarago commented Oct 27, 2014

tfarago commented Oct 27, 2014

matze commented Oct 27, 2014

tfarago commented Oct 27, 2014

matze commented Oct 27, 2014

tfarago commented Oct 27, 2014

matze commented Oct 27, 2014

tfarago commented Oct 27, 2014

matze commented Oct 27, 2014

tfarago commented Oct 27, 2014

tfarago commented Oct 27, 2014

matze commented Oct 27, 2014

tfarago commented Oct 27, 2014

matze commented Oct 27, 2014

tfarago commented Oct 27, 2014

tfarago commented Mar 3, 2015

tfarago commented Jan 8, 2020