
Improved Parallelization for Training and Inference #129

Closed
robbymeals wants to merge 37 commits

Conversation

robbymeals

robbymeals commented Jan 8, 2015

EDIT: Adding a TL;DR to this original comment to make it easier to review, since this PR has become something of a monster.

Summary of changes:
Added a mixin class ParallelMixin in a new pystruct.utils.parallel module that does the following:

  • adds a private method _spawn_pool that spawns a pool and stores it as an attribute
  • the pool can be a ThreadPool, MemmapingPool or Pool, depending on the environment and parameters
  • exposes a public method parallel with the same API as the standard library map, taking the function to be mapped and an iterable of arguments
  • __getstate__ and __setstate__ handle the pool attribute
  • the parallel method handles KeyboardInterrupt somewhat gracefully, allowing user interruption that stops training but preserves the model object and pool in a usable state
  • removed all Parallel calls and any other parallelization logic from the rest of the module; it now lives only in utils.parallel
  • added hacky wrappers for all functions that are currently used as mappables; this could potentially change to a decorator, but then usages of the original functions elsewhere would have to be updated
  • other stuff? (see the sketch below)
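For concreteness, here is a minimal, self-contained sketch of what such a mixin could look like. This is not the PR's actual code; the learner parameters it reads (n_jobs, use_threads) and the exact interrupt handling are assumptions made for illustration.

```python
import multiprocessing
from multiprocessing.pool import ThreadPool


class ParallelMixin(object):
    """Illustrative sketch only, not the PR's actual implementation."""

    def _spawn_pool(self):
        # Hypothetical learner parameters; the real mixin may differ.
        n_jobs = getattr(self, "n_jobs", 1)
        processes = None if n_jobs == -1 else n_jobs
        if getattr(self, "use_threads", False):
            self.pool = ThreadPool(processes=processes)
        else:
            # A real implementation could prefer joblib's MemmapingPool
            # here when it is available in the installed ecosystem.
            self.pool = multiprocessing.Pool(processes=processes)

    def parallel(self, function, iterable):
        # Same call signature as the built-in map().
        if getattr(self, "pool", None) is None:
            self._spawn_pool()
        try:
            return self.pool.map(function, iterable)
        except KeyboardInterrupt:
            # On Ctrl-C, swap in a fresh pool so both the model object
            # and self.pool remain usable after the interruption.
            self.pool.terminate()
            self._spawn_pool()
            raise

    def __getstate__(self):
        # Pools cannot be pickled, so drop the attribute when dumping ...
        state = self.__dict__.copy()
        state.pop("pool", None)
        return state

    def __setstate__(self, state):
        # ... and recreate it lazily after loading.
        self.__dict__.update(state)
        self.pool = None
```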

Original comment:

Hi! Using Parallel causes a new pool to be respawned on every iteration, which is slow from the start, causes memory issues, and becomes very slow once the matrices get at all large. @amueller what do you think of this approach? I am sure there are other moving parts that I'm not aware of that need to be considered, but replacing the Parallel calls as in this PR in both OneSlackSSVM and SubgradientSSVM worked really well for me.
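To make the contrast concrete, here is a toy, standalone sketch of the two patterns (nothing pystruct-specific; inference is just a stand-in for the real per-sample work):

```python
from multiprocessing import Pool
from joblib import Parallel, delayed


def inference(sample):
    # Stand-in for the real per-sample inference work.
    return sum(sample)


if __name__ == "__main__":
    X = [[i, i + 1, i + 2] for i in range(1000)]

    # Current pattern: Parallel spawns (and tears down) a fresh worker pool
    # inside every call, i.e. once per training iteration.
    for _ in range(50):
        results = Parallel(n_jobs=4)(delayed(inference)(x) for x in X)

    # Proposed pattern: spawn the pool once and reuse it, so the workers
    # (and anything copied into them) persist across iterations.
    pool = Pool(processes=4)
    for _ in range(50):
        results = pool.map(inference, X)
    pool.terminate()
```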

@robbymeals
Author

Right, I would have to add a check for the sklearn version to ensure it is available. You could move the creation of self.pool into the SSVM base object. I think even just using a pre-instantiated Pool would work; you just have to do more of the annoying error handling and such that is required when you use multiprocessing directly.

You could also conditionally instantiate and terminate self.pool in the fit method.
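A rough sketch of that variant, with made-up names standing in for the learner's internals (illustrative only, not pystruct's actual fit code):

```python
from multiprocessing import Pool


def _inference(sample):
    # Stand-in for pystruct's per-sample loss-augmented inference.
    return sum(sample)


class LearnerSketch(object):
    """Illustrative only: conditionally create and tear down the pool in fit()."""

    def __init__(self, n_jobs=1, max_iter=10):
        self.n_jobs = n_jobs
        self.max_iter = max_iter
        self.pool = None

    def fit(self, X):
        if self.n_jobs != 1:
            self.pool = Pool(None if self.n_jobs == -1 else self.n_jobs)
        try:
            for _ in range(self.max_iter):
                mapper = self.pool.map if self.pool is not None else map
                results = list(mapper(_inference, X))
                # ... update parameters from results, check convergence ...
        finally:
            # Terminate only the pool created for this fit() call.
            if self.pool is not None:
                self.pool.terminate()
                self.pool = None
        return self


if __name__ == "__main__":
    LearnerSketch(n_jobs=2).fit([[1, 2], [3, 4]])
```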

@amueller
Member

amueller commented Jan 2, 2015

Thanks for the PR.
I am a bit surprised that this has an impact on performance. Could you give some numbers / benchmarks?

@robbymeals
Author

Hey, sure! Currently I am playing around with this on an r3.4xlarge EC2 box that I have running for other tasks, so 16 vCPUs and 122 GB of RAM; not everyday capacity. I can't really share the data that I am using, but it is text data and I am currently just looking at the MultilabelClf and GridCRF models. I will mock up a more detailed example to show you what I'm talking about on data that looks more similar to mine. But just using your multi_label.py example and adding n_jobs=-1 to the learner instantiations, here are the benchmarks I get for the scene dataset:

Running first with my current master branch, using the vanilla Pool rather than MemmapingPool:

(nlp-venv)nlp@ip-10-153-157-196:~/benchmarks$ time python multi_label.py
fitting independent model...
fitting full model...
fitting tree model...
Training loss independent model: 0.066612
Test loss independent model: 0.111204
Training loss tree model: 0.059868
Test loss tree model: 0.106048
Training loss full model: 0.049408
Test loss full model: 0.097408

real    0m58.619s
user    1m41.180s
sys     0m5.705s

and then with your master, using Parallel:

(nlp-venv)nlp@ip-10-153-157-196:~/benchmarks$ time python multi_label.py
fitting independent model...

fitting full model...
fitting tree model...
Training loss independent model: 0.066612
Test loss independent model: 0.111204
Training loss tree model: 0.059868
Test loss tree model: 0.106048
Training loss full model: 0.049408
Test loss full model: 0.097408

real    4m3.351s
user    5m49.086s
sys     1m17.763s

I'll run a couple other examples to get similar comparisons for other datasets and models.
Sorry for all the commits, this PR is just meant to illustrate the approach.

@amueller
Member

amueller commented Jan 2, 2015

Sorry, did you mean n_jobs=1 or n_jobs=-1? With n_jobs=1 there is hopefully no difference.

@robbymeals
Author

Yes, n_jobs=-1, sorry.

@robbymeals
Author

These are the changes to the multi_label.py example:

full_ssvm = OneSlackSSVM(full_model, inference_cache=50, C=.1, tol=0.01, n_jobs=-1)

tree_ssvm = OneSlackSSVM(tree_model, inference_cache=50, C=.1, tol=0.01, n_jobs=-1)

independent_ssvm = OneSlackSSVM(independent_model, C=.1, tol=0.01, n_jobs=-1)

@amueller
Member

amueller commented Jan 2, 2015

That looks pretty promising. I want to run a couple more tests on the larger graphs (or you can, if you like). I'd like to understand what is happening a bit better; maybe @ogrisel can help. I did not think Parallel would have such an overhead over Pool. Maybe it is because the models are so small and inference is so quick?

@robbymeals
Author

No, it's even worse on larger models. The models I'm playing around with are much larger than these, and the problem gets worse with larger input matrices and actually worse with each iteration. I have run into this before in other contexts. I think it is just because Parallel reinstantiates the pool with every call, so everything has to get copied to the subprocesses again on each loop. Or whatever the correct way of saying that actually is. :)

Yeah I'd be happy to run them, did you have any specific examples in mind?

Btw, here are the benchmarks from multi_label.py for the yeast dataset:

My master, using Pool:

(nlp-venv)nlp@ip-10-153-157-196:~/benchmarks$ time python multi_label.py
fitting independent model...
fitting full model...
fitting tree model...
Training loss independent model: 0.191571
Test loss independent model: 0.199330
Training loss tree model: 0.191333
Test loss tree model: 0.200499
Training loss full model: 0.191095
Test loss full model: 0.199797

real    3m42.851s
user    6m15.402s
sys     0m8.750s

and current master using Parallel:

(nlp-venv)nlp@ip-10-153-157-196:~/benchmarks$ time python multi_label.py
fitting independent model...
fitting full model...
fitting tree model...
Training loss independent model: 0.191571
Test loss independent model: 0.199330
Training loss tree model: 0.191333
Test loss tree model: 0.200499
Training loss full model: 0.191095
Test loss full model: 0.199797

real    9m11.171s
user    13m14.096s
sys     2m13.286s

Sidenote: kudos on this package, it's very exciting!

@amueller
Member

amueller commented Jan 2, 2015

Thanks. I hope to improve it a lot in the near future, with respect to documentation, flexibility and sparse matrix support.
The snake example would be interesting, and the image segmentation example, too.

@robbymeals
Author

Ok those two are running now. Taking the approach I have here, with self.pool, I'd have to use https://docs.python.org/2/library/copy_reg.html to register a pickler that is aware of the pool and ignores it when dumping and recreates it if necessary when loading. I've done that before for very similar use cases, it's not that painful. I think that's why the checks are failing?
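For reference, a minimal sketch of the copy_reg/copyreg approach described above, using a hypothetical stand-in class rather than the actual SSVM learners:

```python
import copyreg   # copy_reg on Python 2
import pickle


class LearnerSketch(object):
    """Stand-in for a learner that keeps a non-picklable pool attribute."""

    def __init__(self, n_jobs=1):
        self.n_jobs = n_jobs
        self.pool = None   # would hold a multiprocessing pool during fit


def _rebuild_learner(state):
    obj = LearnerSketch.__new__(LearnerSketch)
    obj.__dict__.update(state)
    obj.pool = None        # recreated lazily the next time it is needed
    return obj


def _reduce_learner(obj):
    state = dict(obj.__dict__)
    state.pop("pool", None)            # ignore the pool when dumping
    return _rebuild_learner, (state,)


# Register the custom pickler for the learner class.
copyreg.pickle(LearnerSketch, _reduce_learner)

# Round-trip works; a live pool attribute would simply be stripped on dump.
restored = pickle.loads(pickle.dumps(LearnerSketch(n_jobs=4)))
```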

@robbymeals
Author

Not seeing as much of a difference in these examples:

image segmentation benchmarks using Pool:

...
real    16m47.309s
user    28m43.323s
sys     0m49.150s
(nlp-venv)nlp@ip-10-153-157-196:~/benchmarks$

and using Parallel:

...
real    16m49.758s
user    29m11.565s
sys     0m49.593s

and plot_snakes.py, using Pool:

(nlp-venv)nlp@ip-10-153-157-196:~/benchmarks$ time python plot_snakes.py
Please be patient. Learning will take 5-20 minutes.
Results using only directional features for edges
Test accuracy: 0.829
[[2750    0    0    0    0    0    0    0    0    0    0]
 [   0   98    0    0    1    0    0    0    1    0    0]
 [   0    6   38    3   34    8    1    2    5    1    2]
 [   0    9    8   10    8   41    1   12    3    7    1]
 [   0    1   14    2   37    8    1    9   21    5    2]
 [   0    4    2    9   16   29    2   19   11    7    1]
 [   0    2   13    3   30   16    2    7   20    5    2]
 [   0    7    5    8   15   29    3   14    8   11    0]
 [   0    3   10    3   29   10    1    6   20    3   15]
 [   0    9    3    2   10    8    0   15    4   46    3]
 [   0    2    7    3    9    1    1    3    7    3   64]]
Results using also input features for edges
Test accuracy: 0.998
[[2749    0    0    0    0    0    0    0    1    0    0]
 [   0  100    0    0    0    0    0    0    0    0    0]
 [   0    0  100    0    0    0    0    0    0    0    0]
 [   0    0    0  100    0    0    0    0    0    0    0]
 [   0    0    0    0   98    0    2    0    0    0    0]
 [   0    0    0    0    0   99    0    1    0    0    0]
 [   0    0    0    0    0    0  100    0    0    0    0]
 [   0    0    0    0    0    1    0   99    0    0    0]
 [   0    0    0    0    0    0    0    0  100    0    0]
 [   0    0    0    0    0    0    0    1    0   99    0]
 [   0    0    0    0    0    0    0    0    0    0  100]]

real    5m5.135s
user    21m33.105s
sys     0m5.142s

and using Parallel:

(nlp-venv)nlp@ip-10-153-157-196:~/benchmarks$ time python plot_snakes.py
Please be patient. Learning will take 5-20 minutes.
Results using only directional features for edges
Test accuracy: 0.829
[[2750    0    0    0    0    0    0    0    0    0    0]
 [   0   98    0    0    1    0    0    0    1    0    0]
 [   0    6   38    3   34    8    1    2    5    1    2]
 [   0    9    8   10    8   41    1   12    3    7    1]
 [   0    1   14    2   37    8    1    9   21    5    2]
 [   0    4    2    9   16   29    2   19   11    7    1]
 [   0    2   13    3   30   16    2    7   20    5    2]
 [   0    7    5    8   15   29    3   14    8   11    0]
 [   0    3   10    3   29   10    1    6   20    3   15]
 [   0    9    3    2   10    8    0   15    4   46    3]
 [   0    2    7    3    9    1    1    3    7    3   64]]
Results using also input features for edges
Test accuracy: 0.998
[[2749    0    0    0    0    0    0    0    1    0    0]
 [   0  100    0    0    0    0    0    0    0    0    0]
 [   0    0  100    0    0    0    0    0    0    0    0]
 [   0    0    0  100    0    0    0    0    0    0    0]
 [   0    0    0    0   98    0    2    0    0    0    0]
 [   0    0    0    0    0   99    0    1    0    0    0]
 [   0    0    0    0    0    0  100    0    0    0    0]
 [   0    0    0    0    0    1    0   99    0    0    0]
 [   0    0    0    0    0    0    0    0  100    0    0]
 [   0    0    0    0    0    0    0    1    0   99    0]
 [   0    0    0    0    0    0    0    0    0    0  100]]

real    5m6.173s
user    21m33.023s
sys     0m5.185s

@robbymeals
Author

Also, following up on the sidenote: I have done some work toward sparse matrix support. Have you already made a lot of progress on that? If so, maybe a feature branch would make sense, so no one reinvents your wheels.

@ogrisel

ogrisel commented Jan 3, 2015

I want to run a couple more tests on the larger graphs (or you can, if you like). I'd like to understand what is happening a bit better. maybe @ogrisel can help. I did not think Parallel would have such an overhead over Pool. Maybe it is because the models are so small and inference is so quick?

Yes, I would like to experiment with a version of joblib that would be able to reuse an existing instance of a pool of worker processes. Typically I would like joblib to create a single pool with n_cpus workers by default, use a subset of those workers when n_jobs < n_cpus, and use all the workers when n_jobs == -1. But it's tedious to sub-slice an existing pool with the multiprocessing API. It was not designed to do this...

@ogrisel

ogrisel commented Jan 3, 2015

Ok those two are running now. Taking the approach I have here, with self.pool, I'd have to use https://docs.python.org/2/library/copy_reg.html to register a pickler that is aware of the pool and ignores it when dumping and recreates it if necessary when loading. I've done that before for very similar use cases, it's not that painful. I think that's why the checks are failing?

The default multiprocessing Pool is very restricted. The MemmapingPool of joblib makes it possible to:

  • register custom picklers,
  • avoid memory copy when the input data is already a np.memmap instance,
  • automatically memory map large numpy arrays to limit the number of memory copies when the same array is used in many concurrent workers.
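A small usage sketch, assuming a joblib of that era where the class lives in joblib.pool (bundled as sklearn.externals.joblib.pool in scikit-learn); the constructor arguments shown are assumptions based on the commonly documented ones:

```python
import numpy as np
from joblib.pool import MemmapingPool   # sklearn.externals.joblib.pool in bundled versions


def column_mean(args):
    X, j = args
    return float(X[:, j].mean())


if __name__ == "__main__":
    X = np.random.rand(20000, 50)        # ~8 MB, well above max_nbytes

    # Arrays bigger than max_nbytes are dumped to a temporary memmap and
    # shared with the workers instead of being pickled into each of them.
    pool = MemmapingPool(processes=4, max_nbytes=1e6)
    try:
        means = pool.map(column_mean, [(X, j) for j in range(X.shape[1])])
    finally:
        pool.terminate()
```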

@robbymeals
Author

@ogrisel Yes, my intention definitely would be to use MemmapingPool in any real implementation; I have reverted to Pool only because one of the Travis checks uses a version of scikit-learn without the sklearn.externals.joblib.pool submodule. I was talking about pickling the model object, as in the logger functionality, which would require registering a function that strips out the self.pool attribute in some smart way on dump and reinstantiates it on load, I think.

@robbymeals
Author

Yes, I would like to experiment with a version of joblib that would be able to reuse an existing instance of a pool of worker processes. Typically I would like joblib to create a single pool with n_cpus workers by default, use a subset of those workers when n_jobs < n_cpus, and use all the workers when n_jobs == -1. But it's tedious to sub-slice an existing pool with the multiprocessing API. It was not designed to do this...

@ogrisel How were you thinking of referencing the pool, namespace-wise? I have had difficulty figuring out a way to do this in other applications that didn't involve ugly global hacks, if that makes sense, but I may be missing some key piece.

@robbymeals
Author

@amueller So if I did some further work to implement this approach more fully, would you consider including it in the project, obviously after review and approval and whatever else? Or would you want to build it out yourself?

@robbymeals
Author

If it does interest you, I'm happy to take marching orders before undertaking it. I will probably do some version of it for use internally at my job, but I would rather do it in a way that can be pushed back into the project and conforms to your standards and practices.

@robbymeals
Author

I enabled use of the learner's pool for inference as well, shaved a few more seconds off the multilabel examples, and got some noticeable improvement on the other examples; rerunning those now. Still failing four tests in the Crammer-Singer test file, and I can't figure out why.

multilabel, scenes dataset:

(nlp-venv)nlp@ip-10-153-157-196:~/benchmarks$ time python multi_label.py
fitting independent model...
fitting full model...
fitting tree model...
Training loss independent model: 0.066612
Test loss independent model: 0.111204
Training loss tree model: 0.059868
Test loss tree model: 0.106048
Training loss full model: 0.049408
Test loss full model: 0.097408

real    0m51.503s
user    1m30.192s
sys     0m2.084s

multilabel, yeast dataset:

(nlp-venv)nlp@ip-10-153-157-196:~/benchmarks$ time python multi_label.py
fitting independent model...
fitting full model...
fitting tree model...
Training loss independent model: 0.191571
Test loss independent model: 0.199330
Training loss tree model: 0.191333
Test loss tree model: 0.200499
Training loss full model: 0.191095
Test loss full model: 0.199797

real    3m38.756s
user    6m13.319s
sys     0m3.004s

@robbymeals
Author

(nlp-venv)nlp@ip-10-153-157-196:~/benchmarks$ time python plot_snakes.py
Please be patient. Learning will take 5-20 minutes.
Results using only directional features for edges
Test accuracy: 0.829
[[2750    0    0    0    0    0    0    0    0    0    0]
 [   0   98    0    0    1    0    0    0    1    0    0]
 [   0    6   38    3   34    8    1    2    5    1    2]
 [   0    9    8   10    8   41    1   12    3    7    1]
 [   0    1   14    2   37    8    1    9   21    5    2]
 [   0    4    2    9   16   29    2   19   11    7    1]
 [   0    2   13    3   30   16    2    7   20    5    2]
 [   0    7    5    8   15   29    3   14    8   11    0]
 [   0    3   10    3   29   10    1    6   20    3   15]
 [   0    9    3    2   10    8    0   15    4   46    3]
 [   0    2    7    3    9    1    1    3    7    3   64]]
Results using also input features for edges
Test accuracy: 0.998
[[2749    0    0    0    0    0    0    0    1    0    0]
 [   0  100    0    0    0    0    0    0    0    0    0]
 [   0    0  100    0    0    0    0    0    0    0    0]
 [   0    0    0  100    0    0    0    0    0    0    0]
 [   0    0    0    0   98    0    2    0    0    0    0]
 [   0    0    0    0    0   99    0    1    0    0    0]
 [   0    0    0    0    0    0  100    0    0    0    0]
 [   0    0    0    0    0    1    0   99    0    0    0]
 [   0    0    0    0    0    0    0    0  100    0    0]
 [   0    0    0    0    0    0    0    1    0   99    0]
 [   0    0    0    0    0    0    0    0    0    0  100]]

real    4m54.910s
user    21m15.152s
sys     0m4.272s

@robbymeals
Author

@amueller I still can't figure out why those four Crammer-Singer tests are failing. Probably something simple I just don't know about.

@robbymeals
Author

multi_label.py benchmarks, for n_jobs=-1 runs with

  1. default OneSlackSSVM config (using MemmapingPool),
  2. use_memmapping_pool=0, and
  3. use_threads=1,

respectively:

(nlp-venv)nlp@ip-10-234-187-58:~/benchmarks$ time python multi_label.py
fitting independent model...
fitting full model...
fitting tree model...
Training loss independent model: 0.066612
Test loss independent model: 0.111204
Training loss tree model: 0.059868
Test loss tree model: 0.106048
Training loss full model: 0.049408
Test loss full model: 0.097408

real    1m32.018s
user    2m1.699s
sys     0m13.477s
(nlp-venv)nlp@ip-10-234-187-58:~/benchmarks$ time python multi_label.py
fitting independent model...
fitting full model...
fitting tree model...
Training loss independent model: 0.066612
Test loss independent model: 0.111204
Training loss tree model: 0.059868
Test loss tree model: 0.106048
Training loss full model: 0.049408
Test loss full model: 0.097408

real    0m49.551s
user    1m29.923s
sys     0m2.540s
(nlp-venv)nlp@ip-10-234-187-58:~/benchmarks$ time python multi_label.py
fitting independent model...
fitting full model...
fitting tree model...
Training loss independent model: 0.066612
Test loss independent model: 0.111204
Training loss tree model: 0.059868
Test loss tree model: 0.106048
Training loss full model: 0.049408
Test loss full model: 0.097408

real    1m46.923s
user    2m53.827s
sys     0m16.664s
(nlp-venv)nlp@ip-10-234-187-58:~/benchmarks$ 

@robbymeals
Author

After some extensive testing, I'm actually leaning towards having the vanilla Pool be the default, since it is so much faster, and making MemmapingPool an option. But I'll stop making changes here until review.

@amueller
Member

amueller commented Feb 6, 2015

Hey. Just wanted to say this is not forgotten ;) I'll try to work on it next week. I had a bunch of scikit-learn stuff to do as we want to release soon.
Thank you so much for your extensive testing.

Btw, have you looked into sparse matrix support any more?

@robbymeals
Author

No worries, I follow the scikit-learn repo, so I figured that was it, given the crazy number of times you've popped up in the newsfeed. :)

RE: sparse matrix support, I think I am missing some fundamental pieces of how to get there from here, honestly. The stuff I came up with is pretty janky and probably not a good starting point. I have been using LSA (TruncatedSVD) in my prototype, to temporarily get around the sparse support issue.

I am definitely willing and able to work on it but I think I would need a bit of guidance from you to make real progress. If you have a bit of time, maybe you could do a high level brain dump of how you envision the best way to implement it and I could work off of that.

@amueller
Member

amueller commented Feb 6, 2015

If you don't have a working prototype, I think I'll just try and work it out. I wouldn't want to point in a wrong direction.

@robbymeals
Author

Yes, I think that's probably best, if you have the time.

@robbymeals
Author

Hey, so I am trying to get my master (with the parallelization changes) up to date with your changes, and I am failing some tests in the docs and in Python 3. I am currently using my fork as the dependency for stuff I am working on. I'm totally willing to do the work to make those tests pass, but I didn't want to waste time if you think my implementation is a nonstarter. Again, no worries if so; I am actually reusing the parallel utils I created here in other stuff, so it wasn't a waste of time on my end regardless :).

@amueller
Member

I'm sorry, I still haven't looked at your contribution.
I think the changes are good, but I had too much else to work on.
Did you sync with my master? And are there doctests in the user guide failing?
I guess I have to check with Travis; I didn't try the doctests on Python 3.


@amueller
Member

It looks like the doctest failures are actually caused by your changes. Feel free to fix them, but it is not that urgent. I hope I have time to look at your changes soon, but I will probably add sparse matrix support before that.

Cheers,
Andy


@robbymeals
Author

Sorry, yeah, my comment wasn't all that clear.
Right, I was syncing with your master, and there were a few failing tests that seemed to be caused by the new params added in my changes showing up in the docs. The Python 3 failures are different; those are core tests failing for some reason that I need to dig into. Basically I'm just making sure you weren't trying to politely decline my PR before I put the work in to fix tests that don't directly affect the work I am using the package for :) No worries on time, I get it.

@amueller
Member

I think your approach is good, and it would be great if you could try to stay in sync with master. I hope there will be a couple of new features arriving soon.


@robbymeals
Author

OK, will do. I actually have made a bit more progress on sparse matrix support; it is still pretty hacky, but I will put it up in a feature branch if you want to take a look.

@amueller
Member

amueller commented May 3, 2015

Definitely :) please submit a PR. It is on my to-do list for the next one or two weeks.

Cheers,
Andy

@amueller
Member

It would be interesting to bench your approach against the improved joblib here:
joblib/joblib#157 (comment)

It still allocates many more pools than necessary, so it might not be as good.

@amueller
Member

Also, pystruct has not gotten enough attention lately, sorry :-/

@amueller
Member

The branch I mentioned above was merged into scikit-learn. Can you check your patch against current pystruct with the dev branch of scikit-learn, please?

@ogrisel

ogrisel commented Jul 16, 2015

Also, there is a new context manager API in the works at joblib/joblib#221. Feel free to test that branch with the monkey-patch mentioned in the first comment.
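For reference, the context manager API that eventually shipped in joblib lets one set of workers be reused across several calls, roughly like this:

```python
from math import sqrt
from joblib import Parallel, delayed

# Reuse one set of workers for several consecutive calls instead of
# spawning a fresh pool for each one.
with Parallel(n_jobs=4) as parallel:
    totals = []
    for iteration in range(5):          # e.g. one call per training iteration
        results = parallel(delayed(sqrt)(i ** 2) for i in range(100))
        totals.append(sum(results))
print(totals)
```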

@robbymeals
Author

I'll try and do this by the end of this weekend. Bit swamped at the moment.

@robbymeals
Author

Hey, a long time later: is this still an optimization you're interested in pursuing? If you've come up with a better solution that I missed, or this isn't the direction you want to go, I'm happy to close this PR.

@robbymeals
Author

Oh, I see that I said I would try it against the latest joblib. I can do that if you haven't, if this is still something worth looking into.

@amueller
Member

amueller commented May 2, 2016

Hey. Yeah, so the latest joblib can reuse pools of workers, so that might be a simpler solution than yours. I haven't benchmarked it, and I think it would be worth trying. I'd love to have the better of the two in ;)

@robbymeals closed this Feb 7, 2017