
Added PyForwardFromTo and PyBackwardFromTo to Net, for releasing GIL … #4360

Closed

Conversation

alessandroferrari

…in pycaffe. Added ScopedGILRelease for easy GIL release. Modified _caffe.cpp in pycaffe accordingly.

@seanbell

seanbell commented Jun 26, 2016

The tests failed due to style issues -- please run make lint.

@alessandroferrari
Author

Multithreading in Python is hamstrung by the GIL. Many users (myself included) run net predictions through pycaffe, often from multithreaded applications. The pycaffe interface does not release the GIL, which means that while Caffe is computing a prediction, every other Python thread is stuck. Since predictions are time consuming, this can slow the application down badly. With this pull request, the GIL is released for the forward and backward passes of the network; within a pass it is reacquired only temporarily, for Python layers, which must hold the GIL.
Thanks to this, the Python interface can run predictions of different networks in parallel on multiple threads within the same process, while maintaining complete thread safety. Before, everything was serialized by the GIL (false sharing).
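
To illustrate the intended usage (an editorial sketch, not code from this PR; the model files are hypothetical), two nets predicting on separate Python threads only actually overlap once forward() releases the GIL:

import threading
import caffe

caffe.set_mode_cpu()

# Hypothetical model files, for illustration only.
nets = [caffe.Net('net_a.prototxt', 'net_a.caffemodel', caffe.TEST),
        caffe.Net('net_b.prototxt', 'net_b.caffemodel', caffe.TEST)]

def predict(net):
    # With the GIL released inside ForwardFromTo, this call no longer
    # blocks the other Python threads of the process.
    net.forward()

threads = [threading.Thread(target=predict, args=(n,)) for n in nets]
for t in threads:
    t.start()
for t in threads:
    t.join()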

Dtype PyForwardFromTo(int start, int end) {
  // Release GIL
  m_thread_state = PyEval_SaveThread();
@ajtulloch
Contributor

ajtulloch commented Jun 29, 2016

This should be an RAII wrapper around Save/Restore thread for exception safety.

This definitely isn't the right place to put this logic though, FWIW.

@alessandroferrari
Author

RAII wrapper: thank you for the suggestion. I will take a look and fix it.

Not the right place: I agree. However, where would you suggest placing it? (Consider that you have to reacquire the GIL whenever you forward/backward a Python layer within the net.) Even though this solution inserts "Pythonish" code into the C++ logic, not releasing the GIL in pycaffe is a painful bug for the Python interface. If there is a cleaner alternative I would love to discuss it.

@ajtulloch
Contributor

Why not just simply:

a) Release the GIL in PyCaffe
b) Reacquire the GIL in PythonLayer before running the function in that layer?

@alessandroferrari
Author

It would be cleaner, and it was my first thought. However, two things stopped me:

  • PyEval_SaveThread returns the thread state of the current thread; you would somehow have to pass this state from PyCaffe to the PythonLayer without relying on Net (otherwise it would be the same);
  • I suspect that in the Python layer, Boost.Python does some magic that requires the GIL for passing the Python object arguments to the C++ code, since I get a segfault if I acquire the GIL in PythonLayer right before calling the inner forward. The GIL needs to be acquired before calling the forward method of Python layers.

Thus, even if releasing the GIL within the Net is not clean, since Pythonish code is inserted into the C++ logic, it looked like the only viable solution. But if you have any ideas on how to circumvent these problems, I will be happy to give them a try.

@crowsonkb
Contributor

I use the multiprocessing package instead of threading, both to work around the GIL and to parallelize PyCaffe over multiple GPUs. My main Python interpreter spawns several other interpreters, each of which has one instance of PyCaffe.
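
For reference, a minimal sketch of that multiprocessing pattern (editorial, not from the thread; the model files and GPU count are hypothetical):

import multiprocessing as mp

def worker(gpu_id):
    # Import caffe inside the child so each process gets its own interpreter,
    # its own GIL, and its own PyCaffe instance.
    import caffe
    caffe.set_device(gpu_id)
    caffe.set_mode_gpu()
    # Hypothetical model files, for illustration only.
    net = caffe.Net('deploy.prototxt', 'weights.caffemodel', caffe.TEST)
    net.forward()

if __name__ == '__main__':
    procs = [mp.Process(target=worker, args=(gpu,)) for gpu in range(2)]
    for p in procs:
        p.start()
    for p in procs:
        p.join()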

@alessandroferrari
Author

Of course multiprocessing does not have GIL synchronization problems.
Still, multiprocessing comes with an inter-process communication cost. There are many applications in production environments where multithreading is beneficial, such as model ensembling and performance-critical apps. NumPy, OpenCV, and the scikits all release the GIL.

@@ -593,19 +597,19 @@ $(TEST_ALL_BIN): $(TEST_MAIN_SRC) $(TEST_OBJS) $(GTEST_OBJ) \
 	| $(DYNAMIC_NAME) $(TEST_BIN_DIR)
 	@ echo CXX/LD -o $@ $<
 	$(Q)$(CXX) $(TEST_MAIN_SRC) $(TEST_OBJS) $(GTEST_OBJ) \
-		-o $@ $(LINKFLAGS) $(LDFLAGS) -l$(LIBRARY_NAME) -Wl,-rpath,$(ORIGIN)/../lib
+		-o $@ $(LINKFLAGS) -l$(LIBRARY_NAME) $(LDFLAGS) -Wl,-rpath,$(ORIGIN)/../lib


Why is this change necessary?

@seanbell

seanbell commented Jul 30, 2016

Thanks @alessandroferrari for the great PR, and I agree that the GIL should be released during forward/backward. This enables pre-fetching with multi-threading (not multi-processing).

Can you please squash this into a single commit?

@shelhamer @longjon I think this is an important feature when using Python data layers. Do you have any thoughts?
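
As a sketch of the prefetching pattern this enables (editorial, not from the thread; shapes and file names are hypothetical), a pure-Python producer thread can keep preparing batches while forward() runs in C++ with the GIL released:

import queue
import threading
import numpy as np
import caffe

batches = queue.Queue(maxsize=4)

def prefetch():
    # Decoding/augmentation in Python can now overlap with the C++ forward pass.
    while True:
        batches.put(np.random.rand(1, 3, 224, 224).astype(np.float32))

threading.Thread(target=prefetch, daemon=True).start()

# Hypothetical model files, for illustration only.
net = caffe.Net('deploy.prototxt', 'weights.caffemodel', caffe.TEST)
for _ in range(10):
    net.blobs['data'].data[...] = batches.get()
    net.forward()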

@@ -308,6 +333,9 @@ class Net {
   /// The root net that actually holds the shared layers in data parallelism
   const Net* const root_net_;
   DISABLE_COPY_AND_ASSIGN(Net);
+
+  // For releasing/reacquiring GIL with pycaffe
+  shared_ptr<ScopedGILRelease> scoped_gil_release;

This should be scoped_gil_release_ to be consistent with other naming.

@cypof
Member

cypof commented Jan 17, 2017

Each net can run in its own fork now; there is an example in /python/train.py. #4563

@cypof closed this Jan 17, 2017
@willyd
Contributor

willyd commented Mar 30, 2017

> Thanks @alessandroferrari for the great PR, and I agree that the GIL should be released during forward/backward. This enables pre-fetching with multi-threading (not multi-processing).

I think that even with the new multiprocessing parallel training, releasing the GIL is still relevant for the above use case. I get better data throughput by using the threading module and releasing the GIL in Net::ForwardFromTo and Net::BackwardFromTo than by using the multiprocessing module, for single-GPU training on Windows. Of course, we have to reacquire the GIL before calling any Python layer functions such as setup, reshape, etc.

@willyd
Contributor

willyd commented Mar 30, 2017

> PyEval_SaveThread returns the thread state of the current thread; you would somehow have to pass this state from PyCaffe to the PythonLayer without relying on Net (otherwise it would be the same);
> I suspect that in the Python layer, Boost.Python does some magic that requires the GIL for passing the Python object arguments to the C++ code, since I get a segfault if I acquire the GIL in PythonLayer right before calling the inner forward. The GIL needs to be acquired before calling the forward method of Python layers.
>
> Thus, even if releasing the GIL within the Net is not clean, since Pythonish code is inserted into the C++ logic, it looked like the only viable solution. But if you have any ideas on how to circumvent these problems, I will be happy to give them a try.

@alessandroferrari I am not sure I completely understand what you are saying here. Have you tried something like:

Using:

// RAII guard: releases the GIL on construction and restores it on destruction,
// so the GIL is reacquired even if an exception propagates out of the pass.
class ReleasePyGIL {
 public:
  ReleasePyGIL() { state = PyEval_SaveThread(); }
  ~ReleasePyGIL() { PyEval_RestoreThread(state); }
 private:
  PyThreadState* state;
};

inside ForwardFromTo and BackwardFromTo and

// RAII guard: reacquires the GIL while a Python layer callback runs.
class AcquirePyGIL {
 public:
  AcquirePyGIL() { state = PyGILState_Ensure(); }
  ~AcquirePyGIL() { PyGILState_Release(state); }
 private:
  PyGILState_STATE state;
};

inside PythonLayer::LayerSetUp, PythonLayer::Reshape, etc.

This works flawlessly for me on Windows.
