Shogun base classes #2880

karlnapf · 2015-08-03T12:38:17Z

Shall we add another one in the layer

A minimal base class CSGObject that just allows for reference counting and all the basic Shogun stuff.
A second one CSerializableObject that adds all the parameter stuff and inherits from the first?

I guess this could minimize Shoguns overhead quite a bit (many classes dont use parameters)

@yorkerlin @lisitsyn @sonney2k @vigsterkr @lambday @iglesias @tklein23 @besser82 what are your thoughts?

The text was updated successfully, but these errors were encountered:

karlnapf · 2015-08-03T13:01:58Z

Could also add another layer that provides all the sg_* fields

lisitsyn · 2015-08-03T20:43:42Z

We would have to use multiple inheritance, it could be PITA you know :)

karlnapf · 2015-08-05T08:43:01Z

Why multiple inheritance? That's not my idea.

If the base classes are a hierarchy, one could just inherit from a certain stage in there?

sonney2k · 2015-08-05T09:00:24Z

I did add SGReferencedData (or so) at some point that did exactly that.
I guess it did get removed for some reason by someone who knows why that
was maybe not a good idea?

On Wed, 2015-08-05 at 01:43 -0700, Heiko Strathmann wrote:

Why multiple inheritance? That's not my idea.

If the base classes are a hierarchy, one could just inherit from a
certain stage in there?

—
Reply to this email directly or view it on GitHub.

karlnapf · 2015-08-05T10:54:06Z

Thats very similar, but for the SGMatrix, SGVector classes.
Exactly that type of thing is what I want to do for the base class CSGObject.

karlnapf · 2015-08-05T11:12:31Z

What about this:

CReferencedObject -> CSerialisableObject -> CSGObject

Where the first one contains only the reference counting, the second adds the parameters/serialisation, and the third adds all the Shogun stuff

sonney2k · 2015-08-05T12:05:41Z

checkout

9305aa9

and look at src/shogun/base/SGRefObject.h

On Wed, 2015-08-05 at 04:12 -0700, Heiko Strathmann wrote:

What about this:

CReferencedObject -> CSerialisableObject -> CSGObject

Where the first one contains only the reference counting, the second adds the parameters/serialisation, and the third adds all the Shogun stuff

Reply to this email directly or view it on GitHub:
#2880 (comment)

lisitsyn · 2015-08-05T16:57:08Z

@karlnapf okay this works. Though we don't really know whether referenced is always serializable or serializable is always referenced :)

karlnapf · 2015-08-06T10:29:15Z

I see now. This is good enough for @yorkerlin s patch. Thanks I did not know

iglesias · 2015-08-10T18:14:21Z

The idea makes sense to me, and I like it because it would separate in each of the classes Heiko has mentioned above reference counting, serialisation, and rest of Shogun stuff (although I am not sure what this rest would be :-)

I am not entirely sure however how much overhead it would reduce. What overhead are we talking about? SWIG or memory footprint per SGObject?

Another (very) wild idea somewhat related would be to use smart pointers as they are now part of the default C++ compiler. Then we could get rid of our own reference counting.

karlnapf · 2015-08-10T21:29:58Z

yeah smart pointers would be best.

@lisitsyn what s your take on all this?

making the base class overhead much smaller
serialisation
reference counting?

karlnapf · 2015-08-10T21:30:19Z

@sonney2k why was the SGRefObject removed?

sonney2k · 2015-08-11T05:52:09Z

Do not remember. Git logs say that Thorsten removed it. "Obsolete ".

On August 10, 2015 11:30:21 PM GMT+02:00, Heiko Strathmann notifications@github.com wrote:

@sonney2k why was the SGRefObject removed?

Reply to this email directly or view it on GitHub:
#2880 (comment)

Sent from Kaiten Mail. Please excuse my brevity.

iglesias · 2015-08-11T06:50:04Z

The class was removed during the last summit when we were reducing SWIG overhead, in terms of memory used to compile or, equivalently, the size of the (huge) file SWIG creates. Before it was removed, it looked like this SGRefObject -> CSGObject, and all the classes were inheriting from CSGObject, not from SGRefObject.

Although it should be possible to use SGRefOject for what you have mentioned above, it was not used like that. It was just pulling out the reference counting logic from CSGObject. Thoralf removed it to reduce the number of classes (by one, in this case) that are exposed via SWIG.

iglesias · 2015-08-11T06:54:35Z

#2581

lisitsyn · 2015-08-11T07:00:50Z

@karlnapf

making the base class overhead much smaller

I like the principle, although it is not really clear for me what to remove.

serialisation

I think it should stay as long as we are rare ML library with this feature.

reference counting?

I believe it should be done via shared pointers.

karlnapf · 2015-08-11T20:43:46Z

I dont get the point of removing the class. We could just hide it from SWIG?
If one class contains what two classes used to contain, nothing is really gained. We could revert that?

@lisitsyn

remove migration at least
cant we use a library for serialisation? why re-invent the wheel there (and we do it badly, its slow as f***)
+1 for shared pointers

iglesias · 2015-08-12T05:01:30Z

There is more dicussion about it here #1853 #1764.

Reverting is easy peasy with git revert #2883. But I suggest not to merge this until we have checked the above pull requests.

iglesias · 2015-08-12T05:08:12Z

Hmm, it seems that SGRefObject was not introduced due to what @sonney2k pointed out above: #1770. Still, I believe it should be possible to use it for what we were talking about.

iglesias · 2015-08-12T05:20:20Z

This is the pr where it was merged #1771.

yorkerlin · 2015-08-22T14:01:48Z

@lisitsyn
@lambday
Do you guys plan to rewrite SGObject?
Do we use the existing parameter selection (model selection) framework? The existing model selection framework is not flexible.
Currently, the existing framework searches for all model parameters since class Parameter only supports add but does not support remove

The existing framework only supports:

const/fixed parameters
model parameters (need to be "optimized" in model selection)

I want the framework supports variational parameters (need to be optimized but are not model parameters)

In future, I may have to extend the framework in the following ways:

Enable/Disable a specific model parameter in model selection ( my first priority)
support variational parameters
support Bayesian model selection (eg, @karlnapf Bayesian optimization for hyper-parameters using GP)

lambday · 2015-08-23T06:54:36Z

Hi @yorkerlin . If we in fact happen to rewrite SGObject, probably quite a few things would change. @lisitsyn proposed having a class Property for each parameter which fits nicely and intuitively for model selection. In this scenario, I think we'd be able to implement the enable/disable feature you're suggesting quite easily.

I am unaware of variational parameters ATM. If they are expected to be optimized differently, maybe we can have a separate class for them and provide its functionality accordingly.

yorkerlin · 2015-08-23T16:05:50Z

@lambday
Cool!
variational parameters are auxiliary variables.

For batch inference,

step1: given model parameters \theta^{t}
step2: we usually optimize variational parameters until converge given \theta^{t} are fixed
step3: then update model parameters \theta^{t+1} and go back to step 1 until some conditions meet

For stochastic inference,We can update model parameters and variational parameters at the same time.

step1: draw sample data point(s)
step2: update model parameters and variational parameters and go back to step 1 until some conditions meet

For stochastic inference, variaitonal parameters can be viewed as model parameters.
Enable/Disable some parameters will be the key!

yorkerlin · 2015-08-25T03:31:02Z

CMap does not support serialization. However, CMap is a subclass of SGObject.
At least I want CStringMap (that is CMap<std::string, SGVector<float64_t> >) is serialable.

ref
#2903

lambday · 2015-08-25T09:45:05Z

@yorkerlin from the requirements, going by policy based design for the parameters sound reasonable to me, where the policies are different optimization policies. So laying down the requirements

We can use Parameter or Property class for normal/fixed parameters. Present TParameter and Parameter can be an inspiration for that. Getting rid of the ADD things would be nice. We don't have to specify MS_AVAILABLE or MS_NOT_AVAILABLE every time.
We need another ModelParameter class for parameters that participate in model selection. This can be a policy based class with default policy being the present model selection scenario. For variational parameters we'll use a separate policy implementing the optimization technique for them as required. ModelParameter would have enable/disable feature. In general I like policies more since they go well with combinatorial scenario without having to enforce a is-a-relationship .
We'll use pimpl [1][2] pattern the classes which helps the software be backward compatible and also reduce significant compilation time. Header inclusion should be minimal and we have to take that extra care. All the data would go inside the Impl classes and eventually will be saved inside the parameter_map in SGObjectImpl. Getting rid of unnecessary setters/getters would also be nice [3][4][5]

[1] http://www.gotw.ca/publications/mill04.htm
[2] http://www.gotw.ca/publications/mill05.htm
[3] http://stackoverflow.com/questions/8447972/how-to-combine-auto-gettersetter-with-pimpl-design-pattern-in-a-public-api-inte#
[4] http://www.idinews.com/quasiClass.pdf
[5] http://www.javaworld.com/article/2073723/core-java/why-getter-and-setter-methods-are-evil.html

yorkerlin · 2015-08-29T13:29:33Z

@lambday
@lisitsyn
some old issues related to this
#1251
#779

lambday · 2015-08-31T09:05:45Z

Thanks @yorkerlin for tagging the related issues.

karlnapf added the Tag: Discussion label Aug 3, 2015

karlnapf mentioned this issue Aug 3, 2015

the minimizer framework #2876

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shogun base classes #2880

Shogun base classes #2880

karlnapf commented Aug 3, 2015

karlnapf commented Aug 3, 2015

lisitsyn commented Aug 3, 2015

karlnapf commented Aug 5, 2015

sonney2k commented Aug 5, 2015

karlnapf commented Aug 5, 2015

karlnapf commented Aug 5, 2015

sonney2k commented Aug 5, 2015

lisitsyn commented Aug 5, 2015

karlnapf commented Aug 6, 2015

iglesias commented Aug 10, 2015

karlnapf commented Aug 10, 2015

karlnapf commented Aug 10, 2015

sonney2k commented Aug 11, 2015

iglesias commented Aug 11, 2015

iglesias commented Aug 11, 2015

lisitsyn commented Aug 11, 2015

karlnapf commented Aug 11, 2015

iglesias commented Aug 12, 2015

iglesias commented Aug 12, 2015

iglesias commented Aug 12, 2015

yorkerlin commented Aug 22, 2015

lambday commented Aug 23, 2015

yorkerlin commented Aug 23, 2015

yorkerlin commented Aug 25, 2015

lambday commented Aug 25, 2015

yorkerlin commented Aug 29, 2015

lambday commented Aug 31, 2015

Shogun base classes #2880

Shogun base classes #2880

Comments

karlnapf commented Aug 3, 2015

karlnapf commented Aug 3, 2015

lisitsyn commented Aug 3, 2015

karlnapf commented Aug 5, 2015

sonney2k commented Aug 5, 2015

karlnapf commented Aug 5, 2015

karlnapf commented Aug 5, 2015

sonney2k commented Aug 5, 2015

lisitsyn commented Aug 5, 2015

karlnapf commented Aug 6, 2015

iglesias commented Aug 10, 2015

karlnapf commented Aug 10, 2015

karlnapf commented Aug 10, 2015

sonney2k commented Aug 11, 2015

iglesias commented Aug 11, 2015

iglesias commented Aug 11, 2015

lisitsyn commented Aug 11, 2015

karlnapf commented Aug 11, 2015

iglesias commented Aug 12, 2015

iglesias commented Aug 12, 2015

iglesias commented Aug 12, 2015

yorkerlin commented Aug 22, 2015

lambday commented Aug 23, 2015

yorkerlin commented Aug 23, 2015

yorkerlin commented Aug 25, 2015

lambday commented Aug 25, 2015

yorkerlin commented Aug 29, 2015

lambday commented Aug 31, 2015