SGVector cleanup. #2582

iglesias · 2014-10-27T06:56:02Z

Remove all the methods producing clutter in SGVector. This includes (although it is not limited to) math operations like linspace, dot, and other operations like find. The objective is to have an SGVector as lean as possible.

Systematically, the work to be done for each method reduced is:

Move the method from SGVector to another class, like for instance to CMath or to whichever it makes sense the most.
Refactor Shogun's internal code.
Refactor examples (libshogun and interface examples, if any).
In case the method already had unit tests in SGVector, then refactor them (and move them). Otherwise, write some good unit tests.

See this pull request to gain understanding of what this task consists of, #2579.

The text was updated successfully, but these errors were encountered:

iglesias · 2014-10-28T09:14:30Z

I forgot to mention that it is important to do this separately, both to ease the review and the cleanup process. That means that the best practice would be to send small and individual pull requests for each method.

iglesias · 2014-10-28T09:19:22Z

See the wiki for more motivation why we want to do this.

sanuj · 2014-12-08T17:08:20Z

@iglesias, I would like to work on this. Shall I start by moving dot to CMath? Should I do more changes in the first pull request?

iglesias · 2014-12-08T18:04:06Z

According to what I wrote above, moving dot to CMath sounds good :-)

I think that doing that one alone in a first pull request is good.

sanuj · 2014-12-13T06:17:19Z

@iglesias I'll leave the functions related to linear algebra (we might want to move them to linalg). So I'll leave functions like add, sum, norm, add_scalar, product, vector_multiply etc for now while we are working on linalg NATIVE implementations. Should I meanwhile move functions like max, min, mean, arg_max, arg_min, max_abs to CMath? Don't know where to put linspace and sorting functions.

iglesias · 2014-12-15T09:25:49Z

Sure. In my opinion, max, min, argmax, argmin, and linspace make sense in CMath. Mean might make more sense in CStatistics. About sorting I am not sure, I leave it up to you :-) But there might be already some sorting implemented in CMath (iirc).

sanuj · 2014-12-15T17:44:10Z

I am a bit confused. As far as I know, the declaration and definition of a template should be kept in one single file, but if we take the case of SGVector, then it has a header file and a cpp file. How is this working out?

iglesias · 2014-12-15T18:56:48Z

We do a small trick. Look at the end of SGVector.cpp. There, we have defined the classes for which SGVector can be templated (which are basically primitive data types).

sanuj · 2014-12-17T17:42:03Z

There is SGVector::linspace_vec() which returns a vector, SGVector::linspace() which returns a float64_t * and depends on CMath::linspace which returns void (passes the output through an input parameter pointer). None of them are used anywhere apart from SGVector and CMath. What to do with these?

iglesias · 2014-12-17T18:24:41Z

If they are not used anywhere, I'd say we can drop them. We can for sure drop the ones in SGVector. The CMath one can stay.

sanuj · 2014-12-22T15:44:28Z

There are three functions related to sorting in SGVector: qsort(), argsort() and is_sorted()
qsort() looks like this

template <class T>
void SGVector<T>::qsort()
{
    CMath::qsort<T>(vector, vlen);
}

So if we move qsort to CMath then we will have to pass the vector as an argument or we can remove SGVector<T>::qsort entirely and use the one mentioned in the above patch but in this case we'll need to pass the length as well.

Should I also move argsort? (moving is_sorted() doesn't make sense to me)

iglesias · 2014-12-22T23:02:59Z

In my opinion, void CMath::qsort(SGVector<T>), SGVector<index_t> CMath::argsort(SGVector), and bool CMath::is_sorted(SGVector<T>) make sense.

curiousguy13 · 2015-02-14T11:18:38Z

@iglesias would it be fine , if I moved range_fill to CMath ?

iglesias · 2015-02-14T17:54:49Z

@curiousguy13, it sounds good to me, both range_fill and range_fill_vector should go out of SGVector. Do something nice with them (perhaps, simplifying to only one method) and move to CMath. Keep in mind unit tests. Looking forward to see the pull request.

lambday · 2015-02-18T09:26:17Z

@iglesias I think range_fill, zeros, these are perfect to be shifted to linalg instead.

iglesias · 2015-02-19T18:47:35Z

It sounds good!
On 18 Feb 2015 10:26, "Soumyajit De" notifications@github.com wrote:

@iglesias https://github.com/iglesias I think range_fill, zeros, these
are perfect to be shifted to linalg instead.

—
Reply to this email directly or view it on GitHub
#2582 (comment)
.

Hephaestus12 · 2020-03-21T22:22:23Z

I would like to work on this. Shall I start by moving add to CMath?

gf712 · 2020-03-22T07:10:11Z

I would like to work on this. Shall I start by moving add to CMath?

You shouldn’t move to CMath but to linalg instead. However linalg already has add. But it would be good to remove add from SGVector. AFAIK CMath (now Math) should be dropped.

Hephaestus12 · 2020-03-23T13:16:30Z

This is for the method add() in SGVector.
I don't think simply removing it and replacing its calls with the add method in the Linalg framework will help. For the following reasons:

In file: src/shogun/mathematics/linalg/backend/eigen/BasicOps.cpp :

#define BACKEND_GENERIC_IN_PLACE_ADD(Type, Container)                          \
	void LinalgBackendEigen::add(                                              \
	    const Container<Type>& a, const Container<Type>& b, Type alpha,        \
	    Type beta, Container<Type>& result) const                              \
	{                                                                          \
		add_impl(a, b, alpha, beta, result);                                   \
	}

add() calls add_impl()

In file: src/shogun/mathematics/linalg/backend/eigen/BasicOps.cpp :

template <typename T>
void LinalgBackendEigen::add_impl(
    const SGVector<T>& a, const SGVector<T>& b, T alpha, T beta,
    SGVector<T>& result) const
{
	typename SGVector<T>::EigenVectorXtMap a_eig = a;
	typename SGVector<T>::EigenVectorXtMap b_eig = b;
	typename SGVector<T>::EigenVectorXtMap result_eig = result;

	result_eig = alpha * a_eig + beta * b_eig;
}

add_impl() uses the operator+

In file: src/shogun/lib/SGVector.cpp :

/** addition operator */
template<class T>
SGVector<T> SGVector<T>::operator+ (SGVector<T> x)
{
	assert_on_cpu();
	require(x.vector && vector, "Addition possible for only non-null vectors.");
	require(x.vlen == vlen, "Length of the two vectors to be added should be same. [V({}) + V({})]", vlen, x.vlen);

	SGVector<T> result=clone();
	result.add(x);
	return result;
}

The operator+ uses SGVector's add() method.

So indirectly, even linalg's add() method uses SGVector's add() method.
Will we need to entirely refactor the linalg framework code for this?

gf712 · 2020-03-23T13:40:11Z

hmm, I think you are confusing things. operator+ in linalg is from Eigen, which uses SIMD instructions where possible.
result_eig = alpha * a_eig + beta * b_eig; is written with Eigen types. See the definition of EigenVectorXtMap in SGVector

Hephaestus12 · 2020-03-24T23:36:01Z

Is there a method in linalg to add a dense vector to a sparse vector and/or add two sparse vectors?

Hephaestus12 · 2020-03-24T23:40:12Z

Is there a method in linalg to add a dense vector to a sparse vector and/or add two sparse vectors?
Or will we need to write one ourselves?
@gf712

karlnapf · 2020-03-25T18:27:10Z

don't think there is atm.

iglesias added the good first issue label Oct 27, 2014

iglesias added the Tag: Cleanup label Oct 28, 2014

lambday mentioned this issue Jan 15, 2015

Added Shogun's native dot impl to linalg under NATIVE backend #2672

Merged

curiousguy13 mentioned this issue Feb 18, 2015

move range_fill from SGVector to CMath #2721

Closed

lambday mentioned this issue Feb 20, 2015

Add scale and add to linalg #2717

Closed

vigsterkr added this to the Shogun 6.2.0 milestone Dec 14, 2017

vigsterkr modified the milestones: Shogun 6.2.0, Shogun 7.0.0 May 22, 2018

Hephaestus12 mentioned this issue Mar 24, 2020

remove add and operator+ from SGVector and replace with linalg::add calls #4958

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SGVector cleanup. #2582

SGVector cleanup. #2582

iglesias commented Oct 27, 2014

iglesias commented Oct 28, 2014

iglesias commented Oct 28, 2014

sanuj commented Dec 8, 2014

iglesias commented Dec 8, 2014

sanuj commented Dec 13, 2014

iglesias commented Dec 15, 2014

sanuj commented Dec 15, 2014

iglesias commented Dec 15, 2014

sanuj commented Dec 17, 2014

iglesias commented Dec 17, 2014

sanuj commented Dec 22, 2014

iglesias commented Dec 22, 2014

curiousguy13 commented Feb 14, 2015

iglesias commented Feb 14, 2015

lambday commented Feb 18, 2015

iglesias commented Feb 19, 2015

Hephaestus12 commented Mar 21, 2020

gf712 commented Mar 22, 2020

Hephaestus12 commented Mar 23, 2020

gf712 commented Mar 23, 2020 •

edited

Hephaestus12 commented Mar 24, 2020

Hephaestus12 commented Mar 24, 2020

karlnapf commented Mar 25, 2020

SGVector cleanup. #2582

SGVector cleanup. #2582

Comments

iglesias commented Oct 27, 2014

iglesias commented Oct 28, 2014

iglesias commented Oct 28, 2014

sanuj commented Dec 8, 2014

iglesias commented Dec 8, 2014

sanuj commented Dec 13, 2014

iglesias commented Dec 15, 2014

sanuj commented Dec 15, 2014

iglesias commented Dec 15, 2014

sanuj commented Dec 17, 2014

iglesias commented Dec 17, 2014

sanuj commented Dec 22, 2014

iglesias commented Dec 22, 2014

curiousguy13 commented Feb 14, 2015

iglesias commented Feb 14, 2015

lambday commented Feb 18, 2015

iglesias commented Feb 19, 2015

Hephaestus12 commented Mar 21, 2020

gf712 commented Mar 22, 2020

Hephaestus12 commented Mar 23, 2020

gf712 commented Mar 23, 2020 • edited

Hephaestus12 commented Mar 24, 2020

Hephaestus12 commented Mar 24, 2020

karlnapf commented Mar 25, 2020

gf712 commented Mar 23, 2020 •

edited