refactor: Integrate count and sum feat sq into the interaction generation routine #2987
Conversation
vowpalwabbit/gd_mf.cc
ec.num_features -=
    ec.feature_space[static_cast<int>(i[0])].size() * ec.feature_space[static_cast<int>(i[1])].size();
ec.num_features += ec.feature_space[static_cast<int>(i[0])].size() * d.rank;
ec.num_features += ec.feature_space[static_cast<int>(i[1])].size() * d.rank;
ec.num_features_from_interactions +=
    ec.feature_space[static_cast<size_t>(i[0])].size() * ec.feature_space[static_cast<size_t>(i[1])].size();
this is calculated on line 107 too
Yeah, gd_mf does its interactions manually using quadratics only and does some weird stuff with feature counting. It's not clear to me why it subtracts that value. I can calculate it once and reuse the value, though.
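A minimal sketch of the "calculate once and reuse" idea (`fake_example` and `count_quadratic` are illustrative names, not VW's actual API; the real code reads the sizes out of `ec.feature_space`):

```cpp
#include <cstddef>

// Sketch: compute the quadratic pair's feature product once and reuse it,
// instead of recomputing size(left) * size(right) in two places.
struct fake_example
{
  size_t num_features = 0;
  size_t num_features_from_interactions = 0;
};

void count_quadratic(fake_example& ec, size_t left_size, size_t right_size, size_t rank)
{
  const size_t pair_size = left_size * right_size;  // computed once
  ec.num_features -= pair_size;
  ec.num_features += left_size * rank;
  ec.num_features += right_size * rank;
  ec.num_features_from_interactions += pair_size;  // reused here
}
```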
finished run
number of examples = 4
weighted example sum = 4.000000
weighted label sum = 0.000000
average loss = 0.000000
total feature number = 32
total feature number = 48
What is the explanation behind these two counters? For example, how does one arrive at 8 or 12 features in this one example?
ccb shared |User b
ccb action |Action d
ccb action |Action e
ccb slot 0:0:0.2 |Slot h
For multi_ex, most of the time it is wrong at the moment; it is just an approximation. This is because the multi_ex abstraction and the output example didn't really solve how to count features. The feature count reporting should happen at the base learner.
For that example: the slot is merged into shared, which is in turn merged into each action, which is then sent down the stack. There are additionally the constant feature and the slot feature, which are hidden as well but do contribute to the count.
Adding --noconstant to this cmd halves the total features for that line (in this case to 4), so it's probably just counting d, e, b, h.
total_sum_feat_sq_calculated = false;
}

friend void VW::copy_example_data(example* dst, const example* src);
Instead of making these friends, could we add copy methods/setters for the private members?
I toyed with copy methods for example a while ago, but there are multiple different versions: with and without labels, metadata only, etc. So I opted to just let the existing one have access rather than try to encapsulate it.
Yeah, I agree on copy_example. It makes sense for it to be a friend, and if we keep using the permutation we can think about making it public in the future.
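For reference, the friend approach keeps the counters private while granting exactly one free function write access, so no general setters leak out. A simplified illustration (this is not VW's actual example type, just the pattern):

```cpp
// Sketch of the friend pattern: only copy_example_data may touch the
// private cached state; there is no public setter.
class example_t
{
public:
  explicit example_t(float sum = 0.f) : _total_sum_feat_sq(sum) {}
  float get_total_sum_feat_sq() const { return _total_sum_feat_sq; }

  friend void copy_example_data(example_t* dst, const example_t* src);

private:
  float _total_sum_feat_sq;
  bool _total_sum_feat_sq_calculated = false;
};

void copy_example_data(example_t* dst, const example_t* src)
{
  // Allowed despite the members being private, because of the friend declaration.
  dst->_total_sum_feat_sq = src->_total_sum_feat_sq;
  dst->_total_sum_feat_sq_calculated = src->_total_sum_feat_sq_calculated;
}
```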
}
ec.num_features_from_interactions = num_features_from_interactions;
Should this assignment of num_features_from_interactions be happening at the spot where they are calculated (since the example is also passed down)? I am worried that we are going to forget that this is needed at some future point when we call foreach or gd::inline_predict.
Valid concern, I thought the same. But I think there are situations where foreach_feature is called and you don't want to set num_features_from_interactions. Additionally, it prevents examples from being const where they otherwise could be.
Do you think it would be worth wrapping those calls?
(a) Where examples can be const, there is a non-const overload that calls the const version and then sets the number of features on the example, and
(b) if a foreach call doesn't want to set the number of features, we could again overload and just not set the example with the result.
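A rough sketch of the wrapper idea in (a), with purely illustrative names and signatures (not a proposal for the actual API):

```cpp
#include <cstddef>

// Illustrative only: a const "compute" function plus a non-const wrapper
// that stores the resulting count back on the example.
struct fake_example
{
  size_t num_features_from_interactions = 0;
};

size_t generate_interactions(const fake_example& /*ec*/)
{
  // ... iterate interactions; return how many features were generated.
  return 12;  // placeholder result for the sketch
}

// Non-const overload: delegates to the const version, then records the count.
size_t generate_interactions(fake_example& ec)
{
  const size_t n = generate_interactions(static_cast<const fake_example&>(ec));
  ec.num_features_from_interactions = n;
  return n;
}
```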
The feature count is a very non-critical operation, and it's only used as a measure of work. If it's missed, really all that happens is the progressive validation log will show fewer features. There are already quite a few overloads of foreach_feature, and this PR is adding more overloads there. I don't think it would be worth it.
float partial_prediction = 0.f;  // shared data for prediction.
float updated_prediction = 0.f;  // estimated post-update prediction.
float loss = 0.f;
float total_sum_feat_sq = 0.f;  // precomputed, cause it's kind of fast & easy.

float total_sum_feat_sq = 0.f;
Can you add a comment here about how this value is used and what will invalidate it? I see it getting reset a lot in this PR and I don't understand why places that used to update the value now reset it.
Anything that used to change it now just becomes a cache invalidation
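A small sketch of that cache-invalidation pattern (simplified, not the actual example members): writers only flip a flag, and the sum is recomputed lazily on the next read.

```cpp
#include <vector>

// Sketch of a lazily recomputed cache: mutations invalidate, reads recompute.
class cached_sum
{
public:
  void add_feature_sq(float x_sq)
  {
    _values.push_back(x_sq);
    _calculated = false;  // any mutation just invalidates the cache
  }

  float total_sum_feat_sq()
  {
    if (!_calculated)
    {
      _total = 0.f;
      for (float v : _values) { _total += v; }
      _calculated = true;
    }
    return _total;
  }

private:
  std::vector<float> _values;
  float _total = 0.f;
  bool _calculated = false;
};
```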
Added comment
@@ -149,6 +150,7 @@ inline void generate_interactions(namespace_interactions& interactions, bool per
auto begin = second.audit_cbegin();
if (same_namespace) { begin += (PROCESS_SELF_INTERACTIONS(ft_value)) ? i : i + 1; }
auto end = second.audit_cend();
num_features += std::distance(begin, end);
It looks like num_features isn't used by the callers all the time. Is it better to compute the distance every time than to have num_features be an optional input?
I'm assuming the distance calls are relatively expensive since the iterators are more complex than raw pointers
The std::distance call here is constant time (and very cheap) since the iterators are random access iterators. The addition of this extra logic hasn't affected the benchmarks in a meaningful way. Making it optional would require a branch instead of simple arithmetic.
The other option is an overload without this out parameter. But that results in a lot of code duplication with no noticeable difference in perf.
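For reference, std::distance dispatches on the iterator category: for random access iterators it is a single subtraction, while for weaker categories it has to walk the range. A minimal illustration:

```cpp
#include <iterator>
#include <list>
#include <vector>

// Random access iterators (e.g. std::vector): O(1), equivalent to end - begin.
long vector_length(const std::vector<int>& v) { return std::distance(v.cbegin(), v.cend()); }

// Bidirectional iterators (e.g. std::list): std::distance must walk, O(n).
long list_length(const std::list<int>& l) { return std::distance(l.cbegin(), l.cend()); }
```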
@@ -225,7 +223,7 @@ void predict_or_learn_multi(nn& n, single_learner& base, example& ec)

CONVERSE:  // That's right, I'm using goto. So sue me.

n.output_layer.total_sum_feat_sq = 1;
n.output_layer.reset_total_sum_feat_sq();
The original code sets the sum_feat_sq to 1 while the function sets it to 0. Is that ok?
Yes, since it will get calculated from scratch when needed, and that 0 won't make it out.
LGTM: my understanding is that the calculation of interaction features is still evolving (especially for ccb) and that it isn't always exact right now. That is OK since we are moving towards correct calculations (especially compared to before).
This change updates the way features are counted and the way the sum of features squared is calculated. For some multi_ex reductions, ccb and slates in particular, the number of features reported to the logger is incorrect. It was incorrect before this change, and this change doesn't fix it. The way features are counted needs to be updated for the multi_ex abstraction.