💃 🥤 Extract interaction function from models #107

mberr · 2020-10-15T06:51:05Z

This is a replacement for #88, where the merge target is master.

@mali-git As discussed in today's call, I tried to draft an API for the interaction function. It is built around the most generic form of an interaction function, which has one batch dimension and allows broadcasting over multiple entities/relations. This covers use cases such as scoring all tail entities at once, but also supports, e.g., full CWA scores.
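To illustrate the broadcasting idea, here is a minimal sketch, not the code from this PR. It uses NumPy as a stand-in for PyTorch tensors, picks DistMult as the interaction, and assumes a shape convention (`(batch, num_heads, dim)` etc.) as well as the names `distmult_interaction`, `E`, `R` and `hr_batch`, all of which are made up for this example.

```python
import numpy as np


def distmult_interaction(h, r, t):
    """Score every (head, relation, tail) combination within each batch element.

    Assumed shapes (hypothetical convention, not taken from the PR):
      h: (batch, num_heads, dim)
      r: (batch, num_relations, dim)
      t: (batch, num_tails, dim)
    A batch dimension of size 1 is broadcast against the others.
    Returns scores of shape (batch, num_heads, num_relations, num_tails).
    """
    product = (
        h[:, :, None, None, :]
        * r[:, None, :, None, :]
        * t[:, None, None, :, :]
    )
    return product.sum(axis=-1)


rng = np.random.default_rng(0)
num_entities, num_relations, dim = 5, 3, 4
E = rng.normal(size=(num_entities, dim))   # entity vectors
R = rng.normal(size=(num_relations, dim))  # relation vectors

# Use case 1: score all tails for a batch of (head, relation) pairs.
hr_batch = np.array([[0, 1], [2, 0]])
h = E[hr_batch[:, 0]][:, None, :]  # (2, 1, dim)
r = R[hr_batch[:, 1]][:, None, :]  # (2, 1, dim)
t = E[None, :, :]                  # (1, num_entities, dim), shared by the batch
scores_t = distmult_interaction(h, r, t)[:, 0, 0, :]  # (2, num_entities)

# Use case 2: full CWA scores for every (h, r, t) combination at once.
scores_all = distmult_interaction(E[None], R[None], E[None])[0]
print(scores_t.shape, scores_all.shape)
```

Both use cases go through the same function; only the shapes of the inputs differ.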

This can likely also help for the fast LCWA @lvermue once envisioned 😉

I did not make the methods static, so that parametric interaction functions with their own weights, e.g. ER-MLP, are possible.

Overview

- One shared implementation for score_hrt / score_h / score_r / score_t in the base class, done in InteractionFunction.
- One shared implementation of _score for all models sharing the same set of embeddings (e.g. TransE/DistMult/ERMLP -> one vector for each entity/relation, TransH -> additional vector for each entity, etc.)
- A stateless functional form of the interaction function where all necessary state is passed in from the outside. This is done in pykeen.nn.functional.
- A stateful implementation of the interaction function encapsulating all shared parameters (e.g. weight matrices for ERMLP / ConvE, etc.), but delegating the actual interaction to the stateless version. This is done in pykeen.nn.modules.
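The split between a stateless functional form and a stateful wrapper can be sketched as follows. This is a simplified ER-MLP-style example under stated assumptions: NumPy replaces PyTorch, the layer layout (one ReLU hidden layer on the concatenated vectors) is a simplification, and the names `ermlp_interaction` and `ERMLPInteraction` as well as the weight attribute names are hypothetical rather than the PR's actual signatures.

```python
import numpy as np


def ermlp_interaction(h, r, t, w_hidden, b_hidden, w_out, b_out):
    """Stateless form: every parameter is passed in from the outside."""
    x = np.concatenate([h, r, t], axis=-1)             # (..., 3 * dim)
    hidden = np.maximum(x @ w_hidden + b_hidden, 0.0)  # ReLU hidden layer
    return (hidden @ w_out + b_out)[..., 0]            # (...,)


class ERMLPInteraction:
    """Stateful form: owns the weights, delegates the computation."""

    def __init__(self, dim, hidden_dim, seed=0):
        rng = np.random.default_rng(seed)
        self.w_hidden = rng.normal(size=(3 * dim, hidden_dim))
        self.b_hidden = np.zeros(hidden_dim)
        self.w_out = rng.normal(size=(hidden_dim, 1))
        self.b_out = np.zeros(1)

    def __call__(self, h, r, t):
        # Non-static on purpose: the weights live on the instance.
        return ermlp_interaction(
            h, r, t, self.w_hidden, self.b_hidden, self.w_out, self.b_out
        )


rng = np.random.default_rng(1)
h, r, t = (rng.normal(size=(2, 4)) for _ in range(3))
interaction = ERMLPInteraction(dim=4, hidden_dim=8)
scores = interaction(h, r, t)
print(scores.shape)
```

The functional form stays easy to test in isolation, while the wrapper is what a model would hold on to.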

Tasks:

- What to do with interaction models where we have more than one vector for an entity/relation, such as e.g. TransH?
- Re-introduce slicing (ideally at the generic level)
- Re-introduce regularization at the generic level
- Fix R-GCN (or keep it broken for 📈 ☠️ Improve R-GCN #110?)
- Add regularizer/constrainer directly to modules, recursively search for modules having regularize and accumulate value.
- Update to reshape in the generic Interaction
- ~~[ ] Update pipeline model composition~~ bumped to 🪢 🤗 Expose interactions and representations via pipeline #163

Dependencies:

Improve checking for abstract classes and applying post init hooks #137

mberr · 2020-10-15T06:55:27Z

TODO: What to do with interaction models where we have more than one vector for an entity/relation, such as e.g. TransH?

mberr · 2020-10-15T07:23:58Z

@cthoyt I used the interaction function abstraction to extract a common base class for DistMult, ComplEx and ER-MLP, since they share two properties:

- they have one vector representation for each entity and relation
- their interaction function is f(h, r, t) for an appropriate f.

The class name is preliminary ~~, and, for now, it resides in the Complex model's file.~~ It now lives in base.py.

lvermue · 2020-10-31T15:28:08Z

@mberr
What is the intended distinction between the interaction function and the model?
Is that just because of R-GCN? Right now it seems to me that the explicit model interaction function doesn't offer any generalization and is always "only" handed to the respective model class.

mberr · 2020-10-31T16:48:05Z

> @mberr
> What is the intended distinction between the interaction function and the model?
> Is that just because of R-GCN? Right now it seems to me that the explicit model interaction function doesn't offer any generalization and is always "only" handed to the respective model class.

To quote from this paper 😉

> Here, we define a KGEM as four components: an interaction model, a training approach, a loss function, and its usage of explicit inverse relations.

The idea is to separate storing the entity/relation representations from the interaction. This supports use cases where we want to use an interaction model as a component rather than as a stand-alone model, e.g. because we have features for entities, or another component that generates representations (e.g. based on GNNs, but we could also think of NLP models on the entities' labels, etc.).
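As a rough sketch of that separation (NumPy instead of PyTorch, TransE chosen arbitrarily as the interaction), the same scoring code can run on a plain lookup table or on representations computed from entity features. `EmbeddingTable`, `FeatureEncoder` and `score_hrt` below are hypothetical names for this example, not classes from the PR.

```python
import numpy as np


def transe_interaction(h, r, t):
    """TransE-style score: negative L2 distance between h + r and t."""
    return -np.linalg.norm(h + r - t, axis=-1)


class EmbeddingTable:
    """Representations stored as a free parameter matrix."""

    def __init__(self, num, dim, seed):
        self.weight = np.random.default_rng(seed).normal(size=(num, dim))

    def __call__(self, indices):
        return self.weight[indices]


class FeatureEncoder:
    """Representations computed from fixed per-entity features."""

    def __init__(self, features, dim, seed):
        self.features = features
        rng = np.random.default_rng(seed)
        self.projection = rng.normal(size=(features.shape[1], dim))

    def __call__(self, indices):
        return self.features[indices] @ self.projection


def score_hrt(entity_repr, relation_repr, interaction, triples):
    """The interaction only sees vectors, not where they came from."""
    h = entity_repr(triples[:, 0])
    r = relation_repr(triples[:, 1])
    t = entity_repr(triples[:, 2])
    return interaction(h, r, t)


triples = np.array([[0, 0, 1], [2, 1, 3]])
relations = EmbeddingTable(num=2, dim=4, seed=0)
lookup_entities = EmbeddingTable(num=4, dim=4, seed=1)
features = np.random.default_rng(2).normal(size=(4, 6))
encoded_entities = FeatureEncoder(features, dim=4, seed=3)

scores_lookup = score_hrt(lookup_entities, relations, transe_interaction, triples)
scores_encoded = score_hrt(encoded_entities, relations, transe_interaction, triples)
print(scores_lookup.shape, scores_encoded.shape)
```

Swapping the entity representation does not touch the interaction at all, which is the point of the split.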

In the long term this might also help to generalize e.g. the regularization choices (right now, each Model defines which parameters are regularized, and which not), or constraints, such as unit length, orthogonality, etc.

It does however also help for R-GCN 🙂

* Add base module for representations * Add reset_parameters * Add wrapper for nn.Embedding * Implement forward * Add properties * Use Embedding for EntityRelationEmbedding * Replace old factory method * Reorganize * Revert for now * Make initialization functions work * Update utils.py * Reorganize again * Fix docs * Update base.py * Fix initialization * Update RepresentationModel.forward API * Add deprecation warning * Add post_parameter_update * Add constrainer * Forward kwargs from EntityRelationEmbeddingModel * Use Embedding.init_with_device for EntityEmbeddingModel * Use new-style Embeddings for DistMult * Fix delegation of post_parameter_update * Update code style * Add unittest to check whether the model can handle custom representations * Pass flake8 * New implemntations of init functions * Fix attribute names * Add first round of upgrades for _reset_parameters_ * Switch more over * Update utils.py * Add TODOs [skip ci] * Make more general * Enable more kwargs * Fix up KG2E * Fix unittest * Simplify get_embedding_in_canonical_shape * Directly use Embedding.init_with_device * Replace deprecated usage for ConvE * Use new-style embeddings for TransD * Add in-place norm clamping * Fix unittest for TransD; remove manual score comparison * Fix docs * Forward kwargs * More generic types * More generic types, now as TypeVar * rename parameters * fix renamed parameters * Add comments * Remove deprecated .weight property * Fix ConvE's weight usage * Fix .weight usage in test_save_load_model_state * reduce code duplication in test * do not require in-place initializer * Remove manual unittest for TransR * Fix .weight usage in test_models.py * Remove post_parameter_update tests after embeddings have been exchanged * Add TODO to complex * use indices * Remove no_grad annotation, since it was causing problems with general gradient tracking * cleanup conve * cleanup convkb * cleanup distmult * Use keyword parameter, fix .weight * Use keyword parameter, fix .weight * Add 
constrainer, fix .weight * fix .weight * fix .weight * fix .weight, use initializer and constrainer * Directly use Embedding.init_with_device; fix type annotation * use new-style embeddings * use *_kwargs instead of functools.partial * Use new-style embeddings in TransE * Use new-style embeddings in TransR * Use new-style embeddings in Tucker * Use new-style embeddings in UM * Add chain_ and normalize_ utilities * Fix chain_ * Fix complex special case for hrt * fix custom representation unittest * Add missing calls to reset_parameters * Fix tests for get_embedding_in_canonical_shape * Use magicmock to check whether reset_parameters was called * hotfix RGCN nice fixes need #107 #110 * Fix block decomposition * Fix TransR * Add trailing comma * Fix line too long * Add trailing comma * Fix line-too-long * Fix wrong comparison * Remove trailing comma * fix line too long * fix line too long * fix line too long * Cosmetic improvements and remove code graveyard * Update docs [skip ci] * Extend documentation * Remove in-place variants * remove xavier_uniform_normed_ * Remove todos * fix line-too-long * Fix docstring * Remove unused imports * Revert removal of manual tests for score_hrt * Avoid overriding test_score_hrt with manual tests * fix manual test_hrt for TransR * fix manual test_hrt for TransD * Fix issue in TransR test Co-authored-by: Charles Tapley Hoyt <cthoyt@gmail.com>

mberr · 2020-11-12T10:40:57Z

@cthoyt I extracted a few functional form interaction functions, cf. https://github.com/pykeen/pykeen/compare/a735593..f0c20d6

I will wait for feedback before continuing with the other interaction functions.

A nice side effect: we could, e.g., extract some common parts between DistMult and ComplEx into _normalize_terms_for_einsum.
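To see why DistMult and ComplEx can share code at all, note that the ComplEx score, the real part of the trilinear product of h, r and the conjugate of t, expands into four real trilinear products of the same kind DistMult uses. The sketch below shows only that algebra with NumPy; it does not reproduce `_normalize_terms_for_einsum`, and `trilinear`, `distmult_interaction` and `complex_interaction` are names assumed for this example.

```python
import numpy as np


def trilinear(h, r, t):
    """Shared building block: sum_d h[b, d] * r[b, d] * t[b, d]."""
    return np.einsum("bd,bd,bd->b", h, r, t)


def distmult_interaction(h, r, t):
    return trilinear(h, r, t)


def complex_interaction(h_re, h_im, r_re, r_im, t_re, t_im):
    """Re(<h, r, conj(t)>) written as four real trilinear products."""
    return (
        trilinear(h_re, r_re, t_re)
        + trilinear(h_im, r_re, t_im)
        + trilinear(h_re, r_im, t_im)
        - trilinear(h_im, r_im, t_re)
    )


rng = np.random.default_rng(0)
h_re, h_im, r_re, r_im, t_re, t_im = (rng.normal(size=(3, 4)) for _ in range(6))

scores = complex_interaction(h_re, h_im, r_re, r_im, t_re, t_im)

# Cross-check against NumPy's native complex arithmetic.
h, r, t = h_re + 1j * h_im, r_re + 1j * r_im, t_re + 1j * t_im
reference = np.real((h * r * np.conj(t)).sum(axis=-1))
print(np.allclose(scores, reference))
```

With all imaginary parts set to zero, `complex_interaction` reduces to `distmult_interaction`.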

cthoyt · 2020-12-17T13:14:17Z

tests/test_interactions.py

+
+    cls = pykeen.nn.modules.SimplEInteraction
+
+    def _exp_score(self, h, r, t, h_inv, r_inv, t_inv, clamp) -> torch.FloatTensor:


Is this right? I thought SimplE's interaction function was

0.5 * (distmult_interaction(h, r, t) + distmult_interaction(t_inv, r_inv, h_inv))

The h and t get switched for the inverse one.
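For reference, the formula from this comment can be written out as a small NumPy sketch. It is an illustration of the suggested formula, not the test code under review; the function names and the `clamp` handling (an optional `(min, max)` pair) are assumptions made for this example.

```python
import numpy as np


def distmult_interaction(h, r, t):
    """Trilinear product summed over the embedding dimension."""
    return (h * r * t).sum(axis=-1)


def simple_interaction(h, r, t, h_inv, r_inv, t_inv, clamp=None):
    """Average of the forward score and the inverse score.

    clamp: optional (min, max) pair applied to the result
    (an assumption about what the parameter in the test signature means).
    """
    scores = 0.5 * (
        distmult_interaction(h, r, t)
        + distmult_interaction(t_inv, r_inv, h_inv)
    )
    if clamp is not None:
        scores = np.clip(scores, clamp[0], clamp[1])
    return scores


rng = np.random.default_rng(0)
h, r, t, h_inv, r_inv, t_inv = (rng.normal(size=(3, 4)) for _ in range(6))

scores = simple_interaction(h, r, t, h_inv, r_inv, t_inv)
clamped = simple_interaction(h, r, t, h_inv, r_inv, t_inv, clamp=(-0.1, 0.1))

# The elementwise product commutes, so swapping the argument order alone
# does not change the value of the inverse term.
swapped = 0.5 * (
    distmult_interaction(h, r, t) + distmult_interaction(h_inv, r_inv, t_inv)
)
print(np.allclose(scores, swapped))
```

Because the trilinear product is symmetric in its head and tail arguments, what decides the result is which vectors are passed in as `h_inv` and `t_inv`, not the order in which they appear in the call.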

mberr · 2021-03-22T15:44:57Z

@cthoyt did we finish the migration? If yes, we may close this one.

cthoyt · 2021-03-22T16:15:10Z

@mberr well, we have the major code for RGCN (might be fine now), literal models (already in the new PR), and all of the examples of the new-style models for all of the pre-existing uni-modal models

cthoyt · 2021-03-22T16:15:21Z

oops my bad, hand slipped

cthoyt · 2021-11-05T22:31:53Z

@mberr so all of the big parts have been migrated. If we ever want to start converting old-style models, we can refer to this, but it's finally time to let this one go.

mberr mentioned this pull request Oct 15, 2020

Add draft of generic interaction function #88

Closed

cthoyt reviewed Oct 17, 2020: src/pykeen/models/base.py (outdated, resolved)

mberr mentioned this pull request Nov 5, 2020

Add tutorial for custom models #127

Merged

cthoyt reviewed Nov 5, 2020: src/pykeen/models/base.py (outdated, resolved)

cthoyt reviewed Nov 5, 2020: src/pykeen/models/unimodal/complex.py (outdated, resolved)

mberr mentioned this pull request Nov 6, 2020

Modularize embeddings #132

Merged

mberr added a commit that referenced this pull request Nov 6, 2020

hotfix RGCN

9a59704

nice fixes need #107 #110

mberr mentioned this pull request Nov 7, 2020

📈 ☠️ Improve R-GCN #110

Merged

cthoyt reviewed Nov 7, 2020: src/pykeen/models/base.py (outdated, resolved)

cthoyt mentioned this pull request Nov 7, 2020

🏁 📜 [Blocked] Use native complex tensors #134

Closed

cthoyt reviewed Nov 7, 2020: src/pykeen/models/base.py (outdated, resolved)

cthoyt reviewed Nov 7, 2020: src/pykeen/models/base.py (outdated, resolved)

cthoyt reviewed Nov 7, 2020: src/pykeen/models/base.py (outdated, resolved)

cthoyt mentioned this pull request Nov 8, 2020

Improve checking for abstract classes and applying post init hooks #137

Merged

cthoyt added the enhancement New feature or request label Nov 8, 2020

cthoyt mentioned this pull request Nov 9, 2020

Update type hints and random code cleanup #140

Merged

cthoyt reviewed Nov 9, 2020: src/pykeen/models/base.py (outdated, resolved)

cthoyt reviewed Nov 9, 2020: src/pykeen/models/unimodal/ermlp.py (outdated, resolved)

cthoyt mentioned this pull request Nov 9, 2020

Improve usage of interaction functions for user-specific embeddings #65

Closed

cthoyt mentioned this pull request Nov 13, 2020

Add score_t/h function for ComplEx #150

Merged

cthoyt reviewed Nov 13, 2020: src/pykeen/models/unimodal/trans_e.py (resolved)

cthoyt reviewed Nov 13, 2020: src/pykeen/models/base.py (outdated, resolved)

cthoyt reviewed Nov 13, 2020: src/pykeen/models/base.py (outdated, resolved)

cthoyt reviewed Nov 13, 2020: src/pykeen/models/base.py (outdated, resolved)

mberr added 2 commits December 17, 2020 08:33

Move variants to module

7d137c7

extract common code

c9b240e

cthoyt reviewed Dec 17, 2020

cthoyt added 8 commits December 17, 2020 14:18

Code cleanup

3bf1ab9

Update test_models.py

e3ca9bf

More cleanup

675eeac

Update losses.py

ff9a9db

Update typing.py

ebdddbd

Update losses.py

bf63db3

Update tests.yml

c988b49

Update test_utils.py

79b2187

This was referenced Jan 21, 2021

💃 🦜 Extract functional forms #238

Merged

Extract functional forms of KGEMs #239

Closed

Extract functional modules #240

Closed

cthoyt changed the title ~~Extract interaction function from models~~ 💃 🥤 Extract interaction function from models Jan 21, 2021

This was referenced Jan 22, 2021

🧨 💎 Add literal interaction and outline for tests #245

Merged

💃 💄 Excise non-core functionality from the base Model class #248

Merged

💃 🆕-style models #260

Merged

This was referenced Feb 9, 2021

🟣 🧬 Add embedding specification class #277

Merged

👽 🇪🇺 The shape of embeddings to come #287

Merged

🌻 ❄️ Add dtype annotation to pykeen.nn.Embedding #292

Merged

cthoyt closed this Mar 22, 2021

cthoyt reopened this Mar 22, 2021

cthoyt mentioned this pull request May 6, 2021

🔧🕗 Update model initialization #411

Merged

cthoyt closed this Nov 5, 2021

cthoyt deleted the add_interaction_function_2 branch November 5, 2021 22:32
