Strategy transformers #380

marcharper · 2015-10-23T05:55:04Z

This PR introduces strategy transformers, which are ways of transforming the behavior of a strategy without rewriting the strategy's class. There is a generic transformer factory that will take a user-defined function and wrap the Player's strategy method.

For example, using FlipTransformer, we can turn Cooperator into Defector:

new_class = FlipTransformer(axelrod.Cooperator)
player = new_class() # It's the same as Defector now

Similarly we could turn AntiCycler into DefectingAntiCycler, playing

D DC DDC DDDC DDDDC ...

Instead of

C CD CCD CCCD CCCCD ...

Another transform adds TFT style retaliation to any other strategy:

RUA = RetailiateUntilApologyTransformer()
TFT = RUA(axelrod.Cooperator)
player = TFT()

This actually does transform Cooperator exactly into TitForTat (it passes all the tests for TFT, which specify TFT completely). For other strategies, it only affects what follows an opponent's defection, otherwise the player's desired plays are simply passed through.

There are additional transformers for the following modifications:

Adding noise (similar to the noisy tournament but only to one of the players)
Adding probabilistic defection forgiveness
Defecting on the last N moves (only if the proper tournament attribute is available), or more generally finishing with any given sequence of moves
Starting with any specific sequences of moves then playing as the strategy intended
Internally tracking intended history (e.g. to infer ambient noise with)

If you've been wondering if MetaHunter would be better with a little TFT style retaliation, there's no need to write a new strategy! Moreover the transformations can be chained or composed, so you can add as many transformations to any strategy. The transformations are preserved when the player is cloned.

These transforms can also be used as class decorators. As an example in the library, I modified BackStabber's implementation:

@FinalTransformer([D, D, D]) # End with three defections
class BackStabber(Player):
...
    def strategy(self, opponent):
        if not opponent.history:
            return C
        if opponent.defections > 3:
            return D
        return C

while removing these two lines from the strategy method:

-        if len(opponent.history) > (self.tournament_attributes['length'] - 3):
-            return D

Now it works even if the tournament length is not known, and defects as intended if the length is available.

Some ideas for additional transforms:

A grudger style never forgive
Automatic noise detection
Automatic memory_depth inference
Adding hunting behavior to other strategies
Add other types of retaliation (e.g. TitForTwoTats)

I only ran into one significant issue while testing -- the fact that the classifier dict is sometimes a class variable made it difficult to test, since it would overwrite the original class variable (in the super class). One workaround would be to set the classifiers in a method rather than have them as class variables.

Once #372 is merged I'll add a page of usage documentation.

drvinceknight · 2015-10-23T08:08:08Z

This looks interesting and obviously fits the goal of the library of facilitating the study of IPDs.

I need to get my head around it a bit more I think. I haven't looked at the code yet, so just thinking about the problem this solves: can you help me out a bit more than you already have. Are you thinking the (main) use case is for people using the library and wanting to create on the fly strategies? Or are these (mainly) to help create strategies?

Very much in favour, for example the @FinalTransformer([D, D, D]) is a lovely decorator and it helps 'idiot proof' as you say:

Now it works even if the tournament length is not known, and defects as intended if the length is available.

Just my usual initial slowness at getting my head around it 👍 😄

Once #372 is merged I'll add a page of usage documentation.

This is a perfect example of a 'further_topics' tutorial I think?
This is where the new modular tutorial docs I think come in to their own, instead of a big terrible monster being creating (as the previous docs were) we can make modular adjustments...

Also: anything we can do about 0.1% drop in coverage? (Not a disaster obviously)

marcharper · 2015-10-23T14:32:38Z

I'm sure that I can get the coverage up.

This should prevent code duplication and expand the possible strategies the library can produce substantially. There are already many strategies that are e.g. TFT but defect on round one instead.

Plus you can, for example, run a tournament of many strategies and apply a transform to all of them. What happens if everyone defects on the first round? If everyone retaliates like TFT?

drvinceknight · 2015-10-23T16:27:01Z

This should prevent code duplication and expand the possible strategies the library can produce substantially. There are already many strategies that are e.g. TFT but defect on round one instead.

Plus you can, for example, run a tournament of many strategies and apply a transform to all of them. What happens if everyone defects on the first round? If everyone retaliates like TFT?

Have had more time to think about it: big big fan.

Will look through the code itself as soon as possible.

marcharper · 2015-10-23T16:42:23Z

Have had more time to think about it: big big fan.

Glad to hear about it, as I'm fairly certain these transformers are the key to studying the IPD with category theory. I'll write something up eventually...

drvinceknight · 2015-10-23T16:43:33Z

Oh wow: that sounds awesome! Category theory is something I know very
little about.

On Fri, Oct 23, 2015 at 5:42 PM Marc Harper, PhD notifications@github.com
wrote:

Have had more time to think about it: big big fan.

Glad to hear about it, as I'm fairly certain these transformers are the
key to studying the IPD with category theory. I'll write something up
eventually...

—
Reply to this email directly or view it on GitHub
#380 (comment)
.

meatballs · 2015-10-23T17:35:47Z

Category theory is something I know very
little about.

I can only aspire to attain such a level of knowledge

drvinceknight · 2015-10-23T17:44:25Z

Be careful what you wish for :)

On Fri, 23 Oct 2015, 18:35 Owen Campbell notifications@github.com wrote:

Category theory is something I know very
little about.

I can only aspire to attain such a level of knowledge

—
Reply to this email directly or view it on GitHub
#380 (comment)
.

marcharper · 2015-10-23T17:52:07Z

I can only aspire to attain such a level of knowledge

It's nbd really, you just have to waste years of your life in mathematics graduate school, postponing other life goals and the development of a viable career in the meantime. Or learn a bit of Haskell I suppose.

drvinceknight · 2015-10-23T17:55:06Z

Lol. Slightly related. I've seen a quote somewhere, something like:

'graduate school is reducing current income in order to reduce future
income'

On Fri, 23 Oct 2015, 18:52 Marc Harper, PhD notifications@github.com
wrote:

I can only aspire to attain such a level of knowledge

It's nbd really, you just have to waste years of your life in mathematics
graduate school, postponing other life goals and the development of a
viable career in the meantime. Or learn a bit of Haskell I suppose.

—
Reply to this email directly or view it on GitHub
#380 (comment)
.

meatballs · 2015-10-24T15:04:37Z

axelrod/strategies/backstabber.py

Shouldn't this be two defections? The orginal condition was for opponent history length to be greater than 197, so 198 is the first occurrence. If the the history is 198 long, then we are on turn 199.

Thanks -- I fixed it, and updated the tests.

meatballs · 2015-10-24T15:05:52Z

This is an excellent piece of work and opens up all sorts of possibilities. Other than my comment on backstabber, I'm more than happy to see this go in.

drvinceknight · 2015-10-24T21:30:49Z

axelrod/strategies/strategy_transformers.py

This comment isn't quite right? It's do seq on last len(seq) actions right? The default is default...

Good catch.

Thought as much :)

On Sat, 24 Oct 2015, 22:59 Marc Harper, PhD notifications@github.com
wrote:

In axelrod/strategies/strategy_transformers.py
#380 (comment):

return transformer

+def final_sequence(player, opponent, action, seq):

"""Play the moves in seq first, ignoring the strategy's

moves until the list is exhausted."""

length = player.tournament_attributes["length"]

if length < 0: # default is -1

return action

index = length - len(player.history)

if index <= len(seq):

return seq[-index]

return action

+# Defect on last N actions

Good catch -- that's what it was initially but then I generalized to an
arbitrary sequence.

—
Reply to this email directly or view it on GitHub
https://github.com/Axelrod-Python/Axelrod/pull/380/files#r42938684.

drvinceknight · 2015-10-24T21:39:50Z

Just some minor things I've commented on. I think the main thing this is missing is an advanced tutorial or further topics. I'd also suggest that a pointer gets put in the contribution docs to that tutorial but as you say I guess that is easiest if you wait for #372?

I completely agree with @meatballs: this is great and leaves open the possibility of further transformations to be added :) (that could be explained in the tutorial perhaps?)

marcharper · 2015-10-24T22:14:30Z

I will certainly improve the documentation -- both the docstrings and an advanced tutorial, both of which are needed since we're metaprogramming now. But I do want #372 to hit first so I don't have merge conflicts on the docs (and I wanted to make sure that you all liked the idea).

drvinceknight · 2015-10-24T22:17:27Z

Love the idea.

On Sat, 24 Oct 2015, 23:14 Marc Harper, PhD notifications@github.com
wrote:

I will certainly improve the documentation -- both the docstrings and an
advanced tutorial, both of which are needed since we're metaprogramming
now. But I do want #372
#372 to hit first so I
don't have merge conflicts on the docs (and I wanted to make sure that you
all liked the idea).

—
Reply to this email directly or view it on GitHub
#380 (comment)
.

…, plus a test to catch this

drvinceknight · 2015-10-26T08:35:20Z

docs/tutorials/advanced/strategy_transformers.rst

blank line needed here (before the code)

drvinceknight · 2015-10-26T08:37:22Z

Yeah this is brilliant. Just some blank lines to tidy things a bit.

I really like the tutorial.

Brilliant contribution, very exciting.

drvinceknight · 2015-10-26T18:32:19Z

👍

Strategy transformers

langner · 2015-10-27T05:22:21Z

This is very nice!

…ormers Strategy transformers

drvinceknight added the ready label Oct 23, 2015

meatballs reviewed Oct 24, 2015
View reviewed changes

drvinceknight reviewed Oct 24, 2015
View reviewed changes

marcharper changed the title ~~Strategy transformers~~ Strategy transformers (Pending #372 and a documentation commit) Oct 24, 2015

marcharper added 9 commits October 25, 2015 10:25

Strategy Transformers

7882c77

More tests

7d822fd

Noisy Transformer

753f623

History Tracking Wrapper

e87c7a0

Modify BackStabber to use FinalTransformer Decorator

23ce4f9

Update tests

7c6bad8

Add in test for Retaliation transformer

b2e988b

Improve coverage

486424f

Backstabber should defect twice at the end if the tournament is known…

6bc7004

…, plus a test to catch this

Improved docstrings for transformers and tests

3e3cb26

marcharper force-pushed the strategy_transformers branch from fb3cf46 to ece74d6 Compare October 25, 2015 21:23

marcharper changed the title ~~Strategy transformers (Pending #372 and a documentation commit)~~ Strategy transformers Oct 25, 2015

Strategy transformers advanced tutorial

a2221fc

marcharper force-pushed the strategy_transformers branch from 8953bca to a2221fc Compare October 25, 2015 21:46

Remove comment to prevent merge conflict with Axelrod-Python#378

9e92a19

drvinceknight reviewed Oct 26, 2015
View reviewed changes

docs/tutorials/advanced/strategy_transformers.rst Outdated

Copy link

Member

drvinceknight Oct 26, 2015

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

blank line needed here (before the code)

Blank lines for tutorial

62f684d

meatballs added a commit that referenced this pull request Oct 26, 2015

Merge pull request #380 from marcharper/strategy_transformers

a37d905

Strategy transformers

meatballs merged commit a37d905 into Axelrod-Python:master Oct 26, 2015

meatballs removed the ready label Oct 26, 2015

marcharper pushed a commit to marcharper/Axelrod that referenced this pull request Nov 2, 2015

Merge pull request Axelrod-Python#380 from marcharper/strategy_transf…

cd14776

…ormers Strategy transformers

Strategy transformers #380

Strategy transformers #380

Uh oh!

Conversation

marcharper commented Oct 23, 2015

Uh oh!

drvinceknight commented Oct 23, 2015

Uh oh!

marcharper commented Oct 23, 2015

Uh oh!

drvinceknight commented Oct 23, 2015

Uh oh!

marcharper commented Oct 23, 2015

Uh oh!

drvinceknight commented Oct 23, 2015

Uh oh!

meatballs commented Oct 23, 2015

Uh oh!

drvinceknight commented Oct 23, 2015

Uh oh!

marcharper commented Oct 23, 2015

Uh oh!

drvinceknight commented Oct 23, 2015

Uh oh!

meatballs Oct 24, 2015

Choose a reason for hiding this comment

Uh oh!

marcharper Oct 24, 2015

Choose a reason for hiding this comment

Uh oh!

meatballs commented Oct 24, 2015

Uh oh!

drvinceknight Oct 24, 2015

Choose a reason for hiding this comment

Uh oh!

marcharper Oct 24, 2015

Choose a reason for hiding this comment

Uh oh!

drvinceknight Oct 24, 2015

Choose a reason for hiding this comment

Uh oh!

drvinceknight commented Oct 24, 2015

Uh oh!

marcharper commented Oct 24, 2015

Uh oh!

drvinceknight commented Oct 24, 2015

Uh oh!

drvinceknight Oct 26, 2015

Choose a reason for hiding this comment

Uh oh!

drvinceknight commented Oct 26, 2015

Uh oh!

drvinceknight commented Oct 26, 2015

Uh oh!

langner commented Oct 27, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants