Desired New strategies #379

marcharper · 2015-10-22T15:09:49Z

Some of these may be implemented under other names already, please ask if you are unsure! Feel free to add any new ones to the list. Note that we are happy to have original contributions as well!

Binary decision strategies defined in "Varying Decision Inputs in Prisoner’s Dilemma", Barlow and Ashlock 2015
Function stack based strategies from "Ashlock, Daniel. "Training function stacks to play the iterated prisoner's dilemma." Computational Intelligence and Games, 2006 IEEE Symposium on. IEEE, 2006."
Pavlovian, Identifier strategies, Grudgian from n-Move Memory Evolutionarily Stable Strategies
for the Iterated Prisoner’s Dilemma

The "invincible strategies" in this paper which can all be implemented as special cases of the MemoryOne or LRPlayer classes.

The two "most abundant" memory one and memory two strategies in this paper.

Adaptor from Simple Adaptive Strategy Wins the Prisoner’s Dilemma second_pdf

Specific strategies evolved in Evolutionary game theory using agent-based methods such as GCA.

Strategy MO and Strategy SO from this paper

Strategies implemented in PRISON (look in classics.str):

soft_spiteful
~~slow_tft~~
~~better_and_better~~
~~worse_and_worse2~~, ~~worse_and_worse3~~

and see this paper

spiteful_cc
~~winner12~~ ~~winner 21~~
~~mem2~~
~~gradual_killer [Already done on another name?]~~
soft_tf2t [TF2T?]
and many others such as the 12 ZD strategies
Done: ~~c_then_per_dc~~, ~~doubler~~, ~~easy_go~~, ~~gradual~~, ~~per_ddc~~, ~~per_cccdcd~~, ~~prober4~~, ~~tft_spiteful~~, ~~worse_and_worse~~

From CoopSim:

~~ContriteTFT~~
TwoTitsForTwoTats -- and the generalization to NTitsForMTats
Others that you find interesting

Many strategies in this paper are not yet in the library:

From "Exploiting Evolutionary Modeling to Prevail in Iterated Prisoner’s Dilemma Tournaments":

Laran
Turan
Tages

From this page (see also the bibliography) for the 20th anniversary tournament:

~~Soft Grudger~~
~~Adaptive Tit For Tat~~
PavlovD: http://www.cs.nott.ac.uk/~pszjl/index_files/chapter4.pdf
StarSN, StarS, StarN, mem1, PoorD, ltft, MooD, and others see here and here
AITFT, GSTFT, Adept, emperor, PRobbary, HCO, etc. [See here, will require some sleuthing]http://www.cs.nott.ac.uk/~pszjl/index_files/IPDbook_chap01.pdf)
~~Adaptive~~
~~APavlov~~ : http://www.graham-kendall.com/papers/lhk2011.pdf
~~NEG: "NEG plays according to simple rules: if opponent plays COOPERATION, in next move NEG will play DEFECTION; if opponent plays DEFECTION NEG will play COOPERATION. First move will be random."~~
~~Omega Tit For Tat~~ (see also here)

From here:

Free Rider
Rover

From this paper and also here:

~~adaptive tft~~
~~contrite tft~~
~~handshake~~ ~~fortress3~~ ~~fortress4~~ ~~firm but fair~~ ~~gradual~~ ~~naive prober~~ ~~remorseful prober~~ ~~reverse pavlov~~ ~~soft grudger~~

Any of the interesting finite state machine strategies in the papers with fortress (and other papers authored by Wendy Ashlock and Daniel Ashlock, and collaborators)

E.g. from the 2015 paper "Multiple Opponent Optimization of
Prisoner’s Dilemma Playing Agents" including the unnamed sugar strategies and treasure hunt strategies in figures 2 and 3
~~Solution B1~~ and ~~Solution B5~~
Also from "Fingerprint Analysis of the Noisy Prisoner's Dilemma Using a Finite-State Representation"
vengeful, PSY, PSY-TFT, TFT-PSY, UD, UC

Many from this paper. Note the several are already in the library, including ~~ALLC, ALLD, TFT, WSLS, willing, hopeless, and desperate~~ (and possibly others).

From these two papers:

From this page:

forgiving
nasty TFT (randomly plays DD)

From the mythical tournament preliminary to Axelrod #1:

Analogy
Look Up / Look Ahead (different from LookerUp in the library)

From this publication:

~~Gradual~~
~~Adaptive tit-for-tat~~

From this paper:

Lenient Grim 3
Exp. TFT
False Cooperator
TF3T
Exp Grim 2
Lenient Grim 2
Exp TF3T
T2

From this paper:

~~shortmem~~
~~selfsteem~~
Boxer
~~VeryBad~~
ANN Agents
GADP1
GADP2
BM
MC
~~Stalker~~

From this library (if the license is compatible):

cautious
copycat
craby
forgetful
golden
Hardy
Mean
Mensa
Moron
Observant
Unforgiving
Waffely
killer

Others:

Opponent Modeller see also
~~DBS, DesiredBeliefStrategy ref~~
From the 20th anniversary tournament book | slides with some info Book
MaRS: Mimicry and Relative Similarity

No-tricks
Strategies described here

Theory of mind strategies discussed here.

Would be neat to have strategies based on:

cellular automata / ~~finite state machines~~ e.g.
bandit algorithms
the memory-based strategies described here
Markov chain Monte Carlo
Neural networks See this paper for examples
"Particle Swarm Optimization Approaches to Coevolve Strategies for the Iterated Prisoner’s Dilemma"
Tree based strategies from "Crossover and Evolutionary Stability in the Prisoner’s Dilemma"

Translate Fortran strategies available in https://github.com/Axelrod-Python/axelrod-fortan to python.

The text was updated successfully, but these errors were encountered:

drvinceknight · 2015-10-22T20:28:19Z

This is a brilliant issue: 👍

drvinceknight · 2016-02-29T09:18:57Z

This strategy was mentioned on reddit (Random TitForTat and/or Grudger: defects with random probability):

https://www.reddit.com/r/GAMETHEORY/comments/480xb3/how_to_beat_this_strategy_of_prisoners_dilemma/

flammino · 2016-08-23T13:28:42Z

Would you mind if I started working on a few of these? I'm sure I'll be slow since I'm new to programming and I'm going to school while working full time, but it looks like they've been posted for awhile. If there are any strategies on the list that are particularly easy to implement I'd be happy to start with one of those.

drvinceknight · 2016-08-23T13:41:11Z

Absolutely @Adam-Flammino: please do!

I suggest that when you decide on a strategy you run it past us just to make sure it hasn't been implemented yet (it's possible that they're on the list but have already been implemented).

It could also be worth checking here: http://axelrod.readthedocs.io/en/latest/reference/all_strategies.html

That's the list of the strategies that are definitely in the library :)

As far as a suggestion for an initial one to go for, perhaps pick "Nasty Tit For Tat" which is described as "tit-for-tat but attempts to exploit non-retaliatory strategies by playing a DD with some probability". If you think there's another one in the list that you like the look of please do go for that though :)

Also: if you need help you can always pop in to this chat room https://gitter.im/Axelrod-Python/Axelrod there are usually a few of us around that can help :)

flammino · 2016-08-23T13:52:07Z

That actually sounds a little similar to how I play risk. I've definitely got some more reading to do before I start, but I'd be happy to try. Thanks for the chat link too!

marcharper · 2016-08-23T15:29:04Z

@Adam-Flammino We're happy to e.g. help write tests, just open a PR or ask us here (or on gitter).

flammino · 2016-08-24T15:03:31Z

@marcharper thanks! Haven't gotten to start yet, but I appreciate the help! Hopefully before too long I'll be able to actually help out with the project- it looks interesting

flammino · 2016-08-26T21:21:59Z

So I got to actually sit down and look at things today. I appreciate all the variations of tit for tat being in the same .py file- makes it easier for someone who's still learning like me to see the right format. Another quick newb question- should I submit a new .py file with my pull request or just add my code to the end of the existing file?

marcharper · 2016-08-26T21:24:16Z

@Adam-Flammino Either way, really. We don't have a scheme for which strategies go in which files, we just group them as we go wherever they seem to fit.

flammino · 2016-08-26T21:30:04Z

@marcharper Cool. Thanks again.

drvinceknight · 2016-08-26T21:30:12Z

If you're going to go with nasty tit for that as the strategy I'd suggest that you just add that to the bottom of the existing tit for tat file:)

flammino · 2016-09-04T19:10:03Z

I appreciate how welcoming you all have been and I do think this is a very interesting project I'd like to help with eventually, but some life changes just came up and I don't know when I'll have time to actually contribute to this. I plan on coming back (hopefully with enough experience to make real contributions), but I'm not sure when that will be so don't hold any of these for me.

meatballs · 2016-09-05T08:22:39Z

@Adam-Flammino No problem - thanks for the contributions so far!

We'll still be here when you have time again. All the best.

souravsingh · 2016-09-19T12:59:15Z

I would like to help in writing some of the strategies.

meatballs · 2016-09-19T13:01:29Z

@souravsingh Are you here at PyCon UK?

souravsingh · 2016-09-19T13:06:52Z

@meatballs I couldn't come to PyCon UK due to visa issues. But I can try and help remotely.

drvinceknight · 2016-09-19T13:53:09Z

Welcome @souravsingh! Drop in to our gitter channel: gitter.im/Axelrod-Python/Axelrod

Otherwise, take a look at the contribution guidelines and don't hesitate to ask any questions :)

http://axelrod.readthedocs.io/en/latest/tutorials/contributing/strategy/index.html

mturzanska · 2016-10-07T15:58:17Z

Hi. Is soft_tf2t up for grabs?
gradual_killer needs crossing out because it's already implemented.

drvinceknight · 2016-10-07T16:35:06Z

@mturzanska I'm afraid that that's in already to under TitFor2Tats: http://axelrod.readthedocs.io/en/latest/_modules/axelrod/strategies/titfortat.html#TitFor2Tats

(Please correct me if you think I'm not quite right reading the two logics).

mturzanska · 2016-10-08T02:37:54Z

You're right. Then continuing from the top:

c_then_per_dc exists (as GrudgerAlternator)
per_cccdcd needs implementing (as a subclass of Cycler)
prober4 needs implementing
doubler needs implementing

Please let me know if I sorted it out right this time.
Would you mind if I start working on those?

drvinceknight · 2016-10-08T09:27:22Z

Would you mind if I start working on those?

That would be awesome! Let us know if we can assist in any way :) 👍

MariosZoulias · 2017-05-07T21:20:08Z

Thank you for your answer.
But i still have a question .
Lets assume that the game is Stein_and_Rapoport vs Random ...
So the Stein_and_Rapoport player will understand that the random one plays randomly .
So how does this fact (that the opponent playes randomly or not) changes the move of Stein_and_Rapoport.
Do we search for it just theoritically (just to know if the opponent plays random) or practically (e.g. if he plays random --> we always defect, if not --> we play tit for tat) ???

Thank you

marcharper · 2017-05-08T01:17:02Z

Hi @MariosZoulias -- the strategy isn't well-described but I assume that the Chi-squared test is used to determine if the opponent is playing randomly by some level of confidence, and if so, defect against it.

MariosZoulias · 2017-05-08T22:09:15Z

Thanks a lot for the answer.
I also have two more question (actually i need your advice if possible).
Working on the chi-squared test ,

in order to understand if the opponent behaves randomly do i have to take into account ,his next moves after C and D of my player (stein_and_rapoport) and then check the chis-squared ? Or it is even simpler ?
Strategy Random is random (ok). But i think strategy Alternator Cooperator and Defector (eg) are also random because they behave in the same way all the time . Also i believe TitForTat is not random strategy (player behaves differently according to the moves of the other player). A i right on my thinking ??

Thank you

drvinceknight · 2017-05-09T07:22:01Z

in order to understand if the opponent behaves randomly do i have to take into account ,his next moves after C and D of my player (stein_and_rapoport) and then check the chis-squared ? Or it is even simpler ?

I believe it's a straight forward chi squared test based on the two numbers: the number of cooperations and the number of defections. A chi squared test checks those counts and infers (from the total number of counts) whether or not this is a random distribution.

Strategy Random is random (ok). But i think strategy Alternator Cooperator and Defector (eg) are also random because they behave in the same way all the time . Also i believe TitForTat is not random strategy (player behaves differently according to the moves of the other player). A i right on my thinking ??

No Alternator, Cooperater and Defector are not random. If you were playing against Cooperator the count of cooperations after 40 turns would be 40 cooperations and 0 defections. That would be statistically different to 20 cooperations and 20 defections as would be indicated by the chi squared test.

If you were playing the Random player, perhaps after 4 turns you would have 4 cooperations and 0 defetcions and (perhaps) the chi squared test would say that that is statistically different to random behaviour however after 40 rounds maybe the count would be 24 and 16 which the chi squared test would say is random.

Here is the chi-squared test in scipy: https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.chisquare.html

>>> from scipy.stats import chisquare
>>> test = chisquare([4, 0])
>>> test.pvalue
0.04550026389635857
>>> test = chisquare([26, 16])
>>> test.pvalue
0.12282264810139186
>>> test = chisquare([40, 40])
>>> test.pvalue
1.0
>>> test = chisquare([101, 99])
>>> test.pvalue
0.88753708398171505

Here is a plot of test.pvalue for chisquare([100 - n, n]): ie looking at all possible number of counts of cooperations and defections after 100 turns:

>>> import matplotlib.pyplot as plt
>>> ns = range(1, 101)
>>> ps = [chisquare([100 - n, n]).pvalue for n in ns]
>>> plt.plot(ns, ps)

That's showing that the pvalue is high around the time where we're near to a 50/50 split.

What a chi squared test is doing is checking if the distribution given (the count of defections and cooperations) is statistically significant to the random distribution (50/50 split). This is done by comparing the pvalue to some significance level. So if pvalue < alpha then you would say that the distribution is significantly different to the random distribution. So, if pvalue >= alpha then the opponent is playing randomly. Often a value of alpha=0.05 is used in the literature but that's just an arbitrary choice so we would need to make a choice for the strategy (I assume none can be found in the literature) and that can also be a parameter of the strategy.

drvinceknight · 2017-05-09T07:37:30Z

Note that axl.Player has a cooperations and defections attribute that counts these things already. So using the chi squared test with the library will be straight forward:

>>> from scipy.stats import chisquare
>>> import axelrod as axl
>>> axl.seed(0)
>>> players = (axl.Cooperator(), axl.Random())
>>> match = axl.Match(players, turns=200)
>>> _ = match.play()
>>> players[0].cooperations, players[0].defections
(200, 0)
>>> chisquare([players[0].cooperations, players[0].defections]).pvalue
2.0884875837625688e-45
>>> players[1].cooperations, players[1].defections
(93, 107)
>>> chisquare([players[1].cooperations, players[1].defections]).pvalue
0.32219880616257868

MariosZoulias · 2017-05-09T13:30:40Z

Thank you for the analysis .
The only thing that is note clear in my mind is that :
If i do

chisquare([players[1].cooperations, players[1].defections]).pvalue
and i have
players = (axl.Random(), axl.TitForTat())
The pvalue of TitForTat is gonna be a big number (like 0.65) which means that we have to say the opponent (titfortat) plays randomly .
Which does not exist because he doesnt play randomly but according to titfortat strategy .
Also like titfortat for alternator (which is 50/50) the scipy is gonna give a high number . So again we will receive it like a random one .
So in both we fail because titfortat and alternator are not random .

What do i think wrong here?

drvinceknight · 2017-05-09T14:30:28Z

You're not doing anything wrong, I think you're just pointing out a weakness of the strategy. From how it is described I think the only thing you can do is test the distribution of C and D as I have written. Because of the way the strategy in question plays it would in fact recognise that Tit for Tat is not random.

drvinceknight · 2017-06-07T09:46:00Z

Some simple ZD ones to implement from the literature. From @marcharper on #1041:

I have found some other concrete ZD examples in case we want to add more examples from the literature:

(11/13, 1/2, 7/26, 0) from Press and Dyson
ZDmischief (0.8, 0.6, 0.1, 0) an ZDextortion (0.64, 0.18, 0.28, 0) from this paper: https://arxiv.org/pdf/1308.2576.pdf
~~There's a memory-two generalization in this paper on page 21~~, as well as the memory-one (15/16, 1/2, 7/16, 1/16): http://math.uchicago.edu/~may/REU2014/REUPapers/Li,Siwei.pdf

Looks like maybe one or two more in this paper PZDR (1.0, 0.35, 0.75, 0.1) (but looks like donation game matrix): https://pdfs.semanticscholar.org/824a/2123e1de5aa2e971fa9b1bf167b8ff246aa5.pdf
Some in this paper, see the caption for Fig 3: http://web.evolbio.mpg.de/~hilbe/Research_files/Hilbe%20et%20al%20(GEB%202015)%20Partners%20or%20rivals.pdf

drvinceknight · 2017-07-27T13:36:49Z

Have edited the above list with a pointer to the fortran strategies.

souravsingh · 2017-09-14T03:17:02Z

@drvinceknight The link to Mensa strategy shows a license which says that- "Redistributions of source code must retain the above copyright
notice, this list of conditions and the following disclaimer."

Should we be working on adding the strategies to the library, considering the license is incompatible?

meatballs · 2017-09-14T07:25:34Z

The licence applies to the source code itself - not to the idea which is captured by that code. We would be in breach of the licence if we took the code and incorporated into our library, but we are perfectly ok to take code the strategy ourselves.

JCodyA · 2020-12-28T14:52:32Z

Hello all! What a cool project, I just discovered this a few days ago. I'd love to contribute some new strategies. Has anyone implemented a Perlin style random strategy?

marcharper · 2021-01-02T21:15:31Z

Hi @JCodyA, you can check the list of references in the documentation to see if there's a matching source. If you are still unsure, please post a source and I should be able to tell if there's already a matching strategy in the library.

JCodyA · 2021-01-03T16:02:53Z

@marcharper I had a look at docs/reference/all_strategies.rst and the only strategy listed that was similar was the rand.py strategy, but not a perlin one. I'm thinking of two possible variations of a perlin strategy: a perlin cooperator and a perlin defector. One will cooperate on a semi random basic similar to natural randomness (ie raindrops), and the other will defect on a semi random basis.

marcharper · 2021-01-04T00:49:12Z

@JCodyA I don't think there's anything quite like that -- I presume you mean that a player will say defect with some distribution other than a Bernoulli. There are several strategies that behave like say TFT and then randomly defector or otherwise act randomly or noisily, but that doesn't sound like quite the same thing. See TrickyCooperator and RandomTitForTat for examples.

marcharper added the Easy_pull_request label Oct 22, 2015

marcharper mentioned this issue Jun 7, 2016

Tournaments to Reproduce #387

Closed

314pe mentioned this issue Jun 11, 2016

Implemented a naive prober strategy (like tit for tat, but randomly defects). #629

Merged

marcharper mentioned this issue Aug 9, 2016

Dictionary mapping player names to classes #479

Closed

geraintpalmer mentioned this issue Sep 19, 2016

3 New Strategies From the PRISON (http://www.lifl.fr/IPD/ipd.frame.html) #715

Merged

This was referenced Sep 20, 2016

New Strategy From the PRISON (http://www.lifl.fr/IPD/ipd.frame.html) #720

Closed

New Strategy From the PRISON (http://www.lifl.fr/IPD/ipd.frame.html) #724

Merged

This was referenced Oct 9, 2016

Implement per_cccdcd strategy from PRISON project #739

Merged

Implement prober4 strategy from PRISON project #743

Merged

This was referenced May 14, 2017

Stein and rapoport #1007

Closed

Refactoring mutual.py using versus_test #1008

Closed

all changes #1009

Closed

SteinAndRapoport Strategy #1012

Merged

drvinceknight mentioned this issue Jun 7, 2017

Add sources for some strategies. #1041

Merged

drvinceknight mentioned this issue Jun 22, 2017

GRASR strategy #1051

Open

drvinceknight added up-for-grabs and removed Easy_pull_request labels Jun 22, 2017

Chadys mentioned this issue Jul 24, 2017

N tit(s) for M tat(s) #1084

Merged

dmanc mentioned this issue Jul 27, 2017

Michaelos strategy #1091

Merged

drvinceknight mentioned this issue Jul 27, 2017

Import Fortran Strategies #381

Closed

drvinceknight mentioned this issue Aug 4, 2017

Specific strategy: Graaskamp #1109

Closed

gaffney2010 mentioned this issue Dec 31, 2018

Added some of the strategies listed in Ashlock2009. #1231

Merged

marcharper pinned this issue Mar 2, 2020

marcharper mentioned this issue Jul 17, 2020

New Games #1328

Open

Desired New strategies #379

Desired New strategies #379

Comments

marcharper commented Oct 22, 2015 • edited

drvinceknight commented Oct 22, 2015

drvinceknight commented Feb 29, 2016

flammino commented Aug 23, 2016

drvinceknight commented Aug 23, 2016

flammino commented Aug 23, 2016

marcharper commented Aug 23, 2016

flammino commented Aug 24, 2016

flammino commented Aug 26, 2016

marcharper commented Aug 26, 2016

flammino commented Aug 26, 2016

drvinceknight commented Aug 26, 2016

flammino commented Sep 4, 2016 • edited

meatballs commented Sep 5, 2016

souravsingh commented Sep 19, 2016

meatballs commented Sep 19, 2016

souravsingh commented Sep 19, 2016

drvinceknight commented Sep 19, 2016

mturzanska commented Oct 7, 2016

drvinceknight commented Oct 7, 2016

mturzanska commented Oct 8, 2016

drvinceknight commented Oct 8, 2016

MariosZoulias commented May 7, 2017

marcharper commented May 8, 2017

MariosZoulias commented May 8, 2017

drvinceknight commented May 9, 2017

drvinceknight commented May 9, 2017

MariosZoulias commented May 9, 2017 • edited

drvinceknight commented May 9, 2017

drvinceknight commented Jun 7, 2017 • edited by marcharper

drvinceknight commented Jul 27, 2017

souravsingh commented Sep 14, 2017

meatballs commented Sep 14, 2017

JCodyA commented Dec 28, 2020

marcharper commented Jan 2, 2021

JCodyA commented Jan 3, 2021 • edited

marcharper commented Jan 4, 2021 • edited

marcharper commented Oct 22, 2015 •

edited

flammino commented Sep 4, 2016 •

edited

MariosZoulias commented May 9, 2017 •

edited

drvinceknight commented Jun 7, 2017 •

edited by marcharper

JCodyA commented Jan 3, 2021 •

edited

marcharper commented Jan 4, 2021 •

edited