Skip to content
This repository has been archived by the owner on Sep 20, 2022. It is now read-only.

Adding Softmax Player #14

Merged
merged 1 commit into from
Feb 19, 2013
Merged

Adding Softmax Player #14

merged 1 commit into from
Feb 19, 2013

Conversation

cmaher
Copy link
Contributor

@cmaher cmaher commented Feb 7, 2013

The softmax algorithm is an alternative to the epsilon-greedy algorithm.
Instead of picking based on a set probability, it uses a 'temperature'
parameter, along with the known data to set the distribution of the
alternatives.

lib/bandit.rb requires the softmax player
lib/bandit/players/base.rb adds a softmax option to the factory
lib/bandit/players/softmax.rb implements the player
players.rdoc provides a short explanation

The softmax algorithm is an alternative to the epsilon-greedy algorithm.
Instead of picking based on a set probability, it uses a 'temperature'
parameter, along with the known data to set the distribution of the
alternatives.

lib/bandit.rb requires the softmax player
lib/bandit/players/base.rb adds a softmax option to the factory
lib/bandit/players/softmax.rb implements the player
players.rdoc provides a short explanation
@bmuller
Copy link
Owner

bmuller commented Feb 7, 2013

Hey @cmaher - thanks for this! I'll try to take a look this weekend.

B

@cmaher
Copy link
Contributor Author

cmaher commented Feb 7, 2013

@bmuller - I'm happy to contribute. I'm just glad I don't have to create a whole bandit system from scratch.

bmuller added a commit that referenced this pull request Feb 19, 2013
@bmuller bmuller merged commit c58b8f9 into bmuller:master Feb 19, 2013
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants