<img src="http://imgur.com/1ZcRyrc.png" style="float: left; margin: 15px; height: 80px">

# Project 1

### Building "Pokemon Stay"

---
You are an analyst at a "scrappy" online gaming company that specializes in remakes of last year's fads.

Your boss, who runs the product development team, is convinced that Pokemon Go's fatal flaw was that you had to actually move around outside. She has design mock-ups for a new game called Pokemon Stay: in this version players still need to move, but just from website to website. Pokemon gyms are now popular online destinations, and catching Pokemon in the "wild" simply requires browsing the internet for hours in the comfort of your home.

She wants you to program a prototype version of the game and analyze the planned content to help the team calibrate the design.

<h1>Table of Contents<span class="tocSkip"></span></h1>
<div class="toc"><ul class="toc-item"><li><ul class="toc-item"><li><span><a href="#Building-&quot;Pokemon-Stay&quot;" data-toc-modified-id="Building-&quot;Pokemon-Stay&quot;-0.1">Building "Pokemon Stay"</a></span><ul class="toc-item"><li><span><a href="#Package-imports" data-toc-modified-id="Package-imports-0.1.1">Package imports</a></span></li></ul></li></ul></li><li><span><a href="#1.-Defining-a-player" data-toc-modified-id="1.-Defining-a-player-1">1. Defining a player</a></span></li><li><span><a href="#2.-Defining-&quot;gym&quot;-locations" data-toc-modified-id="2.-Defining-&quot;gym&quot;-locations-2">2. Defining "gym" locations</a></span></li><li><span><a href="#3.-Create-a-pokedex" data-toc-modified-id="3.-Create-a-pokedex-3">3. Create a pokedex</a></span></li><li><span><a href="#4.-Create-a-data-structure-for-players" data-toc-modified-id="4.-Create-a-data-structure-for-players-4">4. Create a data structure for players</a></span><ul class="toc-item"><li><span><a href="#4.1" data-toc-modified-id="4.1-4.1">4.1</a></span></li><li><span><a href="#4.2" data-toc-modified-id="4.2-4.2">4.2</a></span></li></ul></li><li><span><a href="#5.-Add-captured-pokemon-for-each-player" data-toc-modified-id="5.-Add-captured-pokemon-for-each-player-5">5. Add captured pokemon for each player</a></span></li><li><span><a href="#6.-What-gyms-have-players-visited?" data-toc-modified-id="6.-What-gyms-have-players-visited?-6">6. What gyms have players visited?</a></span><ul class="toc-item"><li><span><a href="#6.1" data-toc-modified-id="6.1-6.1">6.1</a></span></li><li><span><a href="#6.2" data-toc-modified-id="6.2-6.2">6.2</a></span></li></ul></li><li><span><a href="#7.-Calculate-player-&quot;power&quot;." data-toc-modified-id="7.-Calculate-player-&quot;power&quot;.-7">7. Calculate player "power".</a></span></li><li><span><a href="#8.-Load-a-pokedex-file-containing-all-the-pokemon" data-toc-modified-id="8.-Load-a-pokedex-file-containing-all-the-pokemon-8">8. 
Load a pokedex file containing all the pokemon</a></span><ul class="toc-item"><li><span><a href="#8.1" data-toc-modified-id="8.1-8.1">8.1</a></span></li><li><span><a href="#8.2-Parse-the-raw-pokedex-with-list-comprehensions" data-toc-modified-id="8.2-Parse-the-raw-pokedex-with-list-comprehensions-8.2">8.2 Parse the raw pokedex with list comprehensions</a></span></li></ul></li><li><span><a href="#9.-Write-a-function-to-generate-the-full-pokedex" data-toc-modified-id="9.-Write-a-function-to-generate-the-full-pokedex-9">9. Write a function to generate the full pokedex</a></span></li><li><span><a href="#10.-Write-a-function-to-generate-a-&quot;filtered&quot;-pokedex" data-toc-modified-id="10.-Write-a-function-to-generate-a-&quot;filtered&quot;-pokedex-10">10. Write a function to generate a "filtered" pokedex</a></span></li><li><span><a href="#11.-Descriptive-statistics-on-the-prototype-pokedex" data-toc-modified-id="11.-Descriptive-statistics-on-the-prototype-pokedex-11">11. Descriptive statistics on the prototype pokedex</a></span><ul class="toc-item"><li><span><a href="#11.1" data-toc-modified-id="11.1-11.1">11.1</a></span></li><li><span><a href="#11.2" data-toc-modified-id="11.2-11.2">11.2</a></span></li></ul></li><li><span><a href="#12.-Calibrate-the-frequency-of-Pokemon" data-toc-modified-id="12.-Calibrate-the-frequency-of-Pokemon-12">12. Calibrate the frequency of Pokemon</a></span></li></ul></div>

#### Package imports

Of the packages imported below, `pprint` is not strictly required for any part of the project, but printing Python variables and objects with `pprint` formats them in a "prettier" way. `numpy` is used later for the descriptive statistics.

In [1]:
from pprint import pprint
import numpy as np
import scipy.stats as st

<img src="http://imgur.com/l5NasQj.png" style="float: left; margin: 25px 15px 0px 0px; height: 25px">

## 1. Defining a player

---

The player variables are:

    player_id : id code unique to each player (integer)
    player_name : entered name of the player (string)
    time_played : time spent playing the game, in minutes (float)
    player_pokemon: the player's captured pokemon (dictionary)
    gyms_visited: ids of the gyms that a player has visited (list)
    
Create the components for a player object by defining each of these variables. The dictionary and list variables should just be defined as empty; you can use any (correctly typed) values for the others.

In [2]:
player_1 = {
    'player_id' : 1, 
    'player_name' : 'Marc',
    'time_played' : 5.0,
    'player_pokemon' : {},
    'gyms_visited' : [],
    }
pprint(player_1)

{'gyms_visited': [],
 'player_id': 1,
 'player_name': 'Marc',
 'player_pokemon': {},
 'time_played': 5.0}


<img src="http://imgur.com/l5NasQj.png" style="float: left; margin: 25px 15px 0px 0px; height: 25px">

## 2. Defining "gym" locations

---

Because you are the sole programmer, Pokemon Stay will have to start small. To begin, there will be 10 different gym location websites on the internet. The gym locations are:

    1. 'reddit.com'
    2. 'amazon.com'
    3. 'twitter.com'
    4. 'linkedin.com'
    5. 'ebay.com'
    6. 'netflix.com'
    7. 'sporcle.com'
    8. 'stackoverflow.com'
    9. 'github.com'
    10. 'quora.com'

1. Set up a list of all the gym locations. This will be a list of strings.
2. Append two of these locations to your player's list of visited gyms.
3. Print the list.

In [3]:
#1
gyms = ['reddit.com', 'amazon.com', 'twitter.com', 'linkedin.com', 'ebay.com', 'netflix.com', 'sporcle.com', 'stackoverflow.com', 'github.com', 'quora.com']

In [4]:
pprint(gyms)

['reddit.com',
 'amazon.com',
 'twitter.com',
 'linkedin.com',
 'ebay.com',
 'netflix.com',
 'sporcle.com',
 'stackoverflow.com',
 'github.com',
 'quora.com']


In [5]:
#2
player_1['gyms_visited'].extend(gyms[0:2])

In [6]:
#3
pprint(player_1)

{'gyms_visited': ['reddit.com', 'amazon.com'],
 'player_id': 1,
 'player_name': 'Marc',
 'player_pokemon': {},
 'time_played': 5.0}
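
The cell above uses `extend` to add two gyms at once. As a minimal sketch of how that differs from `append`, the example below uses throwaway lists (`visited` and `nested` are illustrative names, not part of the project):

```python
visited = []
visited.append('reddit.com')                   # adds a single item
visited.extend(['amazon.com', 'twitter.com'])  # adds each item of the given list

nested = []
nested.append(['amazon.com', 'twitter.com'])   # adds the whole list as ONE item

print(visited)
print(nested)
```

Calling `append` once per gym and calling `extend` with a list of gyms both leave a flat list of strings; passing a list to `append` does not.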


## 3. Create a pokedex

---

We also need to create some pokemon to catch. Each pokemon will be defined by these variables:

    pokemon_id : unique identifier for each pokemon (integer)
    name : the name of the pokemon (string)
    type : the category of pokemon (string)
    hp : base hitpoints (integer)
    attack : base attack (integer)
    defense : base defense (integer)
    special_attack : base special attack (integer)
    special_defense : base special defense (integer)
    speed : base speed (integer)

We are only going to create 3 different pokemon with these `pokemon_id` and `pokemon_name` values:

    1 : 'charmander'
    2 : 'squirtle'
    3 : 'bulbasaur'

Create a dictionary that will contain the pokemon. The keys of the dictionary will be the `pokemon_id` and the values will themselves be dictionaries that contain the other pokemon variables. The structure of the pokedex dictionary will start like so:
     
     {
         1: {
                 'name':'charmander',
                 'type':'fire',
                 ...
                 
The `type` of charmander, squirtle, and bulbasaur should be `'fire'`, `'water'`, and `'poison'` respectively. The other values are up to you; make them anything you like!

Print (or pretty print) the pokedex dictionary with the 3 pokemon.

In [7]:
pokedex = {
    1: {
    'pokemon_name' : 'charmander',
    'type' : 'fire',
    'hp' : 39,
    'attack' : 52,
    'defense' : 43,
    'special_attack' : 63,
    'special_defense' : 50,
    'speed' : 65,
    },
    2: {'pokemon_name' : 'squirtle',
    'type' : 'water',
    'hp' : 44,
    'attack' : 48,
    'defense' : 65,
    'special_attack' : 50,
    'special_defense' : 64,
    'speed' : 43,
       },
    3: {'pokemon_name' : 'bulbasaur',
    'type' : 'GrassPoison',
    'hp' : 60,
    'attack' : 49,
    'defense' : 49,
    'special_attack' : 65,
    'special_defense' : 65,
    'speed' : 45,
} 
}

pprint(pokedex)

{1: {'attack': 52,
     'defense': 43,
     'hp': 39,
     'pokemon_name': 'charmander',
     'special_attack': 63,
     'special_defense': 50,
     'speed': 65,
     'type': 'fire'},
 2: {'attack': 48,
     'defense': 65,
     'hp': 44,
     'pokemon_name': 'squirtle',
     'special_attack': 50,
     'special_defense': 64,
     'speed': 43,
     'type': 'water'},
 3: {'attack': 49,
     'defense': 49,
     'hp': 60,
     'pokemon_name': 'bulbasaur',
     'special_attack': 65,
     'special_defense': 65,
     'speed': 45,
     'type': 'GrassPoison'}}


<img src="http://imgur.com/l5NasQj.png" style="float: left; margin: 25px 15px 0px 0px; height: 25px">

## 4. Create a data structure for players

---

### 4.1 

In order to maintain a database of multiple players, create a dictionary that keeps track of players indexed by `player_id`. 

The keys of the dictionary will be `player_id` and the values will be dictionaries containing each player's variables (from question 1). 

Construct the `players` dictionary and insert the player that you defined in question 1, then print `players`.

In [8]:
# Create empty dictionary for all players.
players = {}

# Add player_1 in players.
players[player_1['player_id']] = player_1
pprint(players)

{1: {'gyms_visited': ['reddit.com', 'amazon.com'],
     'player_id': 1,
     'player_name': 'Marc',
     'player_pokemon': {},
     'time_played': 5.0}}


---

### 4.2

Create a new player with `player_id = 2` in the `players` dictionary. Leave the `'player_pokemon'` dictionary empty. Append `'stackoverflow.com'` and `'github.com'` to the `'gyms_visited'` list for player 2.

The `'player_name'` and `'time_played'` values are up to you, but must be a string and float, respectively.

Remember, the player_id is the key for the player in the players dictionary.

Print the `players` dictionary with the new player inserted.

In [9]:
# create player 2
player_2 = {
    'player_id': 2,
    'player_name' : 'Dave',
    'time_played' : 7.0,
    'player_pokemon' : {},
    'gyms_visited' : [gyms[7], gyms[8]] }
pprint(player_2)

{'gyms_visited': ['stackoverflow.com', 'github.com'],
 'player_id': 2,
 'player_name': 'Dave',
 'player_pokemon': {},
 'time_played': 7.0}


In [10]:
# Add player_2 in players.
players[player_2['player_id']] = player_2
pprint(players)

{1: {'gyms_visited': ['reddit.com', 'amazon.com'],
     'player_id': 1,
     'player_name': 'Marc',
     'player_pokemon': {},
     'time_played': 5.0},
 2: {'gyms_visited': ['stackoverflow.com', 'github.com'],
     'player_id': 2,
     'player_name': 'Dave',
     'player_pokemon': {},
     'time_played': 7.0}}


In [11]:
# create player 3
player_3 = {
    'player_id': 3,
    'player_name' : 'Rob',
    'time_played' : 15.0,
    'player_pokemon' : {},
    'gyms_visited' : [] }

In [12]:
# Add player_3 in players.
players[player_3['player_id']] = player_3

In [13]:
# Update player_3 gyms_visited
players[3]['gyms_visited'].extend(gyms[5:7])

In [14]:
pprint(player_3)

{'gyms_visited': ['netflix.com', 'sporcle.com'],
 'player_id': 3,
 'player_name': 'Rob',
 'player_pokemon': {},
 'time_played': 15.0}


In [15]:
pprint(players)

{1: {'gyms_visited': ['reddit.com', 'amazon.com'],
     'player_id': 1,
     'player_name': 'Marc',
     'player_pokemon': {},
     'time_played': 5.0},
 2: {'gyms_visited': ['stackoverflow.com', 'github.com'],
     'player_id': 2,
     'player_name': 'Dave',
     'player_pokemon': {},
     'time_played': 7.0},
 3: {'gyms_visited': ['netflix.com', 'sporcle.com'],
     'player_id': 3,
     'player_name': 'Rob',
     'player_pokemon': {},
     'time_played': 15.0}}


<img src="http://imgur.com/l5NasQj.png" style="float: left; margin: 25px 15px 0px 0px; height: 25px">

## 5. Add captured pokemon for each player

---

The `'player_pokemon'` keyed dictionaries for each player keep track of which of the pokemon each player has.

The keys of the `'player_pokemon'` dictionaries are the pokemon ids that correspond to the ids in the `pokedex` dictionary you created earlier. The values are integers specifying the stats for the pokemon.

Give player 1 a squirtle. Give player 2 a charmander and a bulbasaur.

Print the players dictionary after adding the pokemon for each player.


In [16]:
# Give pokemon to players
players[1]['player_pokemon'][2] = pokedex[2]
players[2]['player_pokemon'][1] = pokedex[3]
players[3]['player_pokemon'][3] = pokedex[1]

In [17]:
pprint(players)

{1: {'gyms_visited': ['reddit.com', 'amazon.com'],
     'player_id': 1,
     'player_name': 'Marc',
     'player_pokemon': {2: {'attack': 48,
                            'defense': 65,
                            'hp': 44,
                            'pokemon_name': 'squirtle',
                            'special_attack': 50,
                            'special_defense': 64,
                            'speed': 43,
                            'type': 'water'}},
     'time_played': 5.0},
 2: {'gyms_visited': ['stackoverflow.com', 'github.com'],
     'player_id': 2,
     'player_name': 'Dave',
     'player_pokemon': {1: {'attack': 49,
                            'defense': 49,
                            'hp': 60,
                            'pokemon_name': 'bulbasaur',
                            'special_attack': 65,
                            'special_defense': 65,
                            'speed': 45,
                            'type': 'GrassPoison'}},
     'time_played': 7.0},
 3

## 6. What gyms have players visited?

---

<img src="http://imgur.com/l5NasQj.png" style="float: left; margin: 25px 15px 0px 0px; height: 25px">

### 6.1

Write a for-loop that:

1. Iterates through the `gyms` list of gym locations you defined before.
2. For each gym, iterate through each player in the `players` dictionary with a second, internal for-loop.
3. If the player has visited the gym, print out "[player] has visited [gym location].", filling in [player] and [gym location] with the current player's name and current gym location.

In [18]:
for gym in gyms:
    for player, player_dict in list(players.items()):
        if gym in player_dict['gyms_visited']:
            print(player_dict['player_name']+' has visited '+gym+'.')

Marc has visited reddit.com.
Marc has visited amazon.com.
Rob has visited netflix.com.
Rob has visited sporcle.com.
Dave has visited stackoverflow.com.
Dave has visited github.com.


<img src="http://imgur.com/xDpSobf.png" style="float: left; margin: 25px 15px 0px 0px; height: 25px">

### 6.2

How many times did that loop run? If you have N gyms and also N players, how many times would it run as a function of N?

Can you think of a more efficient way to accomplish the same thing? 

(You can write your answer as Markdown text.)

In [19]:
# Number of times the body of the 6.1 inner loop runs
counter = [1 for gym in gyms for player in players]
sum(counter)

30

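The body of the inner loop in 6.1 runs 30 times: once for each of the 10 gyms, for each of the 3 players. With N gyms and N players it would run $N^2$ times. A more efficient approach is to loop over each player's `gyms_visited` list directly, so the work scales with the number of visits recorded rather than with every gym and player pair.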
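
As a minimal sketch of looping over visits instead of over every gym and player pair, the example below uses a small hypothetical `sample_players` dictionary (not the notebook's `players`) and assumes every `gyms_visited` value is a list. The function name `visited_messages` is also illustrative.

```python
# Hypothetical sample data, shaped like the players dictionary
sample_players = {
    1: {'player_name': 'Marc', 'gyms_visited': ['reddit.com', 'amazon.com']},
    2: {'player_name': 'Dave', 'gyms_visited': ['stackoverflow.com', 'github.com']},
}

def visited_messages(players_dict):
    """Build one message per (player, visited gym) pair."""
    messages = []
    for player in players_dict.values():
        # Only the gyms this player actually visited are touched
        for gym in player['gyms_visited']:
            messages.append(player['player_name'] + ' has visited ' + gym + '.')
    return messages

for message in visited_messages(sample_players):
    print(message)
```

The messages come out grouped by player rather than by gym, so this is a trade of output order for fewer iterations.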

<img src="http://imgur.com/l5NasQj.png" style="float: left; margin: 25px 15px 0px 0px; height: 25px">

## 7. Calculate player "power".

---

Define a function that will calculate a player's "power". Player power is defined as the sum of the base statistics of all their pokemon.

Your function will:

1. Accept the `players` dictionary, `pokedex` dictionary, and a player_id as arguments.
2. For the specified player_id, look up that player's pokemon and their level(s).
3. Find and aggregate the attack and defense values for each of the player's pokemon from the `pokedex` dictionary.
4. Print "[player name]'s power is [player power].", where the player power is the sum of the base statistics for all of their pokemon.
5. Return the player's power value.

Print out the pokemon power for each of your players.

In [20]:
def player_power(player_id, players_dict=players, pokedex_dict=pokedex):

    # Look up the player's pokemon (pokemon id -> stats dictionary).
    # The stats are stored with the player, so pokedex_dict is not consulted here.
    pokemon = players_dict[player_id]['player_pokemon']

    # Base statistics that count towards power
    power_values = ['hp', 'attack', 'defense', 'special_attack', 'special_defense', 'speed']

    # Sum the base statistics over all of the player's pokemon
    power = sum(value
                for stats in pokemon.values()
                for stat, value in stats.items()
                if stat in power_values)

    # Print the player's power, then return it
    print(players_dict[player_id]['player_name']+"'s", "power is", power)
    return power

In [21]:
power_1 = player_power(1)

power_2 = player_power(2)

Marc's power is 314
Dave's power is 333
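
Step 3 of the task describes finding the statistics in the `pokedex` dictionary. As a hedged sketch of that variant, the example below assumes players store only pokemon ids and uses hypothetical sample data (`sample_pokedex`, `sample_players`) and a hypothetical function name, `power_from_pokedex`.

```python
# Hypothetical sample data: each player stores only pokemon ids
sample_pokedex = {
    1: {'name': 'charmander', 'type': 'fire', 'hp': 39, 'attack': 52, 'defense': 43},
    2: {'name': 'squirtle', 'type': 'water', 'hp': 44, 'attack': 48, 'defense': 65},
}
sample_players = {
    1: {'player_name': 'Marc', 'player_pokemon': [1, 2]},
}

def power_from_pokedex(players_dict, pokedex_dict, player_id):
    """Sum every numeric stat of each pokemon id the player holds."""
    player = players_dict[player_id]
    power = 0
    for pokemon_id in player['player_pokemon']:
        stats = pokedex_dict[pokemon_id]
        # Skip the string fields (name, type) and add the numeric ones
        power += sum(v for v in stats.values() if isinstance(v, (int, float)))
    print("{}'s power is {}.".format(player['player_name'], power))
    return power

sample_power = power_from_pokedex(sample_players, sample_pokedex, 1)
```

Looking the statistics up by id keeps a single copy of each pokemon's data, so a change to the pokedex is reflected for every player.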


<img src="http://imgur.com/l5NasQj.png" style="float: left; margin: 25px 15px 0px 0px; height: 25px">

## 8. Load a pokedex file containing all the pokemon

---

### 8.1

While you were putting together the prototype code, your colleagues were preparing a dataset of Pokemon and their attributes. (This was a rush job, so they may have picked some crazy values for some...)

The code below loads information from a comma separated value (csv) file. You need to parse this string into a more useable format. The format of the string is:

- Rows are separated by newline characters: \n
- Columns are separated by commas: ,
- All cells in the csv are double quoted. Ex: "PokedexNumber" is the first cell of the first row.


Using for-loops, create a list of lists where each list within the overall list is a row of the csv/matrix, and each element in that list is a cell in that row. Additional criteria:

1. Quotes are removed from each cell item.
2. Numeric column values are converted to floats.
3. There are some cells that are empty and have no information. For these cells put a -1 value in place.

Your end result is effectively a matrix. Each list in the outer list is a row, and the *j*th elements of the list together form the *j*th column, which represents a data attribute. The first three lists in your pokedex list should look like this:

    ['PokedexNumber', 'Name', 'Type', 'Total', 'HP', 'Attack', 'Defense', 'SpecialAttack', 'SpecialDefense', 'Speed']
    [1.0, 'Bulbasaur', 'GrassPoison', 318.0, 45.0, 49.0, 49.0, 65.0, 65.0, 45.0]
    [2.0, 'Ivysaur', 'GrassPoison', 405.0, 60.0, 62.0, 63.0, 80.0, 80.0, 60.0]

In [22]:
raw_pd = ''
pokedex_file = 'pokedex_basic.csv'
with open(pokedex_file, 'r') as f:
    raw_pd = f.read()

In [23]:
raw_pd[0:1000]

'"PokedexNumber","Name","Type","Total","HP","Attack","Defense","SpecialAttack","SpecialDefense","Speed"\n"001","Bulbasaur","GrassPoison","318","45","49","49","65","65","45"\n"002","Ivysaur","GrassPoison","405","60","62","63","80","80","60"\n"003","Venusaur","GrassPoison","525","80","82","83","100","100","80"\n"003","VenusaurMega Venusaur","GrassPoison","625","80","100","123","122","120","80"\n"004","Charmander","Fire","309","39","52","43","60","50","65"\n"005","Charmeleon","Fire","405","58","64","58","80","65","80"\n"006","Charizard","FireFlying","534","78","84","78","109","85","100"\n"006","CharizardMega Charizard X","FireDragon","634","78","130","111","130","85","100"\n"006","CharizardMega Charizard Y","FireFlying","634","78","104","78","159","115","100"\n"007","Squirtle","Water","314","44","48","65","50","64","43"\n"008","Wartortle","Water","405","59","63","80","65","80","58"\n"009","Blastoise","Water","530","79","83","100","85","105","78"\n"009","BlastoiseMega Blastoise","Water","6

In [24]:
# Split the string into individual rows
split_pd = raw_pd.split("\n")
split_pd[0:10]

['"PokedexNumber","Name","Type","Total","HP","Attack","Defense","SpecialAttack","SpecialDefense","Speed"',
 '"001","Bulbasaur","GrassPoison","318","45","49","49","65","65","45"',
 '"002","Ivysaur","GrassPoison","405","60","62","63","80","80","60"',
 '"003","Venusaur","GrassPoison","525","80","82","83","100","100","80"',
 '"003","VenusaurMega Venusaur","GrassPoison","625","80","100","123","122","120","80"',
 '"004","Charmander","Fire","309","39","52","43","60","50","65"',
 '"005","Charmeleon","Fire","405","58","64","58","80","65","80"',
 '"006","Charizard","FireFlying","534","78","84","78","109","85","100"',
 '"006","CharizardMega Charizard X","FireDragon","634","78","130","111","130","85","100"',
 '"006","CharizardMega Charizard Y","FireFlying","634","78","104","78","159","115","100"']

In [25]:
# Loop to create list of lists
fixed_pd = []

for row in split_pd:
    row = row.replace('"', "")  # Remove quotes
    cells = row.split(",")      # Split the row into cells

    fixed_row = []  # List to hold the fixed values

    # Loop through the cells and amend each one
    for el in cells:
        if el.isdigit():
            # Convert numeric values to floats
            fixed_row.append(float(el))
        elif el == '':
            # Replace empty cells with -1 (instead of keeping the empty string)
            fixed_row.append(-1)
        else:
            fixed_row.append(el)

    fixed_pd.append(fixed_row)

In [26]:
fixed_pd[:10]

[['PokedexNumber',
  'Name',
  'Type',
  'Total',
  'HP',
  'Attack',
  'Defense',
  'SpecialAttack',
  'SpecialDefense',
  'Speed'],
 [1.0, 'Bulbasaur', 'GrassPoison', 318.0, 45.0, 49.0, 49.0, 65.0, 65.0, 45.0],
 [2.0, 'Ivysaur', 'GrassPoison', 405.0, 60.0, 62.0, 63.0, 80.0, 80.0, 60.0],
 [3.0, 'Venusaur', 'GrassPoison', 525.0, 80.0, 82.0, 83.0, 100.0, 100.0, 80.0],
 [3.0,
  'VenusaurMega Venusaur',
  'GrassPoison',
  625.0,
  80.0,
  100.0,
  123.0,
  122.0,
  120.0,
  80.0],
 [4.0, 'Charmander', 'Fire', 309.0, 39.0, 52.0, 43.0, 60.0, 50.0, 65.0],
 [5.0, 'Charmeleon', 'Fire', 405.0, 58.0, 64.0, 58.0, 80.0, 65.0, 80.0],
 [6.0, 'Charizard', 'FireFlying', 534.0, 78.0, 84.0, 78.0, 109.0, 85.0, 100.0],
 [6.0,
  'CharizardMega Charizard X',
  'FireDragon',
  634.0,
  78.0,
  130.0,
  111.0,
  130.0,
  85.0,
  100.0],
 [6.0,
  'CharizardMega Charizard Y',
  'FireFlying',
  634.0,
  78.0,
  104.0,
  78.0,
  159.0,
  115.0,
  100.0]]
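
The task asks for explicit for-loops, but as a point of comparison, here is a minimal sketch of the same three criteria using the standard library's `csv` module. It runs on a small hypothetical `sample_csv` string rather than the real file, and `parse_cell` is an illustrative helper name.

```python
import csv
import io

# Hypothetical sample in the same format as the pokedex file (one empty cell)
sample_csv = '"PokedexNumber","Name","Total"\n"001","Bulbasaur","318"\n"002","","405"'

def parse_cell(cell):
    """Apply the three criteria to one cell that has already been unquoted."""
    if cell == '':
        return -1           # empty cells become -1
    if cell.isdigit():
        return float(cell)  # numeric cells become floats
    return cell             # everything else stays a string

# csv.reader removes the double quotes and splits each row on commas
rows = [[parse_cell(cell) for cell in row]
        for row in csv.reader(io.StringIO(sample_csv))]
print(rows)
```

Unlike splitting on `","` by hand, `csv.reader` also copes with commas that appear inside a quoted cell.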

<img src="http://imgur.com/xDpSobf.png" style="float: left; margin: 25px 15px 0px 0px; height: 25px">

### 8.2 Parse the raw pokedex with list comprehensions

---

Perform the same parsing as above, but **using only a single list comprehension** instead of for loops. You may have nested list comprehensions within the main list comprehension! The output should be exactly the same.

In [27]:
# Same parsing as 8.1, written as a single (nested) list comprehension
[[float(s) if s.isdigit() else -1 if s == '' else s 
  for s in line.replace('"', '').split(',')] 
 for line in split_pd][:10]

[['PokedexNumber',
  'Name',
  'Type',
  'Total',
  'HP',
  'Attack',
  'Defense',
  'SpecialAttack',
  'SpecialDefense',
  'Speed'],
 [1.0, 'Bulbasaur', 'GrassPoison', 318.0, 45.0, 49.0, 49.0, 65.0, 65.0, 45.0],
 [2.0, 'Ivysaur', 'GrassPoison', 405.0, 60.0, 62.0, 63.0, 80.0, 80.0, 60.0],
 [3.0, 'Venusaur', 'GrassPoison', 525.0, 80.0, 82.0, 83.0, 100.0, 100.0, 80.0],
 [3.0,
  'VenusaurMega Venusaur',
  'GrassPoison',
  625.0,
  80.0,
  100.0,
  123.0,
  122.0,
  120.0,
  80.0],
 [4.0, 'Charmander', 'Fire', 309.0, 39.0, 52.0, 43.0, 60.0, 50.0, 65.0],
 [5.0, 'Charmeleon', 'Fire', 405.0, 58.0, 64.0, 58.0, 80.0, 65.0, 80.0],
 [6.0, 'Charizard', 'FireFlying', 534.0, 78.0, 84.0, 78.0, 109.0, 85.0, 100.0],
 [6.0,
  'CharizardMega Charizard X',
  'FireDragon',
  634.0,
  78.0,
  130.0,
  111.0,
  130.0,
  85.0,
  100.0],
 [6.0,
  'CharizardMega Charizard Y',
  'FireFlying',
  634.0,
  78.0,
  104.0,
  78.0,
  159.0,
  115.0,
  100.0]]

<img src="http://imgur.com/l5NasQj.png" style="float: left; margin: 25px 15px 0px 0px; height: 25px">

## 9. Write a function to generate the full pokedex

---

Write a function that recreates the pokedex you made before, but with the data read in from the full pokemon file. Create a unique key value for each entry in the pokemon dictionary.

Your function should:

1. Take the parsed pokedex information you created above as an argument.
2. Return a dictionary in the same format as your original pokedex you created before containing the information from the parsed full pokedex file.

To test the function, print out the pokemon with id = 100.

In [30]:
# Function to generate full pokedex
def full_pokedex(fixed_pd):

    header = fixed_pd[0]
    data = fixed_pd[1:]
    pokedex_dict = {}

    # Re-index the PokedexNumber column, which contains duplicates, so that
    # every row gets a unique key and no data is lost.
    # Note: this overwrites the first element of each row of fixed_pd in place.
    for i, row in enumerate(data):
        row[0] = float(i+1)

    # Create dictionary: key is the new id, value maps column name to cell.
    for row in data:
        pokemon_stat = {}
        for i, stat in enumerate(row[1:]):
            pokemon_stat[header[i+1]] = stat
        pokedex_dict[row[0]] = pokemon_stat
    return pokedex_dict

In [31]:
# Check no data loss.
assert len(fixed_pd[1:]) == len(full_pokedex(fixed_pd))

In [32]:
# Check raw data.
fixed_pd[200]

[200.0, 'Azumarill', 'WaterFairy', 420.0, 100.0, 50.0, 80.0, 60.0, 80.0, 50.0]

In [33]:
# Check raw data in dictionary.
full_pokedex(fixed_pd)[200]

{'Name': 'Azumarill',
 'Type': 'WaterFairy',
 'Total': 420.0,
 'HP': 100.0,
 'Attack': 50.0,
 'Defense': 80.0,
 'SpecialAttack': 60.0,
 'SpecialDefense': 80.0,
 'Speed': 50.0}
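
`full_pokedex` overwrites the first element of every parsed row as it re-indexes. If leaving the parsed data untouched is preferred, one possible variant is sketched below. The names `build_pokedex` and `sample_rows` are hypothetical, and the sample is a three-row stand-in for the parsed file.

```python
def build_pokedex(parsed_rows):
    """Return {sequential id: {column name: value}} without editing parsed_rows."""
    header = parsed_rows[0]
    result = {}
    # Counting from 1 gives each row a unique key, even when the
    # PokedexNumber column repeats (for example, Mega forms)
    for new_id, row in enumerate(parsed_rows[1:], start=1):
        result[new_id] = dict(zip(header[1:], row[1:]))
    return result

# Hypothetical sample with a repeated PokedexNumber
sample_rows = [
    ['PokedexNumber', 'Name', 'Total'],
    [3.0, 'Venusaur', 525.0],
    [3.0, 'VenusaurMega Venusaur', 625.0],
]
sample_pokedex = build_pokedex(sample_rows)
print(sample_pokedex[2])
```

Here the original `PokedexNumber` values stay in `sample_rows`, while the returned dictionary uses the new sequential ids as keys.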

<img src="http://i.imgur.com/GCAf1UX.png" style="float: left; margin: 25px 15px 0px 0px; height: 25px">

## 10. Write a function to generate a "filtered" pokedex
---
Your function should:
1. Take the parsed pokedex information you created above as an argument.
1. Take a dictionary as a parameter, with keys matching the features of the Pokedex. String values filter by exact match; numeric values keep records whose feature is greater than or equal to the value given in the dictionary.
1. Return multiple elements from the Pokedex

Example:

```python

# Only filter based on parameters passed
filter_options = {
    'Attack':   25,
    'Defense':  30,
    'Type':     'Electric'
}

# Return records with attack >= 25, defense >= 30, and type == "Electric"
# Also anticipate that other parameters can be passed, such as "SpecialAttack", "Speed", etc.
filtered_pokedex(pokedex_data, filter=filter_options)

# Example output:
# [{'Attack': 30.0,
#  'Defense': 50.0,
#  'HP': 40.0,
#  'Name': 'Voltorb',
#  'SpecialAttack': 55.0,
#  'SpecialDefense': 55.0,
#  'Speed': 100.0,
#  'Total': 330.0,
#  'Type': 'Electric'},
#  {'Attack': 30.0,
#  'Defense': 33.0,
#  'HP': 32.0,
#  'Name': 'Pikachu',
#  'SpecialAttack': 55.0,
#  'SpecialDefense': 55.0,
#  'Speed': 100.0,
#  'Total': 330.0,
#  'Type': 'Electric'},
#  ... etc
#  ]

```



In [34]:
def pokefilter(fixed_pd, filterdict):
    header = fixed_pd[0]
    pokelist = []

    # Skip the header row and test every pokemon against each filter
    for r in fixed_pd[1:]:
        check = {}
        for attr, val in filterdict.items():
            i = header.index(attr)
            if isinstance(val, str) and isinstance(r[i], str):
                # String features must match exactly
                check[attr] = (r[i] == val)
            elif not isinstance(val, str) and not isinstance(r[i], str):
                # Numeric features must be at least the filter value
                check[attr] = (r[i] >= val)
            else:
                # Mismatched types never match
                check[attr] = False

        # Keep the row only if it passed every filter
        if all(check.values()):
            pokelist.append(dict(zip(header, r)))

    return pokelist

In [35]:
filterdict = {
    'Attack':   25,
    'Defense':  30,
    'Type':     'Electric'
}

pokefilter(fixed_pd, filterdict)[20]

{'PokedexNumber': 518.0,
 'Name': 'Electivire',
 'Type': 'Electric',
 'Total': 540.0,
 'HP': 75.0,
 'Attack': 123.0,
 'Defense': 67.0,
 'SpecialAttack': 95.0,
 'SpecialDefense': 85.0,
 'Speed': 95.0}


## 11. Descriptive statistics on the prototype pokedex

<img src="http://imgur.com/l5NasQj.png" style="float: left; margin: 25px 15px 0px 0px; height: 25px">

### 11.1

What are the population mean and standard deviation of the "Total" attribute for all characters in the Pokedex?



In [37]:
# Convert 'Total' to list.
pokedex = full_pokedex(fixed_pd)
total = [v['Total'] for k, v in pokedex.items()]

# Calculate mean and standard deviation.
mean = np.mean(total)
std = np.std(total)
print('population mean: {:11}\nStandard deviation: {:.4f}'.format(mean, std))

population mean:    435.1275
Standard deviation: 119.9620
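
`np.std` divides by N by default (`ddof=0`), so the figure above is the population standard deviation the question asks for. As a small sketch of the difference from the sample standard deviation, the example below uses the standard library's `statistics` module on hypothetical values (`sample_totals` is not the real pokedex).

```python
import statistics

# Hypothetical "Total" values, not the real pokedex
sample_totals = [318.0, 405.0, 525.0, 625.0]

mean_total = statistics.mean(sample_totals)
population_std = statistics.pstdev(sample_totals)  # divides by N
sample_std = statistics.stdev(sample_totals)       # divides by N - 1

print(mean_total, population_std, sample_std)
```

With the whole Pokedex in hand, the population version is the appropriate one; the sample version would only apply if the rows were a sample drawn from a larger set.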


<img src="http://imgur.com/l5NasQj.png" style="float: left; margin: 25px 15px 0px 0px; height: 25px">

### 11.2

The game is no fun if the characters are wildly unbalanced! Are any characters "overpowered", which we'll define as having a "Total" more than three standard deviations from the population mean?

In [38]:
# Filter overpowered pokemon from pokedex.
overpowered = {k: v for k, v in pokedex.items() if v['Total'] > (mean + 3*std)}
print('No. of "overpowered" pokemon: %s\n' %(len(overpowered)))
pprint(overpowered)

No. of "overpowered" pokemon: 1

{164.0: {'Attack': 190.0,
         'Defense': 100.0,
         'HP': 126.0,
         'Name': 'MewtwoMega Mewtwo X',
         'SpecialAttack': 154.0,
         'SpecialDefense': 100.0,
         'Speed': 130.0,
         'Total': 800.0,
         'Type': 'PsychicFighting'}}
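
The cell above checks only totals above the mean, which matches the idea of "overpowered". Read literally, "more than three standard deviations from the mean" is two-sided; a minimal sketch of that reading follows, using hypothetical data (`sample_totals`, with made-up names and values).

```python
import statistics

# Hypothetical totals: twenty ordinary values plus two extremes
sample_totals = {'p%d' % i: 400.0 + i for i in range(20)}
sample_totals['weakling'] = 20.0
sample_totals['titan'] = 800.0

values = list(sample_totals.values())
mean_total = statistics.mean(values)
std_total = statistics.pstdev(values)

# abs() catches totals far below the mean as well as far above it
outliers = {name: total for name, total in sample_totals.items()
            if abs(total - mean_total) > 3 * std_total}
print(outliers)
```

Whether unusually weak characters matter for game balance is a design decision; the two-sided check simply makes them visible.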


<img src="http://imgur.com/xDpSobf.png" style="float: left; margin: 25px 15px 0px 0px; height: 25px">

## 12. Calibrate the frequency of Pokemon

The design team wants you to make the powerful Pokemon rare, and the weaklings more common. How would you set the probability $p_i$ of finding Pokemon *i* each time a player visits a gym?

Write a function that takes in a Pokedex number and returns a value $p_i$ for that character.

Hint: there are many ways you could do this. What do _you_ think makes sense? Start with simplifying assumptions: for example, you could assume that the probabilities of encountering any two Pokemon on one visit to a gym are independent of each other.

In [40]:
def rare_pokemon( pokemon_number, pokedex = fixed_pd ):

    # 'Total' column for every pokemon (skip the header row)
    total_attributes = [ info[3] for info in pokedex[1:] ]

    total_mean = np.mean(total_attributes)
    total_std = np.std(total_attributes)

    pokemon_total = pokedex[pokemon_number][3]

    # Absolute distance of this pokemon's total from the mean
    difference = abs(pokemon_total - total_mean)

    # Distance from the mean, in standard deviations
    multiplier = difference / total_std
    # Number of standard deviations between zero and the mean
    interval = total_mean / total_std
    # Probability shift per standard deviation
    probability_std = 0.5 / interval

    # Stronger than average: less likely; weaker than average: more likely
    if pokemon_total >= total_mean:
        probability = 0.5 - (probability_std * multiplier)
    else:
        probability = 0.5 + (probability_std * multiplier)
    return probability

In [49]:
rare_pokemon(50)

0.546110048204262
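
In `rare_pokemon` the standard deviation cancels out: both branches reduce to $p_i = 1 - \frac{T_i}{2\mu}$, where $T_i$ is the pokemon's "Total" and $\mu$ is the mean "Total". The sketch below illustrates that closed form on hypothetical totals; `encounter_probability` and `sample_totals` are illustrative names, not part of the notebook.

```python
def encounter_probability(total, mean_total):
    """Closed form of the rule above: 1 - total / (2 * mean_total)."""
    return 1 - total / (2 * mean_total)

# Hypothetical totals, not the real pokedex
sample_totals = [300.0, 400.0, 500.0, 800.0]
mean_total = sum(sample_totals) / len(sample_totals)

for total in sample_totals:
    print(total, encounter_probability(total, mean_total))
```

Under this rule an average pokemon has probability 0.5, the value falls linearly as the total rises, and it would turn negative for any total above twice the mean, so a real design would probably clamp it. The values are separate per-visit probabilities for each pokemon and are not normalized to sum to 1, which fits the hint's independence assumption.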