## Stroop Effect

### Introduction

In a Stroop task, participants are presented with a list of words, each displayed in a particular ink color. The participant’s task is to say out loud the color of the ink in which each word is printed. The task has two conditions: a congruent words condition and an incongruent words condition. In the congruent words condition, the words displayed are color words whose names match the colors in which they are printed. In the incongruent words condition, the words displayed are color words whose names do not match the colors in which they are printed. In each case, we measure the time it takes to name the ink colors in equally sized lists. Each participant completes both conditions and records a time for each.

The following image demonstrates both conditions. The first line shows the congruent words condition. The second line shows the incongruent words condition.
<img style="float: center;" src="https://upload.wikimedia.org/wikipedia/commons/f/fe/Stroop_effect_memory_test.png">


### Method

#### Design

In this study, we measure the time in seconds that each participant takes to name the ink colors in equally sized lists under the two conditions. Our independent variable is the condition applied to the participant, either congruent or incongruent. Our dependent variable is the time in seconds that the participant takes to name the ink colors in the list.

#### Hypothesis

We want to determine whether the difference in mean naming time between the two conditions (congruent and incongruent) is statistically significant. Therefore we will take the following as our hypotheses:

\begin{equation}
\begin{aligned}
H_0 &: \mu_a - \mu_0 = 0 \\
H_a &: \mu_a - \mu_0 \neq 0
\end{aligned}
\end{equation}

where $\mu_0$ is the population mean naming time in the congruent condition and $\mu_a$ is the population mean naming time in the incongruent condition.

To test our hypotheses we will use a paired t-test, because each participant performs under both conditions, congruent and incongruent, so the two sets of times are dependent. Although we have an intuitive sense that times will be longer in the incongruent condition, we will use a two-tailed test in case our intuition is incorrect.
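
As a sketch of the statistic behind this test, assume $n$ participants and write $d_i$ for the $i$-th participant's incongruent time minus their congruent time. The symbols $n$, $d_i$, $\bar{d}$ and $s_d$ are notation introduced here for illustration and do not appear in the original assignment.

\begin{equation}
\bar{d} = \frac{1}{n}\sum_{i=1}^{n} d_i, \qquad
s_d = \sqrt{\frac{1}{n-1}\sum_{i=1}^{n}\left(d_i - \bar{d}\right)^2}, \qquad
t = \frac{\bar{d}}{s_d / \sqrt{n}}
\end{equation}

Under $H_0$, and assuming the differences are approximately normally distributed, $t$ follows a t-distribution with $n - 1$ degrees of freedom.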

We will use the following $\alpha$ to determine statistical significance.

\begin{equation}
\alpha = .05
\end{equation}



### Results

In [4]:
import pandas as pd

# Load the Stroop timing data from GitHub
path = 'https://raw.githubusercontent.com/thrabchak/Udacity-Data-Analysis/master/P1%20Stroop%20Effect/stroopdata.csv'
dataFrame = pd.read_csv(path)
# dataFrame  # uncomment to display the loaded data

#### Descriptive Statistics
- mean
- stdev
- graph
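
The mean and standard deviation listed above could be computed along the following lines. This is a minimal sketch using only the Python standard library. The two lists are hypothetical times invented for illustration, not the study data, and `describe` is a helper name introduced here.

```python
import statistics

# Hypothetical naming times in seconds (illustrative only, NOT the study data).
congruent = [12.0, 15.5, 9.8, 14.2, 11.5, 13.0]
incongruent = [19.5, 22.0, 16.3, 24.1, 18.2, 20.9]


def describe(times):
    """Return the mean and the sample standard deviation (n - 1 denominator)."""
    return statistics.mean(times), statistics.stdev(times)


for label, times in (("congruent", congruent), ("incongruent", incongruent)):
    mean, sd = describe(times)
    print(f"{label}: mean = {mean:.2f} s, sd = {sd:.2f} s")
```

With the data in a pandas DataFrame, `Series.mean()` and `Series.std()` give the same two quantities, since `std()` also uses the $n - 1$ denominator by default.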

#### Inferential Statistics
- hypothesis test
- confidence intervals
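
One way to carry out the paired, two-tailed test and build a 95% confidence interval for the mean difference is sketched below with the standard library. The times are hypothetical and are not the study data. `paired_t_test` is a helper name introduced here, and the critical value 2.571 is the two-tailed t-table value for $\alpha = .05$ with 5 degrees of freedom, which matches only this six-participant example.

```python
import math
import statistics

# Hypothetical naming times in seconds (illustrative only, NOT the study data).
congruent = [12.0, 15.5, 9.8, 14.2, 11.5, 13.0]
incongruent = [19.5, 22.0, 16.3, 24.1, 18.2, 20.9]

# Two-tailed critical t value for alpha = .05 and df = 5, from a standard t-table.
# A different sample size needs a different critical value.
T_CRITICAL = 2.571


def paired_t_test(first, second, t_critical):
    """Return the paired t statistic and a confidence interval for the mean difference."""
    diffs = [b - a for a, b in zip(first, second)]
    n = len(diffs)
    mean_diff = statistics.mean(diffs)
    # Standard error of the mean difference
    std_err = statistics.stdev(diffs) / math.sqrt(n)
    t_stat = mean_diff / std_err
    margin = t_critical * std_err
    return t_stat, (mean_diff - margin, mean_diff + margin)


t_stat, ci = paired_t_test(congruent, incongruent, T_CRITICAL)
# Reject the null hypothesis when |t| exceeds the critical value
reject_null = abs(t_stat) > T_CRITICAL
print(f"t = {t_stat:.2f}, 95% CI = ({ci[0]:.2f}, {ci[1]:.2f}), reject H0: {reject_null}")
```

The interval is centered on the mean of the paired differences, so an interval that excludes zero agrees with rejecting $H_0$ at the same $\alpha$.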

#### Effect Size Measures
- d, r^2
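
The two effect size measures named above can be sketched as follows, assuming the paired-samples definitions: Cohen's d as the mean difference divided by the standard deviation of the differences, and $r^2 = t^2 / (t^2 + df)$. The times are hypothetical, not the study data, and `effect_sizes` is a helper name introduced here.

```python
import math
import statistics

# Hypothetical naming times in seconds (illustrative only, NOT the study data).
congruent = [12.0, 15.5, 9.8, 14.2, 11.5, 13.0]
incongruent = [19.5, 22.0, 16.3, 24.1, 18.2, 20.9]


def effect_sizes(first, second):
    """Return Cohen's d and r^2 for paired samples."""
    diffs = [b - a for a, b in zip(first, second)]
    n = len(diffs)
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)
    # Cohen's d: mean difference in units of the standard deviation of the differences
    cohens_d = mean_diff / sd_diff
    # r^2: proportion of variance in the differences attributable to the condition
    t_stat = mean_diff / (sd_diff / math.sqrt(n))
    r_squared = t_stat ** 2 / (t_stat ** 2 + (n - 1))
    return cohens_d, r_squared


cohens_d, r_squared = effect_sizes(congruent, incongruent)
print(f"Cohen's d = {cohens_d:.2f}, r^2 = {r_squared:.3f}")
```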

Let's do a basic statistical calculation on the data using code! Run the block of code below to calculate the average "Food Pinching Efficiency" for all 31 participants and all chopstick lengths.

In [11]:
dataFrame['Food.Pinching.Efficiency'].mean()

25.00559139784947

In [12]:
#TODO

meansByChopstickLength = dataFrame.groupby('Chopstick.Length')['Food.Pinching.Efficiency'].mean().reset_index()
meansByChopstickLength

# reset_index() turns Chopstick.Length from the index into an ordinary column, so the index becomes the row numbers 0, 1, 2, 3, 4, 5 instead of the chopstick lengths.

| | Chopstick.Length | Food.Pinching.Efficiency |
|---|---|---|
| 0 | 180 | 24.935161 |
| 1 | 210 | 25.483871 |
| 2 | 240 | 26.322903 |
| 3 | 270 | 24.323871 |
| 4 | 300 | 24.968065 |
| 5 | 330 | 23.999677 |


In [4]:
#TODO

# Display plots within the notebook rather than in a new window
%matplotlib inline

import matplotlib.pyplot as plt

# Scatter plot of mean efficiency against chopstick length
# (meansByChopstickLength must already be defined by the cell above)
plt.scatter(x=meansByChopstickLength['Chopstick.Length'], y=meansByChopstickLength['Food.Pinching.Efficiency'])
plt.xlabel("Length in mm")
plt.ylabel("Average Efficiency in PPPC")
plt.title("Average Food Pinching Efficiency by Chopstick Length in the Adult Population")
plt.show()

### Conclusion




### References

This report is a P1: Test a Perceptual Phenomenon submission for a Udacity Data Analysis Nanodegree. The introduction of this report is taken from the background section of the instructions for this report, which can be found [here](https://docs.google.com/document/d/1-OkpZLjG_kX9J6LIQ5IltsqMzVWjh36QpnP2RYpVdPU/pub?embedded=True). The dataset for this report was provided by Udacity and can be found [here](https://www.google.com/url?q=https://drive.google.com/file/d/0B9Yf01UaIbUgQXpYb2NhZ29yX1U/view?usp%3Dsharing&sa=D&usg=AFQjCNGAjbK9VYD5GsQ8c_iRT9zH9QdOVg). To learn more about Udacity and its online classes, go to http://www.udacity.com.

The example [Stroop Effect image](https://commons.wikimedia.org/wiki/File:Stroop_effect_memory_test.png) is from Wikipedia and is licensed under a Creative Commons License.

