# Chi-Squared test of independence

Following on from our earlier example, is the way a business collects marketing information independent of the marketing strategy it uses to target customers? A chi-squared test of independence can answer this kind of question. We use data from [THE INFLUENCE OF MARKETING INTELLIGENCE ON PERFORMANCES OF ROMANIAN RETAILERS](http://conferinta.management.ase.ro/archives/2014/pdf/32.pdf). Romanian retailers want to promote a new eco-label product and are considering three marketing strategies: strategic, tactical and operational. Is the strategy independent of the way the marketing information was collected?

### The data
The data comes from the [paper](http://conferinta.management.ase.ro/archives/2014/pdf/32.pdf). There are three different marketing strategies, namely strategic, tactical and operational. There are three sources of marketing intelligence, namely the retail sector, promotion events and information from competitors. 

In [None]:
import pandas as pd
import scipy.stats as stats

In [None]:
%matplotlib inline

In [None]:
raw_data = {'intelligence': ['retail', 'retail', 'retail', 'promotion', 'promotion', 'promotion', 'competitors', 'competitors', 'competitors'], 
        'strategy': ['strategic', 'tactical', 'operational', 'strategic', 'tactical', 'operational', 'strategic', 'tactical', 'operational'], 
        'scores':[13,9,17,8,12,15,3,7,6]}
data = pd.DataFrame(raw_data, columns = ['intelligence', 'strategy', 'scores'])
data

Create a pivot table of the entries. The pivot table is the contingency table to which we apply the chi-squared test.

In [None]:
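# Rows are intelligence sources, columns are strategies, cells are the scores.
# Passing values='scores' keeps the column labels flat instead of nested.
observed = data.pivot(index='intelligence', columns='strategy', values='scores')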

In [None]:
observed
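The test compares each cell with what the row and column totals alone would predict, so it can help to look at those totals first. The following is a minimal sketch, not part of the original notebook: it rebuilds the same counts under the hypothetical name `counts` so that it runs on its own, and uses `pivot_table` with `margins=True` to append the totals.

```python
import pandas as pd

# Same counts as in the data cell above, rebuilt so this block is self-contained.
# The name `counts` is an illustrative choice, not from the original notebook.
counts = pd.DataFrame({
    'intelligence': ['retail'] * 3 + ['promotion'] * 3 + ['competitors'] * 3,
    'strategy': ['strategic', 'tactical', 'operational'] * 3,
    'scores': [13, 9, 17, 8, 12, 15, 3, 7, 6],
})

# margins=True appends an 'All' row and an 'All' column holding the totals.
with_totals = counts.pivot_table(
    index='intelligence', columns='strategy', values='scores',
    aggfunc='sum', margins=True,
)
print(with_totals)
```

The bottom-right `All` cell is the grand total of the table. Note that the totals are only for inspection; the test itself should be run on the table without the `All` row and column.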

### The hypothesis

We state the null and alternative hypotheses as follows:

$H_0:$ the marketing strategy is independent of the source of marketing intelligence.

$H_1:$ the marketing strategy is not independent of the source of marketing intelligence.

We use a 5% significance level to test the null hypothesis.
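An equivalent way to read the 5% level is as a critical value for the test statistic. The sketch below assumes the 3 x 3 table from this example, so the degrees of freedom are (3 - 1) x (3 - 1); the variable names are illustrative.

```python
import scipy.stats as stats

alpha = 0.05                 # significance level stated above
dof = (3 - 1) * (3 - 1)      # (rows - 1) * (columns - 1) for a 3 x 3 table

# Upper-tail critical value: a test statistic above this gives p < alpha.
critical_value = stats.chi2.ppf(1 - alpha, dof)
print('Degrees of freedom', dof)
print('Critical value', critical_value)
```

A test statistic larger than this critical value would lead us to reject $H_0$ at the 5% level; comparing the p-value with 0.05 gives the same decision.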

In [None]:
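# chi2_contingency returns the test statistic, the p-value,
# the degrees of freedom and the table of expected counts
chi2, p, dof, expected = stats.chi2_contingency(observed=observed)
print('Test statistic', chi2)
print('p-value', p)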

### The results

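A p-value of 0.5468 is well above the 5% significance level, so we fail to reject the null hypothesis. The data provide no evidence of an association between the marketing strategy and the source of marketing intelligence. Note that failing to reject $H_0$ does not prove independence; it only means this sample is consistent with it.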
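To see where these numbers come from, here is a minimal sketch that recomputes the statistic by hand and compares it with SciPy. It is an illustration rather than part of the original analysis: the counts are typed in as a NumPy array (rows and columns in alphabetical order, as the pivot table sorts them), and all variable names are assumptions.

```python
import numpy as np
import scipy.stats as stats

# Observed counts. Rows: competitors, promotion, retail.
# Columns: operational, strategic, tactical.
table = np.array([[6, 3, 7],
                  [15, 8, 12],
                  [17, 13, 9]])

row_totals = table.sum(axis=1, keepdims=True)
col_totals = table.sum(axis=0, keepdims=True)
n = table.sum()

# Under independence, expected count = row total * column total / grand total.
expected_manual = row_totals * col_totals / n

# Chi-squared statistic: sum over cells of (observed - expected)^2 / expected.
chi2_manual = ((table - expected_manual) ** 2 / expected_manual).sum()
dof_manual = (table.shape[0] - 1) * (table.shape[1] - 1)
p_manual = stats.chi2.sf(chi2_manual, dof_manual)  # upper-tail probability

chi2_scipy, p_scipy, dof_scipy, expected_scipy = stats.chi2_contingency(table)
print('Manual:', chi2_manual, p_manual)
print('SciPy: ', chi2_scipy, p_scipy)
print('Smallest expected count', expected_manual.min())
```

The two results should agree, because for tables with more than one degree of freedom `chi2_contingency` applies no continuity correction. The expected counts are also worth a glance: a common rule of thumb asks for expected counts of about 5 or more, and two of the competitors cells fall slightly below that here, so the p-value is best treated as approximate.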

### Exercise

Now that you have seen a chi-squared test of independence applied to the marketing strategy example, try another example to check your understanding. Two candidates stood for election, and the table below shows the number of people who voted for each candidate, by age group. Answer the questions that follow to apply the chi-squared test of independence yourself.

| Age group | Candidate A | Candidate B |
|---|---|---|
| 18-25 | 2670 | 1560 |
| 25-40 | 13578 | 4121 |
| 40-60 | 13578 | 4121 |
| Over 60 | 13578 | 4121 |

#### Question 1
Determine whether the two categorical variables are independent or whether there is an association.

#### Question 1.1 
State the null and alternative hypotheses.

In [None]:
# your answer

#### Question 1.2
Conduct the hypothesis test and interpret your results.

In [None]:
# your answer

Now that you have completed this exercise, what are some applications in your own work where this test would be useful? Discuss below.