### Performance Analysis of Transformer Models on Stroop Task
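This notebook analyzes the percent-correct scores of GPT-4o and Sonnet 3.5 on the Stroop task across three conditions (congruent, incongruent, neutral) and five word list lengths (1, 5, 10, 20, 40), as a probe of their executive control capabilities.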

In [None]:
import pandas as pd
import matplotlib.pyplot as plt

# Data from the study
performance_data = {
    'Condition': ['Congruent', 'Congruent', 'Congruent', 'Congruent', 'Congruent',
                  'Incongruent', 'Incongruent', 'Incongruent', 'Incongruent', 'Incongruent',
                  'Neutral', 'Neutral', 'Neutral', 'Neutral', 'Neutral'],
    'Word List Length': [1, 5, 10, 20, 40, 1, 5, 10, 20, 40, 1, 5, 10, 20, 40],
    'Percent Correct - GPT-4o': [100, 100, 99, 99, 89, 100, 91, 57, 22, 15, 100, 99, 94, 75, 32],
    'Percent Correct - Sonnet 3.5': [83, 100, 90, 99, 92, 100, 97, 75, 76, 24, 73, 100, 96, 78, 27]
}

# Create DataFrame
df = pd.DataFrame(performance_data)

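# Plot one line per (condition, model) pair. Plotting a whole column at once
# would join all 15 points and wrap back from length 40 to length 1.
models = ['Percent Correct - GPT-4o', 'Percent Correct - Sonnet 3.5']
plt.figure(figsize=(12, 6))
for condition, group in df.groupby('Condition', sort=False):
    for model in models:
        # Solid lines for GPT-4o, dashed lines for Sonnet 3.5
        linestyle = '-' if 'GPT-4o' in model else '--'
        plt.plot(group['Word List Length'], group[model], marker='o', linestyle=linestyle,
                 label=f"{condition} - {model.replace('Percent Correct - ', '')}")

plt.title('Performance of GPT-4o and Sonnet 3.5 on the Stroop Task')
plt.xlabel('Word List Length')
plt.ylabel('Percent Correct')
plt.xticks(sorted(df['Word List Length'].unique()))
plt.legend()
plt.grid()
plt.show()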

### Discussion
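Both models score 91% or higher at a list length of 5 in every condition, but accuracy degrades as the word list grows, most sharply in the incongruent condition. At a list length of 40, GPT-4o scores 15% and Sonnet 3.5 scores 24% on incongruent lists, compared with 89% and 92% on congruent lists. Neutral accuracy also falls at length 40 (32% and 27%). Sonnet 3.5 is also less accurate at length 1 than at length 5 in the congruent (83%) and neutral (73%) conditions. The widening gap between congruent and incongruent accuracy on longer lists is the pattern this analysis reads as a limitation in executive control.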
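One way to make the congruent versus incongruent comparison concrete is to subtract the two conditions at each list length. The sketch below does this with the percent-correct values from the data cell above. The name `interference` and the choice of a plain difference in percentage points are assumptions made for illustration, not a metric taken from the study.

```python
import pandas as pd

lengths = [1, 5, 10, 20, 40]

# Percent-correct values copied from the data cell above
scores = pd.DataFrame({
    'Condition': ['Congruent'] * 5 + ['Incongruent'] * 5 + ['Neutral'] * 5,
    'Word List Length': lengths * 3,
    'GPT-4o': [100, 100, 99, 99, 89, 100, 91, 57, 22, 15, 100, 99, 94, 75, 32],
    'Sonnet 3.5': [83, 100, 90, 99, 92, 100, 97, 75, 76, 24, 73, 100, 96, 78, 27],
})

# Index by (condition, length) so that two conditions can be subtracted,
# aligned on list length
by_condition = scores.set_index(['Condition', 'Word List Length'])

# Assumed illustrative metric: congruent minus incongruent accuracy,
# in percentage points, for each model and list length
interference = by_condition.loc['Congruent'] - by_condition.loc['Incongruent']
print(interference)
```

A larger positive value means the model loses more accuracy when the word and its color conflict. Negative values can occur where incongruent accuracy exceeds congruent accuracy, as it does for Sonnet 3.5 at length 1.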

In [None]:
# Further analysis can be added here.
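As one possible follow-up, the sketch below reshapes the same values into a table with one row per model and condition, then computes how far accuracy falls between the shortest and longest lists. The variable names (`tidy`, `table`, `drop`) are hypothetical, and the simple length-1 minus length-40 difference is an assumed summary rather than a statistic from the study.

```python
import pandas as pd

lengths = [1, 5, 10, 20, 40]

# Percent-correct values copied from the data cell above
scores = pd.DataFrame({
    'Condition': ['Congruent'] * 5 + ['Incongruent'] * 5 + ['Neutral'] * 5,
    'Word List Length': lengths * 3,
    'GPT-4o': [100, 100, 99, 99, 89, 100, 91, 57, 22, 15, 100, 99, 94, 75, 32],
    'Sonnet 3.5': [83, 100, 90, 99, 92, 100, 97, 75, 76, 24, 73, 100, 96, 78, 27],
})

# Long ("tidy") format: one row per condition, list length, and model
tidy = scores.melt(id_vars=['Condition', 'Word List Length'],
                   var_name='Model', value_name='Percent Correct')

# Wide table: rows are (model, condition), columns are list lengths
table = tidy.pivot(index=['Model', 'Condition'],
                   columns='Word List Length', values='Percent Correct')

# Assumed summary: percentage points lost between length 1 and length 40
drop = table[1] - table[40]
print(table)
print(drop.sort_values(ascending=False))
```

Sorting `drop` in descending order puts the model and condition pairs with the steepest decline first.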




