CM-DQN: A Value-Based Deep Reinforcement Learning Model to Simulate Confirmation Bias [https://arxiv.org/abs/2407.07454]

NYU DS-GA 1016 Computational Cognitive Modelling Final Project

In human decision-making tasks, individuals learn through trials and prediction errors. When individuals learn the task, some are more influenced by good outcomes, while others weigh bad outcomes more heavily. Such confirmation bias can lead to different learning effects. In this study, we propose a new algorithm in Deep Reinforcement Learning, CM-DQN, which applies the idea of varying update strategies for positive or negative prediction errors, to simulate the human decision-making process when the task's states are continuous while the actions are discrete. We test CM-DQN in a Lunar Lander environment with confirmatory, disconfirmatory, and non-biased bias to observe the learning effects. Moreover, we apply the confirmation model in a multi-armed bandit problem (environment in discrete states and discrete actions), which utilizes the same idea as our proposed algorithm, as a contrast experiment to algorithmically simulate the impact of different confirmation biases in the decision-making process. In both experiments, confirmatory bias indicates better learning effects.

How to start

fork the repo by

git clone https://github.com/Patrickhshs/CM-DQN

change the root path and run the experiment files like the following way:

python ccm_comfirmation_bias_rl.py

Below is the result running on seed 2024:

Contributor

Jiacheng Shen (shen.patrick.jiacheng@nyu.edu)
Lihan Feng (lf2383@nyu.edu)

If you used our work or code, please cite us in your work by

@misc{shen2024cmdqnvaluebaseddeepreinforcement,
      title={CM-DQN: A Value-Based Deep Reinforcement Learning Model to Simulate Confirmation Bias}, 
      author={Jiacheng Shen and Lihan Feng},
      year={2024},
      eprint={2407.07454},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2407.07454}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
DS_GA_1013_CCM_Final_Report.pdf		DS_GA_1013_CCM_Final_Report.pdf
Experiment1-confirmation_model_multi_bandit.py		Experiment1-confirmation_model_multi_bandit.py
README.md		README.md
ccm_comfirmation_bias_rl.py		ccm_comfirmation_bias_rl.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CM-DQN: A Value-Based Deep Reinforcement Learning Model to Simulate Confirmation Bias [https://arxiv.org/abs/2407.07454]

NYU DS-GA 1016 Computational Cognitive Modelling Final Project

How to start

Contributor

If you used our work or code, please cite us in your work by

About

Releases

Packages

Languages

Patrickhshs/CM-DQN

Folders and files

Latest commit

History

Repository files navigation

CM-DQN: A Value-Based Deep Reinforcement Learning Model to Simulate Confirmation Bias [https://arxiv.org/abs/2407.07454]

NYU DS-GA 1016 Computational Cognitive Modelling Final Project

How to start

Contributor

If you used our work or code, please cite us in your work by

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages