Causal reasoning benchmarks and tasks for large language models. For detailed reviews, please see Yang, L., Clivio, O., Shirvaikar, V., & Falck, F. (2023, December). A critical review of Causal Inference benchmarks for Large Language Models. In AAAI 2024 Workshop on''Are Large Language Models Simply Causal Parrots?''.