Bellman, Richard. 1957. Dynamic Programming. Princeton University Press.
Bertsekas, Dimitri P. 2019. Reinforcement Learning and Optimal Control. Athena Scientific.
Friston, Karl. 2010. “The Free-Energy Principle: A Unified Brain Theory?” Nature Reviews Neuroscience 11 (2): 127–38.
Friston, Karl. 2013. “Life as We Know It.” Journal of The Royal Society Interface 10 (86): 20130475.
Friston, Karl, James Kilner, and Lee Harrison. 2006. “A Free Energy Principle for the Brain.” Journal of Physiology-Paris 100 (1–3): 70–87.
Murphy, Susan A. 2003. “Optimal Dynamic Treatment Regimes.” Journal of the Royal Statistical Society: Series B 65 (2): 331–55.
Pearl, Judea. 1988. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann.
Pearl, Judea. 2009. Causality: Models, Reasoning, and Inference. 2nd ed. Cambridge University Press.
Pearl, Judea. 2014. “Probabilistic and Causal Inference: The Works of Judea Pearl.” ACM Turing Award Lecture.
Puterman, Martin L. 2014. Markov Decision Processes: Discrete Stochastic Dynamic Programming. 2nd ed. Wiley.
Schulam, Peter, and Suchi Saria. 2017. “Reliable Decision Support Using Counterfactual Models.” Advances in Neural Information Processing Systems 30.
Sutton, Richard S., and Andrew G. Barto. 2018. Reinforcement Learning: An Introduction. 2nd ed. MIT Press.
Tennenholtz, Guy, Assaf Hallak, Shie Mannor, Uri Shalit, Lior Shani, and Aviv Tamar. 2020. “Off-Policy Evaluation in Partially Observable Environments.” Proceedings of the AAAI Conference on Artificial Intelligence 34 (04): 6148–56.
Whitehead, Alfred North. 1929. The Function of Reason. Princeton University Press.