Enriching behavioral ecology with reinforcement learning methods
- PMID: 29412143
- DOI: 10.1016/j.beproc.2018.01.008
Enriching behavioral ecology with reinforcement learning methods
Abstract
This article focuses on the division of labor between evolution and development in solving sequential, state-dependent decision problems. Currently, behavioral ecologists tend to use dynamic programming methods to study such problems. These methods are successful at predicting animal behavior in a variety of contexts. However, they depend on a distinct set of assumptions. Here, we argue that behavioral ecology will benefit from drawing more than it currently does on a complementary collection of tools, called reinforcement learning methods. These methods allow for the study of behavior in highly complex environments, which conventional dynamic programming methods do not feasibly address. In addition, reinforcement learning methods are well-suited to studying how biological mechanisms solve developmental and learning problems. For instance, we can use them to study simple rules that perform well in complex environments. Or to investigate under what conditions natural selection favors fixed, non-plastic traits (which do not vary across individuals), cue-driven-switch plasticity (innate instructions for adaptive behavioral development based on experience), or developmental selection (the incremental acquisition of adaptive behavior based on experience). If natural selection favors developmental selection, which includes learning from environmental feedback, we can also make predictions about the design of reward systems. Our paper is written in an accessible manner and for a broad audience, though we believe some novel insights can be drawn from our discussion. We hope our paper will help advance the emerging bridge connecting the fields of behavioral ecology and reinforcement learning.
Keywords: Adaptation; Development; Dynamic programming; Evolution; Learning; Reinforcement learning.
Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Similar articles
-
On our best behavior: optimality models in human behavioral ecology.Stud Hist Philos Biol Biomed Sci. 2009 Jun;40(2):133-41. doi: 10.1016/j.shpsc.2009.03.005. Epub 2009 Apr 28. Stud Hist Philos Biol Biomed Sci. 2009. PMID: 19442928
-
Selective processes in development: implications for the costs and benefits of phenotypic plasticity.Integr Comp Biol. 2012 Jul;52(1):31-42. doi: 10.1093/icb/ics067. Epub 2012 Apr 27. Integr Comp Biol. 2012. PMID: 22544286 Review.
-
An integrated bayesian theory of phenotypic flexibility.Behav Processes. 2019 Apr;161:54-64. doi: 10.1016/j.beproc.2018.02.002. Epub 2018 Feb 8. Behav Processes. 2019. PMID: 29428826
-
Behavior systems and reinforcement: an integrative approach.J Exp Anal Behav. 1993 Jul;60(1):105-28. doi: 10.1901/jeab.1993.60-105. J Exp Anal Behav. 1993. PMID: 8354963 Free PMC article. Review.
-
Reward-dependent learning in neuronal networks for planning and decision making.Prog Brain Res. 2000;126:217-29. doi: 10.1016/S0079-6123(00)26016-0. Prog Brain Res. 2000. PMID: 11105649 Review.
Cited by
-
The Uncontrollable Mortality Risk Hypothesis: Theoretical foundations and implications for public health.Evol Med Public Health. 2024 May 9;12(1):86-96. doi: 10.1093/emph/eoae009. eCollection 2024. Evol Med Public Health. 2024. PMID: 38807860 Free PMC article.
-
Behavioral selection in structured populations.Theory Biosci. 2024 Jun;143(2):97-105. doi: 10.1007/s12064-024-00413-8. Epub 2024 Mar 5. Theory Biosci. 2024. PMID: 38441745 Free PMC article.
-
From beasts to bytes: Revolutionizing zoological research with artificial intelligence.Zool Res. 2023 Nov 18;44(6):1115-1131. doi: 10.24272/j.issn.2095-8137.2023.263. Zool Res. 2023. PMID: 37933101 Free PMC article. Review.
-
Efficiency traps beyond the climate crisis: exploration-exploitation trade-offs and rebound effects.Philos Trans R Soc Lond B Biol Sci. 2023 Nov 6;378(1889):20220405. doi: 10.1098/rstb.2022.0405. Epub 2023 Sep 18. Philos Trans R Soc Lond B Biol Sci. 2023. PMID: 37718604 Free PMC article.
-
An evolutionary model of sensitive periods when the reliability of cues varies across ontogeny.Behav Ecol. 2021 Oct 25;33(1):101-114. doi: 10.1093/beheco/arab113. eCollection 2022 Jan-Feb. Behav Ecol. 2021. PMID: 35197808 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources