Enriching behavioral ecology with reinforcement learning methods

Willem E Frankenhuis¹, Karthik Panchanathan², Andrew G Barto³

Affiliations

¹ Behavioural Science Institute, Radboud University, Montessorilaan 3, PO Box 9104, 6500, HE, Nijmegen, The Netherlands. Electronic address: w.frankenhuis@psych.ru.nl.
² Department of Anthropology, University of Missouri, United States.
³ College of Information and Computer Sciences, University of Massachusetts Amherst, United States.

PMID: 29412143
DOI: 10.1016/j.beproc.2018.01.008

Free article

Review

Enriching behavioral ecology with reinforcement learning methods

Willem E Frankenhuis et al. Behav Processes. 2019 Apr.

Free article

. 2019 Apr:161:94-100.

doi: 10.1016/j.beproc.2018.01.008. Epub 2018 Feb 13.

Authors

Willem E Frankenhuis¹, Karthik Panchanathan², Andrew G Barto³

Affiliations

¹ Behavioural Science Institute, Radboud University, Montessorilaan 3, PO Box 9104, 6500, HE, Nijmegen, The Netherlands. Electronic address: w.frankenhuis@psych.ru.nl.
² Department of Anthropology, University of Missouri, United States.
³ College of Information and Computer Sciences, University of Massachusetts Amherst, United States.

PMID: 29412143
DOI: 10.1016/j.beproc.2018.01.008

Abstract

This article focuses on the division of labor between evolution and development in solving sequential, state-dependent decision problems. Currently, behavioral ecologists tend to use dynamic programming methods to study such problems. These methods are successful at predicting animal behavior in a variety of contexts. However, they depend on a distinct set of assumptions. Here, we argue that behavioral ecology will benefit from drawing more than it currently does on a complementary collection of tools, called reinforcement learning methods. These methods allow for the study of behavior in highly complex environments, which conventional dynamic programming methods do not feasibly address. In addition, reinforcement learning methods are well-suited to studying how biological mechanisms solve developmental and learning problems. For instance, we can use them to study simple rules that perform well in complex environments. Or to investigate under what conditions natural selection favors fixed, non-plastic traits (which do not vary across individuals), cue-driven-switch plasticity (innate instructions for adaptive behavioral development based on experience), or developmental selection (the incremental acquisition of adaptive behavior based on experience). If natural selection favors developmental selection, which includes learning from environmental feedback, we can also make predictions about the design of reward systems. Our paper is written in an accessible manner and for a broad audience, though we believe some novel insights can be drawn from our discussion. We hope our paper will help advance the emerging bridge connecting the fields of behavioral ecology and reinforcement learning.

Keywords: Adaptation; Development; Dynamic programming; Evolution; Learning; Reinforcement learning.

PubMed Disclaimer

Cited by

The Uncontrollable Mortality Risk Hypothesis: Theoretical foundations and implications for public health.
Brown RD, Pepper GV. Brown RD, et al. Evol Med Public Health. 2024 May 9;12(1):86-96. doi: 10.1093/emph/eoae009. eCollection 2024. Evol Med Public Health. 2024. PMID: 38807860 Free PMC article.
Behavioral selection in structured populations.
Borgstede M. Borgstede M. Theory Biosci. 2024 Jun;143(2):97-105. doi: 10.1007/s12064-024-00413-8. Epub 2024 Mar 5. Theory Biosci. 2024. PMID: 38441745 Free PMC article.
From beasts to bytes: Revolutionizing zoological research with artificial intelligence.
Zhang YJ, Luo Z, Sun Y, Liu J, Chen Z. Zhang YJ, et al. Zool Res. 2023 Nov 18;44(6):1115-1131. doi: 10.24272/j.issn.2095-8137.2023.263. Zool Res. 2023. PMID: 37933101 Free PMC article. Review.
Efficiency traps beyond the climate crisis: exploration-exploitation trade-offs and rebound effects.
Segovia-Martin J, Creutzig F, Winters J. Segovia-Martin J, et al. Philos Trans R Soc Lond B Biol Sci. 2023 Nov 6;378(1889):20220405. doi: 10.1098/rstb.2022.0405. Epub 2023 Sep 18. Philos Trans R Soc Lond B Biol Sci. 2023. PMID: 37718604 Free PMC article.
An evolutionary model of sensitive periods when the reliability of cues varies across ontogeny.
Walasek N, Frankenhuis WE, Panchanathan K. Walasek N, et al. Behav Ecol. 2021 Oct 25;33(1):101-114. doi: 10.1093/beheco/arab113. eCollection 2022 Jan-Feb. Behav Ecol. 2021. PMID: 35197808 Free PMC article.

See all "Cited by" articles

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Elsevier Science
Other Literature Sources
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Enriching behavioral ecology with reinforcement learning methods

Affiliations

Enriching behavioral ecology with reinforcement learning methods

Authors

Affiliations

Abstract

Similar articles

Cited by

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources