MERLIN
Latest revision as of 14:36, 28 April 2023
MERLIN (the Memory, RL, and Inference Network) is a model for unsupervised predictive memory in a goal-directed agent, proposed by Greg Wayne et al., in which memory formation is guided by a process of predictive modeling. The authors observe that animals execute goal-directed behaviors despite the limited range and scope of their sensors; to cope, they explore their environments and store memories that maintain estimates of important information that is not immediately available. MERLIN was evaluated in 3D virtual reality environments in which partial observability was severe and memories had to be maintained over long durations. The authors demonstrated that it can solve canonical behavioral tasks from psychology and neurobiology without simplifying assumptions about the dimensionality of sensory input or the duration of experiences.[1]
Demonstrations
Ext. Video 1
Ext. Video 2
Ext. Video 3
Ext. Video 4
Ext. Video 5
Ext. Video 6
References
- Greg Wayne, Chia-Chun Hung, David Amos, Mehdi Mirza, Arun Ahuja, Agnieszka Grabska-Barwinska, Jack Rae, Piotr Mirowski, Joel Z. Leibo, Adam Santoro, Mevlana Gemici, Malcolm Reynolds, Tim Harley, Josh Abramson, Shakir Mohamed, Danilo Rezende, David Saxton, Adam Cain, Chloe Hillier, David Silver, Koray Kavukcuoglu, Matt Botvinick, Demis Hassabis, Timothy Lillicrap. Unsupervised Predictive Memory in a Goal-Directed Agent. arXiv:1803.10760, 2018.