MERLIN: Difference between revisions

From Robowaifu Institute of Technology
Revision as of 01:35, 10 August 2021

MERLIN (the Memory, RL, and Inference Network) is a model for unsupervised predictive memory in a goal-directed agent, proposed by Greg Wayne et al., in which memory formation is guided by a process of predictive modeling. The authors note that animals execute goal-directed behaviors despite the limited range and scope of their sensors; to cope, they explore their environments and store memories, maintaining estimates of important information that is not immediately available. MERLIN was tested in 3D virtual reality environments in which partial observability was severe and memories had to be maintained over long durations. The authors report that it can solve canonical behavioral tasks from psychology and neurobiology without simplifying assumptions about the dimensionality of sensory input or the duration of experiences.[1]
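The idea of storing compressed estimates of past observations and recalling them later, when the relevant information is no longer visible, can be illustrated with a toy sketch. This is not the authors' implementation: the names EpisodicMemory, encode, and cosine are hypothetical, the encoder is a fixed placeholder rather than a network trained by predictive modeling, and retrieval by cosine similarity is an assumption chosen for simplicity.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def encode(observation):
    """Hypothetical stand-in for a learned encoder.

    In a predictive-memory agent this mapping would be trained so that the
    code is useful for predicting observations; here it is just a copy.
    """
    return list(observation)


class EpisodicMemory:
    """Toy memory that stores one code vector per time step."""

    def __init__(self):
        self.rows = []

    def write(self, code):
        # Append the code for the current time step.
        self.rows.append(list(code))

    def read(self, query):
        # Return the stored code most similar to the query, or None if empty.
        if not self.rows:
            return None
        return max(self.rows, key=lambda row: cosine(row, query))


if __name__ == "__main__":
    memory = EpisodicMemory()
    # Three made-up observations seen at successive time steps.
    for obs in ([1.0, 0.0, 0.5], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]):
        memory.write(encode(obs))
    # Later, a noisy partial cue is used to recall the closest stored code.
    print(memory.read([0.9, 0.1, 0.4]))
```

The sketch only shows the write-then-recall pattern under partial observability; it does not include the reinforcement learning policy or any learned components.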

Demonstrations

Ext. Video 1

Ext. Video 2

Ext. Video 3

Ext. Video 4

Ext. Video 5

Ext. Video 6

References

  1. Greg Wayne, Chia-Chun Hung, David Amos, Mehdi Mirza, Arun Ahuja, Agnieszka Grabska-Barwinska, Jack Rae, Piotr Mirowski, Joel Z. Leibo, Adam Santoro, Mevlana Gemici, Malcolm Reynolds, Tim Harley, Josh Abramson, Shakir Mohamed, Danilo Rezende, David Saxton, Adam Cain, Chloe Hillier, David Silver, Koray Kavukcuoglu, Matt Botvinick, Demis Hassabis, Timothy Lillicrap. Unsupervised Predictive Memory in a Goal-Directed Agent. arXiv:1803.10760, 2018.