A multidimensional distributional map of future reward in dopamine neurons

Nature, Published online: 04 June 2025; doi:10.1038/s41586-025-09089-6An algorithm called time–magnitude reinforcement learning (TMRL) extends distributional reinforcement learning to take account of reward time and magnitude, and behavioural and neurophysiological experiments in mice suggest that midbrain dopamine neurons use TMRL-like computations.

Jun 4, 2025 - 16:20
 0
A multidimensional distributional map of future reward in dopamine neurons

Nature, Published online: 04 June 2025; doi:10.1038/s41586-025-09089-6An algorithm called time–magnitude reinforcement learning (TMRL) extends distributional reinforcement learning to take account of reward time and magnitude, and behavioural and neurophysiological experiments in mice suggest that midbrain dopamine neurons use TMRL-like computations.