In 1951, Marvin Minsky, then a student at Harvard, borrowed observations from animal behavior to try to design an intelligent machine. Drawing on the work of physiologist Ivan Pavlov, who famously used dogs to show how animals learn through punishments and rewards, Minsky created a computer that could continuously learn through similar reinforcement to solve a virtual maze.
At the time, neuroscientists had yet to figure out the mechanisms within the brain that allow animals to learn in this way. But Minsky was still able to loosely mimic the behavior, thereby advancing artificial intelligence. Several decades later, as reinforcement learning continued to mature, it in turn helped the field of neuroscience discover those mechanisms, feeding into a virtuous cycle of advancement between the two fields.
In a paper published in Nature today, DeepMind, Alphabets AI subsidiary, has once again used lessons from reinforcement learning to propose a new theory about the reward mechanisms within our brains. The hypothesis, supported by initial experimental findings, could not only improve our understanding of mental health and motivation. It could also validate the current direction of AI research toward building more human-like general intelligence.
Sign up for The Algorithm artificial intelligence, demystified
At a high level, reinforcement learning follows the insight derived from Pavlovs dogs: its possible to teach an agent to master complex, novel tasks through only positive and negative feedback. An algorithm begins learning an assigned task by randomly predicting which action might earn it a reward. It then takes the action, observes the real reward, and adjusts its prediction based on the margin of error. Over millions or even billions of trials, the algorithms prediction errors converge to zero, at which point it knows precisely which actions to take to maximize its reward and so complete its task.
It turns out the brains reward system works in much the same waya discovery made in the 1990s, inspired by reinforcement-learning algorithms. When a human or animal is about to perform an action, its dopamine neurons make a prediction about the expected reward. Once the actual reward is received, they then fire off an amount of dopamine that corresponds to the prediction error. A better reward than expected triggers a strong dopamine release, while a worse reward than expected suppresses the chemicals production. The dopamine, in other words, serves as a correction signal, telling the neurons to adjust their predictions until they converge to reality. The phenomenon, known as reward prediction error, works much like a reinforcement-learning algorithm.
DeepMinds new paper builds on the tight connection between these natural and artificial learning mechanisms. In 2017, its researchers introduced an improved reinforcement-learning algorithm that has since unlocked increasingly impressive performance on various tasks. They now believe this new method could offer an even more precise explanation of how dopamine neurons work in the brain.
Specifically, the improved algorithm changes the way it predicts rewards. Whereas the old approach estimated rewards as a single numbermeant to equal the average expected outcomethe new approach represents them more accurately as a distribution. (Think for a moment about a slot machine: you can either win or lose following some distribution. But in no instance would you ever receive the average expected outcome.)
The modification lends itself to a new hypothesis: Do dopamine neurons also predict rewards in the same distributional way?
To test this theory, DeepMind partnered with a group at Harvard to observe dopamine neuron behavior in mice. They set the mice on a task and rewarded them based on the roll of dice, measuring the firing patterns of their dopamine neurons throughout. They found that every neuron released different amounts of dopamine, meaning they had all predicted different outcomes. While some were too optimistic, predicting higher rewards than actually received, others were more pessimistic, lowballing the reality. When the researchers mapped out the distribution of those predictions, it closely followed the distribution of the actual rewards. This data offers compelling evidence that the brain indeed uses distributional reward predictions to strengthen its learning algorithm.
DeepMind
This is a nice extension to the notion of dopamine coding of reward prediction error, wrote Wolfram Schultz, a pioneer in dopamine neuron behavior who wasnt involved in the study, in an email. It is amazing how this very simple dopamine response predictably follows intuitive patterns of basic biological learning processes that are now becoming a component of AI.
The study has implications for both AI and neuroscience. First, it validates distributional reinforcement learning as a promising path to more advanced AI capabilities. If the brain is using it, its probably a good idea, said Matt Botvinick, DeepMinds director of neuroscience research and one of the lead authors on the paper, during a press briefing. It tells us that this is a computational technique that can scale in real-world situations. Its going to fit well with other computational processes.
Second, it could offer an important update to one of the canonical theories in neuroscience about reward systems in the brain, which in turn could improve our understanding of everything from motivation to mental health. What might it mean, for example, to have pessimistic and optimistic dopamine neurons? If the brain selectively listened to only one or the other, could it lead to chemical imbalances and induce depression?
Fundamentally, by further decoding processes in the brain, the results also shed light on what creates human intelligence. It gives us a new perspective on what's going on in our brains during everyday life, Botvinick said.
Read the original post:
An algorithm that learns through rewards may show how our brain does too - MIT Technology Review
- Marcus Neuroscience Institute to Host Brain and Spine Symposium - South Florida Hospital News - March 30th, 2025 [March 30th, 2025]
- Elon University to launch neuroscience major in fall 2025 - Today at Elon - March 30th, 2025 [March 30th, 2025]
- The brains stalwart sentinels express an unexpected gene - The Transmitter: Neuroscience News and Perspectives - March 30th, 2025 [March 30th, 2025]
- Video catches microglia in the act of synaptic pruning - The Transmitter: Neuroscience News and Perspectives - March 30th, 2025 [March 30th, 2025]
- Null and Noteworthy: Reexamining registered reports - The Transmitter: Neuroscience News and Perspectives - March 30th, 2025 [March 30th, 2025]
- Accepting the bitter lesson and embracing the brains complexity - The Transmitter: Neuroscience News and Perspectives - March 30th, 2025 [March 30th, 2025]
- NIH neurodevelopmental assessment system now available as iPad app - The Transmitter: Neuroscience News and Perspectives - March 30th, 2025 [March 30th, 2025]
- Stronger Bonds Before Birth Shape Healthier Mother-Child Futures - Neuroscience News - March 30th, 2025 [March 30th, 2025]
- How Emotionally Intelligent People Learn to Control Their Inner Voice, Backed by Neuroscience - Inc. - March 30th, 2025 [March 30th, 2025]
- Gabriele Scheler reflects on the interplay between language, thought and AI - The Transmitter: Neuroscience News and Perspectives - March 30th, 2025 [March 30th, 2025]
- Worlds first crowd-sourced neuroscience study aims to understand how our brains predict the future - EurekAlert - March 15th, 2025 [March 15th, 2025]
- Rewriting Neuroscience: Possible Foundations of Human Intelligence Observed for the First Time - SciTechDaily - March 15th, 2025 [March 15th, 2025]
- Calculating neurosciences carbon cost: Q&A with Stefan Pulver and William Smith - The Transmitter: Neuroscience News and Perspectives - March 15th, 2025 [March 15th, 2025]
- The future of neuroscience research at U.S. minority-serving institutions is in danger - The Transmitter: Neuroscience News and Perspectives - March 15th, 2025 [March 15th, 2025]
- Dopamine and social media: Why you cant stop scrolling, according to neuroscience - PsyPost - March 15th, 2025 [March 15th, 2025]
- Neuroscience Discovered a Clever Trick for Squeezing More Joy Out of Everyday Pleasures - Inc. - March 15th, 2025 [March 15th, 2025]
- The limits of neuroscience - The Transmitter: Neuroscience News and Perspectives - March 15th, 2025 [March 15th, 2025]
- BPOM Explains The Benefits Of Fasting From The Health And Neuroscience Side - VOI English - March 15th, 2025 [March 15th, 2025]
- How tiny tardigrades could help tackle systems neuroscience questions - The Transmitter: Neuroscience News and Perspectives - March 15th, 2025 [March 15th, 2025]
- Alison Preston explains how our brains form mental frameworks for interpreting the world - The Transmitter: Neuroscience News and Perspectives - March 15th, 2025 [March 15th, 2025]
- The Mystical Mind Meets Neuroscience: Seeking the Roots of Consciousness - Next Big Idea Club Magazine - March 15th, 2025 [March 15th, 2025]
- Myosin Therapeutics Closes Second Seed Round to Advance Clinical Trials for Innovative Cancer and Neuroscience Therapies - PR Newswire - March 5th, 2025 [March 5th, 2025]
- Neuroscience Ph.D. programs adjust admissions in response to U.S. funding uncertainty - The Transmitter: Neuroscience News and Perspectives - March 5th, 2025 [March 5th, 2025]
- New tools help make neuroimaging accessible to more researchers - The Transmitter: Neuroscience News and Perspectives - March 5th, 2025 [March 5th, 2025]
- Future Thinking Training Reduces Impulsivity - Neuroscience News - March 5th, 2025 [March 5th, 2025]
- Null and Noteworthy, relaunched: Probing a schizophrenia biomarker - The Transmitter: Neuroscience News and Perspectives - March 5th, 2025 [March 5th, 2025]
- How to communicate the value of curiosity-driven research - The Transmitter: Neuroscience News and Perspectives - March 5th, 2025 [March 5th, 2025]
- Cognitive neuroscience approach to explore the impact of wind turbine noise on various mental functions - Nature.com - March 5th, 2025 [March 5th, 2025]
- Football on the Brain: Helping coaches embed neuroscience knowledge - Training Ground Guru - March 5th, 2025 [March 5th, 2025]
- Taking Control: Using Neuroscience to Build Better Lives - theLoop - March 5th, 2025 [March 5th, 2025]
- Creating a pipeline of talent to feed the growth of Neuroscience: Lessons from Ghana - Myjoyonline - March 5th, 2025 [March 5th, 2025]
- Exclusive: NIH appears to archive policy requiring female animals in studies - The Transmitter: Neuroscience News and Perspectives - February 25th, 2025 [February 25th, 2025]
- Roll On Down The Highway 2025 Tour coming to Neuroscience Group Field - WeAreGreenBay.com - February 25th, 2025 [February 25th, 2025]
- STEM organizations host Neuroscience Outreach Fair for local K-12 students - University of Virginia The Cavalier Daily - February 25th, 2025 [February 25th, 2025]
- Adapt or die: Safeguarding the future of diversity and inclusion funding in neuroscience - The Transmitter: Neuroscience News and Perspectives - February 25th, 2025 [February 25th, 2025]
- The last two-author neuroscience paper? - The Transmitter: Neuroscience News and Perspectives - February 25th, 2025 [February 25th, 2025]
- Gate Neurosciences Strengthens Focus on the Synapse as a Therapeutic Target with Acquisition of Boost Neuroscience - Business Wire - February 25th, 2025 [February 25th, 2025]
- Why Firefly Neuroscience, Inc. (AIFF) Is Soaring This Year So Far - Yahoo Finance - February 25th, 2025 [February 25th, 2025]
- Breaking the barrier between theorists and experimentalists - The Transmitter: Neuroscience News and Perspectives - February 25th, 2025 [February 25th, 2025]
- Preserving Brain Health and Advancing Neuroscience - University of Miami - February 25th, 2025 [February 25th, 2025]
- Science must step away from nationally managed infrastructure - The Transmitter: Neuroscience News and Perspectives - February 25th, 2025 [February 25th, 2025]
- Repurposed Blood Pressure Drug May Treat ADHD - Neuroscience News - February 25th, 2025 [February 25th, 2025]
- How to teach students about science funding - The Transmitter: Neuroscience News and Perspectives - February 25th, 2025 [February 25th, 2025]
- Reflecting on 2024: Advancing Neuroscience Research to Improve Neurological Health - National Institute of Neurological Disorders and Stroke - February 25th, 2025 [February 25th, 2025]
- Brains Hidden Circuitry for Risk and Reward Uncovered - Neuroscience News - February 25th, 2025 [February 25th, 2025]
- Why We Keep Exploring Even After Learning the Best Strategy - Neuroscience News - February 25th, 2025 [February 25th, 2025]
- Unlocking Cellular Youth: The Protein That Reverses Aging - Neuroscience News - February 25th, 2025 [February 25th, 2025]
- This paper changed my Life: Bill Newsome reflects on a quadrilogy of classic visual perception studies - The Transmitter: Neuroscience News and... - February 25th, 2025 [February 25th, 2025]
- Roundup: The false association between vaccines and autism - The Transmitter: Neuroscience News and Perspectives - February 3rd, 2025 [February 3rd, 2025]
- Static pay, shrinking prospects fuel neuroscience postdoc decline - The Transmitter: Neuroscience News and Perspectives - February 3rd, 2025 [February 3rd, 2025]
- Stimulating the brain with Damien Fair - The Transmitter: Neuroscience News and Perspectives - February 3rd, 2025 [February 3rd, 2025]
- Unhealthy Diet Linked to Faster Biological Aging in Young Adults - Neuroscience News - February 3rd, 2025 [February 3rd, 2025]
- Bob Smittcamp Family Neuroscience Institute coming to Fresno in 2026 - ABC30 News - February 3rd, 2025 [February 3rd, 2025]
- Norton Neuroscience Institute selected to pilot national Brain Health Navigator program - Norton Healthcare - February 3rd, 2025 [February 3rd, 2025]
- Coding bonus: Bats hippocampal cells log spatial, social cues - The Transmitter: Neuroscience News and Perspectives - February 3rd, 2025 [February 3rd, 2025]
- ADHD and brainwaves: How neuroscience is changing the way we diagnose the condition - PsyPost - February 3rd, 2025 [February 3rd, 2025]
- David Robbe challenges conventional notions of time and memory - The Transmitter: Neuroscience News and Perspectives - February 3rd, 2025 [February 3rd, 2025]
- How the Brain Processes Space and Time - Neuroscience News - February 3rd, 2025 [February 3rd, 2025]
- Using neuroscience to help establish healthier habits | Opinion - South Bend Tribune - February 3rd, 2025 [February 3rd, 2025]
- Solvonis chairman on heavy-hitting M&A in neuroscience sector - ICYMI - Proactive Investors UK - February 3rd, 2025 [February 3rd, 2025]
- New neuroscience research sheds light on distinct patterns of learning and generalization in autistic adults - PsyPost - January 23rd, 2025 [January 23rd, 2025]
- Neuroscientists need to do better at explaining basic mental health research - The Transmitter: Neuroscience News and Perspectives - January 23rd, 2025 [January 23rd, 2025]
- How Severance shows the possibilities of cognitive neuroscience - Fast Company - January 23rd, 2025 [January 23rd, 2025]
- AdventHealth Welcomes New Leadership In Heart and Vascular Services, Neuroscience and Orthopedics - Northwest Georgia News - January 23rd, 2025 [January 23rd, 2025]
- School of Neuroscience and Language Sciences Program recognized with University Exemplary Department or Program Award - Virginia Tech - January 23rd, 2025 [January 23rd, 2025]
- Early Exposure to Violent Media Linked to Teen Antisocial Behavior - Neuroscience News - January 23rd, 2025 [January 23rd, 2025]
- The Real Cognitive Neuroscience Behind Severance - WIRED - January 23rd, 2025 [January 23rd, 2025]
- The 15 most popular psychology and neuroscience studies in 2024 - PsyPost - January 1st, 2025 [January 1st, 2025]
- The 'lizard brain' lie: How neuroscience demolished the greatest mind myth - BBC Science Focus - January 1st, 2025 [January 1st, 2025]
- Revolutionizing Brain Diagnostics with Light and AI - Neuroscience News - January 1st, 2025 [January 1st, 2025]
- How Early Experiences Shape Genes, Brain Health, and Resilience - Neuroscience News - January 1st, 2025 [January 1st, 2025]
- A nation exhausted: The neuroscience of why Americans are tuning out political news - Indiana Capital Chronicle - January 1st, 2025 [January 1st, 2025]
- Lithium Restores Brain Function and Behavior in Autism - Neuroscience News - January 1st, 2025 [January 1st, 2025]
- Partners in Diversity presents the science of belonging: exploring the neuroscience of inclusion - Here is Oregon - January 1st, 2025 [January 1st, 2025]
- Classical vs. Operant Conditioning: The Brain's Memory Tug-of-War - Neuroscience News - January 1st, 2025 [January 1st, 2025]
- The Personality Gap Between Singles and the Partnered - Neuroscience News - January 1st, 2025 [January 1st, 2025]
- The Neuroscience Behind Vermeers Girl and Its Hypnotic Power - ZME Science - January 1st, 2025 [January 1st, 2025]
- Serotonin, GABA, and Dopamine Drive Hunger and Feeding - Neuroscience News - December 23rd, 2024 [December 23rd, 2024]
- A nation exhausted: The neuroscience of why Americans are tuning out politics - The Conversation - December 23rd, 2024 [December 23rd, 2024]
- UNO Goalie and Neuroscience Grad Shines in Her Athletic and Academic Aspirations - University of Nebraska Omaha - December 23rd, 2024 [December 23rd, 2024]