In 1951, Marvin Minsky, then a student at Harvard, borrowed observations from animal behavior to try to design an intelligent machine. Drawing on the work of physiologist Ivan Pavlov, who famously used dogs to show how animals learn through punishments and rewards, Minsky created a computer that could continuously learn through similar reinforcement to solve a virtual maze.
At the time, neuroscientists had yet to figure out the mechanisms within the brain that allow animals to learn in this way. But Minsky was still able to loosely mimic the behavior, thereby advancing artificial intelligence. Several decades later, as reinforcement learning continued to mature, it in turn helped the field of neuroscience discover those mechanisms, feeding into a virtuous cycle of advancement between the two fields.
In a paper published in Nature today, DeepMind, Alphabets AI subsidiary, has once again used lessons from reinforcement learning to propose a new theory about the reward mechanisms within our brains. The hypothesis, supported by initial experimental findings, could not only improve our understanding of mental health and motivation. It could also validate the current direction of AI research toward building more human-like general intelligence.
Sign up for The Algorithm artificial intelligence, demystified
At a high level, reinforcement learning follows the insight derived from Pavlovs dogs: its possible to teach an agent to master complex, novel tasks through only positive and negative feedback. An algorithm begins learning an assigned task by randomly predicting which action might earn it a reward. It then takes the action, observes the real reward, and adjusts its prediction based on the margin of error. Over millions or even billions of trials, the algorithms prediction errors converge to zero, at which point it knows precisely which actions to take to maximize its reward and so complete its task.
It turns out the brains reward system works in much the same waya discovery made in the 1990s, inspired by reinforcement-learning algorithms. When a human or animal is about to perform an action, its dopamine neurons make a prediction about the expected reward. Once the actual reward is received, they then fire off an amount of dopamine that corresponds to the prediction error. A better reward than expected triggers a strong dopamine release, while a worse reward than expected suppresses the chemicals production. The dopamine, in other words, serves as a correction signal, telling the neurons to adjust their predictions until they converge to reality. The phenomenon, known as reward prediction error, works much like a reinforcement-learning algorithm.
DeepMinds new paper builds on the tight connection between these natural and artificial learning mechanisms. In 2017, its researchers introduced an improved reinforcement-learning algorithm that has since unlocked increasingly impressive performance on various tasks. They now believe this new method could offer an even more precise explanation of how dopamine neurons work in the brain.
Specifically, the improved algorithm changes the way it predicts rewards. Whereas the old approach estimated rewards as a single numbermeant to equal the average expected outcomethe new approach represents them more accurately as a distribution. (Think for a moment about a slot machine: you can either win or lose following some distribution. But in no instance would you ever receive the average expected outcome.)
The modification lends itself to a new hypothesis: Do dopamine neurons also predict rewards in the same distributional way?
To test this theory, DeepMind partnered with a group at Harvard to observe dopamine neuron behavior in mice. They set the mice on a task and rewarded them based on the roll of dice, measuring the firing patterns of their dopamine neurons throughout. They found that every neuron released different amounts of dopamine, meaning they had all predicted different outcomes. While some were too optimistic, predicting higher rewards than actually received, others were more pessimistic, lowballing the reality. When the researchers mapped out the distribution of those predictions, it closely followed the distribution of the actual rewards. This data offers compelling evidence that the brain indeed uses distributional reward predictions to strengthen its learning algorithm.
DeepMind
This is a nice extension to the notion of dopamine coding of reward prediction error, wrote Wolfram Schultz, a pioneer in dopamine neuron behavior who wasnt involved in the study, in an email. It is amazing how this very simple dopamine response predictably follows intuitive patterns of basic biological learning processes that are now becoming a component of AI.
The study has implications for both AI and neuroscience. First, it validates distributional reinforcement learning as a promising path to more advanced AI capabilities. If the brain is using it, its probably a good idea, said Matt Botvinick, DeepMinds director of neuroscience research and one of the lead authors on the paper, during a press briefing. It tells us that this is a computational technique that can scale in real-world situations. Its going to fit well with other computational processes.
Second, it could offer an important update to one of the canonical theories in neuroscience about reward systems in the brain, which in turn could improve our understanding of everything from motivation to mental health. What might it mean, for example, to have pessimistic and optimistic dopamine neurons? If the brain selectively listened to only one or the other, could it lead to chemical imbalances and induce depression?
Fundamentally, by further decoding processes in the brain, the results also shed light on what creates human intelligence. It gives us a new perspective on what's going on in our brains during everyday life, Botvinick said.
Read the original post:
An algorithm that learns through rewards may show how our brain does too - MIT Technology Review
- The 15 most popular psychology and neuroscience studies in 2024 - PsyPost - January 1st, 2025 [January 1st, 2025]
- The 'lizard brain' lie: How neuroscience demolished the greatest mind myth - BBC Science Focus - January 1st, 2025 [January 1st, 2025]
- Revolutionizing Brain Diagnostics with Light and AI - Neuroscience News - January 1st, 2025 [January 1st, 2025]
- How Early Experiences Shape Genes, Brain Health, and Resilience - Neuroscience News - January 1st, 2025 [January 1st, 2025]
- A nation exhausted: The neuroscience of why Americans are tuning out political news - Indiana Capital Chronicle - January 1st, 2025 [January 1st, 2025]
- Lithium Restores Brain Function and Behavior in Autism - Neuroscience News - January 1st, 2025 [January 1st, 2025]
- Partners in Diversity presents the science of belonging: exploring the neuroscience of inclusion - Here is Oregon - January 1st, 2025 [January 1st, 2025]
- Classical vs. Operant Conditioning: The Brain's Memory Tug-of-War - Neuroscience News - January 1st, 2025 [January 1st, 2025]
- The Personality Gap Between Singles and the Partnered - Neuroscience News - January 1st, 2025 [January 1st, 2025]
- The Neuroscience Behind Vermeers Girl and Its Hypnotic Power - ZME Science - January 1st, 2025 [January 1st, 2025]
- Serotonin, GABA, and Dopamine Drive Hunger and Feeding - Neuroscience News - December 23rd, 2024 [December 23rd, 2024]
- A nation exhausted: The neuroscience of why Americans are tuning out politics - The Conversation - December 23rd, 2024 [December 23rd, 2024]
- UNO Goalie and Neuroscience Grad Shines in Her Athletic and Academic Aspirations - University of Nebraska Omaha - December 23rd, 2024 [December 23rd, 2024]
- Neuroscience Major Seeks to Bridge the Generation Gap, Help Alzheimers Patients - Pomona College - December 23rd, 2024 [December 23rd, 2024]
- Spectrum 2024: Year in review - The Transmitter: Neuroscience News and Perspectives - December 23rd, 2024 [December 23rd, 2024]
- Say what? The Transmitters top quotes of 2024 - The Transmitter: Neuroscience News and Perspectives - December 23rd, 2024 [December 23rd, 2024]
- Targeted or Broadcast? How the Brain Processes Visual Information - Neuroscience News - December 23rd, 2024 [December 23rd, 2024]
- 70 Is the New 60: Age Related Declines Slowing in Older People - Neuroscience News - December 23rd, 2024 [December 23rd, 2024]
- Breathing Rhythms During Sleep Strengthen Memory Consolidation - Neuroscience News - December 23rd, 2024 [December 23rd, 2024]
- How our brains think: Exploring the world of neuroscience at the Yale Peabody Museum - Connecticut Public - December 23rd, 2024 [December 23rd, 2024]
- Assembloids illuminate circuit-level changes linked to autism, neurodevelopment - The Transmitter: Neuroscience News and Perspectives - December 23rd, 2024 [December 23rd, 2024]
- Mapping the Brain's Response to Social Rejection - Neuroscience News - December 9th, 2024 [December 9th, 2024]
- An eye for science: Q&A with Bryan W. Jones - The Transmitter: Neuroscience News and Perspectives - December 9th, 2024 [December 9th, 2024]
- Short Sleep and High Blood Pressure Linked to Brain Aging - Neuroscience News - December 9th, 2024 [December 9th, 2024]
- Neighborhood Disadvantage Linked to Cognitive Health Risks - Neuroscience News - December 9th, 2024 [December 9th, 2024]
- Psychosis Risk Tied to Heavy Cannabis Use and Genetic Factors - Neuroscience News - December 9th, 2024 [December 9th, 2024]
- Most Teens Recover From Long Covid Within Two Years - Neuroscience News - December 9th, 2024 [December 9th, 2024]
- Opportunities and challenges of single-cell and spatially resolved genomics methods for neuroscience discovery - Nature.com - December 9th, 2024 [December 9th, 2024]
- How Evolution Shaped the Brains Understanding of Numbers - Neuroscience News - December 9th, 2024 [December 9th, 2024]
- Neuroscience Study Aboard Cunard's Queen Mary 2 Reveals Cognitive Benefits of Slow Travel at Sea - PR Newswire - November 28th, 2024 [November 28th, 2024]
- How Expectations Shape Our Gaze in a Changing World - Neuroscience News - November 28th, 2024 [November 28th, 2024]
- To keep or not to keep: Neurophysiologys data dilemma - The Transmitter: Neuroscience News and Perspectives - November 28th, 2024 [November 28th, 2024]
- Does Alcohol Consumption Contribute to Hair Loss? - Neuroscience News - November 28th, 2024 [November 28th, 2024]
- Brains Traffic Controllers Hold Key to Learning and Memory - Neuroscience News - November 28th, 2024 [November 28th, 2024]
- Despite Neuroscience Setback, AbbVie Has Strong Recovery Ahead (ABBV) - Seeking Alpha - November 28th, 2024 [November 28th, 2024]
- Neuroscientists reeling from past cuts advocate for more BRAIN Initiative funding - The Transmitter: Neuroscience News and Perspectives - November 28th, 2024 [November 28th, 2024]
- Want Better Habits? Neuroscience Says This Is How to Train Your Brain - Inc. - November 28th, 2024 [November 28th, 2024]
- Dopamine and Serotonin Work in Opposition for Effective Learning - Neuroscience News - November 28th, 2024 [November 28th, 2024]
- Cunard Proves the Healing Power of Ocean Travel with Breakthrough Neuroscience Research - Travel And Tour World - November 28th, 2024 [November 28th, 2024]
- Bridging the Gap between Meditation, Neuroscience, and the Soul - openPR - November 28th, 2024 [November 28th, 2024]
- Animal Characters in Childrens Books Boost Theory of Mind - Neuroscience News - November 28th, 2024 [November 28th, 2024]
- Emotional Struggles and Tantrums in Preschoolers Linked to ADHD - Neuroscience News - November 28th, 2024 [November 28th, 2024]
- Neuroscience Says This Simple Habit Improves Cognitive Health and Makes Your Brain Act Younger - Inc. - November 20th, 2024 [November 20th, 2024]
- Premature declarations on animal consciousness hinder progress - The Transmitter: Neuroscience News and Perspectives - November 20th, 2024 [November 20th, 2024]
- Medtronic Q2 Earnings: Diabetes And Neuroscience Revenue Boost Growth, Raises Annual Outlook - Yahoo Finance - November 20th, 2024 [November 20th, 2024]
- Trace Neuroscience Nets $101M in Series A Funding for ALS, Dementia Therapy Development - Senior Housing News - November 20th, 2024 [November 20th, 2024]
- How to be a multidisciplinary neuroscientist - The Transmitter: Neuroscience News and Perspectives - November 20th, 2024 [November 20th, 2024]
- Neuroscience Market Expected to Reach USD 71.0 Billion by - GlobeNewswire - November 20th, 2024 [November 20th, 2024]
- Finger-Prick Test Brings Alzheimers Detection Closer to Everyone - Neuroscience News - November 20th, 2024 [November 20th, 2024]
- Dual-Gene Therapy Shows Promise for Hearing and Vision Loss - Neuroscience News - November 20th, 2024 [November 20th, 2024]
- Robots Help Unlock the Mystery of Human Sense of Self - Neuroscience News - November 20th, 2024 [November 20th, 2024]
- The neuroscience of sleep - University of South Carolina - November 20th, 2024 [November 20th, 2024]
- Stress warps fear memories in multiple ways - The Transmitter: Neuroscience News and Perspectives - November 20th, 2024 [November 20th, 2024]
- Mental Exhaustion Drives Aggressive Behavior - Neuroscience News - November 12th, 2024 [November 12th, 2024]
- NeuroAI: A field born from the symbiosis between neuroscience, AI - The Transmitter: Neuroscience News and Perspectives - November 12th, 2024 [November 12th, 2024]
- The neuroscience of deeper learning in math - SmartBrief - November 12th, 2024 [November 12th, 2024]
- What the brain can teach artificial neural networks - The Transmitter: Neuroscience News and Perspectives - November 12th, 2024 [November 12th, 2024]
- How Anthony Zador thinks neuroscience can help improve AI - The Transmitter: Neuroscience News and Perspectives - November 12th, 2024 [November 12th, 2024]
- Discovering Cancer Therapies through Neuroscience - The New York Academy of Sciences - November 12th, 2024 [November 12th, 2024]
- Neuroscience Market Projected to Reach USD 50.2 Billion by 2032, Growing at a 4.0% CAGR S&S Insider - GlobeNewswire - November 12th, 2024 [November 12th, 2024]
- Insights on Brain Aging and Lifelong Cognitive Health - Neuroscience News - November 12th, 2024 [November 12th, 2024]
- A neuroscience PhD student at the University of Oxford has died - The Tab - November 12th, 2024 [November 12th, 2024]
- Exploring the connection between autism and sleep - The Transmitter: Neuroscience News and Perspectives - November 12th, 2024 [November 12th, 2024]
- Astrocytes star in memory storage, recall - The Transmitter: Neuroscience News and Perspectives - November 12th, 2024 [November 12th, 2024]
- Gut Bacteria Modulate Stress Responses Over Time - Neuroscience News - November 12th, 2024 [November 12th, 2024]
- Gut Bacteria Could Hold the Key to Promoting Healthy Aging - Neuroscience News - November 12th, 2024 [November 12th, 2024]
- Microglias pruning function called into question - The Transmitter: Neuroscience News and Perspectives - October 26th, 2024 [October 26th, 2024]
- Depression Alters Brain Circuits, Heightening Negative Perception - Neuroscience News - October 26th, 2024 [October 26th, 2024]
- UNE Researchers Showcase Groundbreaking Work at Global Neuroscience Conference - University of New England - October 26th, 2024 [October 26th, 2024]
- Scientists discover "glue" that holds memory together in fascinating neuroscience breakthrough - PsyPost - October 26th, 2024 [October 26th, 2024]
- Systems neuroscience: combining theory and neurotechnology for a multiscale account of the brain - Nature.com - October 26th, 2024 [October 26th, 2024]
- Seaport Therapeutics adds another $225 million to coffers to embrace the golden age of neuroscience - STAT - October 26th, 2024 [October 26th, 2024]
- ANRO Investors Have Opportunity to Join Alto Neuroscience, Inc. Fraud Investigation with the Schall Law Firm - Business Wire - October 26th, 2024 [October 26th, 2024]
- Youth Face Rising Risks of Harassment and Exploitation in the Metaverse - Neuroscience News - October 26th, 2024 [October 26th, 2024]
- Exercise During Chemotherapy Boosts Cognitive Function - Neuroscience News - October 26th, 2024 [October 26th, 2024]
- Removing Pre-Bed Screen Time Improves Toddler Sleep - Neuroscience News - October 26th, 2024 [October 26th, 2024]
- Bright Minds Biosciences and Firefly Neuroscience to Collaborate After the BREAKTHROUGH Study: A Phase 2 Trial of BMB-101 in Absence Epilepsy and... - October 26th, 2024 [October 26th, 2024]
- How Visual Clutter Disrupts Information Flow in the Brain - Neuroscience News - October 26th, 2024 [October 26th, 2024]
- Menopausal Hormone Therapys Effects on Brain Health - Neuroscience News - October 26th, 2024 [October 26th, 2024]
- After-hours movers: McDonald's, Starbucks, Seagate, Alto Neuroscience and more - StreetInsider.com - October 26th, 2024 [October 26th, 2024]