
2017 NeuroAI 2

Deep Learning

  • 1943, McCulloch and Pitts: NN that could compute logical functions
  • 1949, Hebb: efficiently encode environmental statistics in an unsupervised fashion
  • 1958, Rosenblatt: NN learn incrementally via supervisory feedback
  • 1980, Fukushima: early NN models of visual processing
  • 1985, Rumelhart: backprop
  • 2006, Hinton: deep belief networks
  • 2009, Deng: introduction of large labeled datasets (ImageNet, organized around the WordNet hierarchy), inspired by research on human language
  • 2012, Hinton: Dropout regularization, motivated by the stochasticity that is inherent in neurons that fire with Poisson-like statistics
  • 2015, LeCun: sentences can be represented as vectors
  • 2016, Yamins and DiCarlo: CNNs incorporate nonlinear transduction, divisive normalization, and maximum-based pooling of inputs
    • 1959, Hubel and Wiesel: single-cell recordings from the mammalian visual cortex revealed how visual input is filtered and pooled in simple and complex cells in area V1
    • Replicates the hierarchical organization of mammalian cortical systems, with both convergent and divergent information flow in successive, nested processing layers

SOTA 2025

  • LLMs

Reinforcement Learning

  • TD methods

    • Real-time models that learn from differences between temporally successive predictions, rather than having to wait until the actual reward is delivered.
    • Of particular relevance was an effect called second-order conditioning, where affective significance is conferred on a conditioned stimulus (CS) through association with another CS rather than directly via association with the unconditioned stimulus.
    • TD learning provides a natural explanation for second-order conditioning and indeed has gone on to explain a much wider range of findings from neuroscience.
  • TD-based RL: DQNs, A3C, PPO (TD for value estimation), SAC (TD for Q-function updates)

    • DQN uses TD learning by bootstrapping from its own (target-network) predictions \(Q_{\text{target}}\) to update Q-values in real time.
    • Unlike Monte Carlo, it doesn’t need to wait for final outcomes — it learns from temporal differences between successive predictions.
    • The "real-time" aspect comes from the fact that every step generates a TD error, which is used to improve the policy immediately.
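
A minimal sketch of this one-step TD update, using a tabular Q-table rather than a deep network for clarity; all sizes and constants below are illustrative choices, not values from these notes.

```python
import numpy as np

# Tabular Q-learning sketch of the one-step TD update that DQN generalizes with a
# neural network and a separate target network. Sizes and constants are illustrative.
n_states, n_actions = 10, 4
alpha, gamma = 0.1, 0.99
Q = np.zeros((n_states, n_actions))

def td_update(s, a, r, s_next, done):
    # Bootstrapped target: immediate reward plus discounted value of the best next action.
    target = r if done else r + gamma * Q[s_next].max()
    td_error = target - Q[s, a]   # available at every step; no waiting for the episode to end
    Q[s, a] += alpha * td_error   # move the current prediction toward the bootstrapped target
    return td_error
```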

SOTA 2025

  • DreamerV3: Best for model-based RL from pixels
  • MuZero: Combines planning + learning without knowing environment rules
  • SAC: SOTA for continuous control
  • PPO: Widely used, stable, scalable

Attention

  • Traditionally, CNN models worked directly on entire images, with equal priority given to all pixels at the earliest stage of processing
  • The primate visual system works differently. Visual attention shifts strategically among locations and objects, centering processing resources and representational coordinates on a series of regions in turn
  • Attentional mechanisms have been a source of inspiration for AI architectures that take "glimpses" of the input image at each step, update internal state representations, and then select the next location to sample

    • One such network was able to use this selective attentional mechanism to ignore irrelevant objects in a scene, allowing it to perform well in challenging object classification tasks in the presence of clutter
    • 2014. DeepMind. Recurrent Models of Visual Attention
  • While attention is typically thought of as an orienting mechanism for perception, it can also be focused on the contents of internal memory; this insight has contributed to recent successes in machine translation and in memory and reasoning tasks (a minimal soft-attention sketch follows this list)

  • Attention mechanisms have also recently proven useful in generative models that mimic the structure of examples presented during training

    • For example, in one SOTA generative model known as DRAW, attention allows the system to build up an image incrementally, attending to one portion of a "mental canvas" at a time
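
A minimal NumPy sketch of the soft (scaled dot-product) attention read-out that underlies these translation and memory systems; the shapes and random inputs are illustrative.

```python
import numpy as np

def soft_attention(query, keys, values):
    """Soft (scaled dot-product) attention: weight stored values by how well their
    keys match the query. Shapes: query (d,), keys (n, d), values (n, d_v)."""
    scores = keys @ query / np.sqrt(query.shape[0])   # similarity of the query to each memory slot
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()                          # softmax over slots
    return weights @ values                           # weighted read-out

# Illustrative usage with random vectors
rng = np.random.default_rng(0)
read = soft_attention(rng.normal(size=8), rng.normal(size=(5, 8)), rng.normal(size=(5, 16)))
```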

SOTA 2025

  • LLMs

Episodic Memory

  • Allows experiences to be encoded rapidly in a content-addressable store

    • Associated with the medial temporal lobe (including the hippocampus)
  • Animal learning is supported by complementary learning systems in the hippocampus and neocortex

    • The hippocampus acts to encode novel information after a single exposure (one-shot learning), but this information is gradually consolidated to the neocortex in sleep or resting periods that are interleaved with periods of activity. This consolidation is accompanied by replay in the hippocampus and neocortex, which is observed as a reinstatement of the structured patterns of neural activity that accompanied the learning event
    • This theory was originally proposed as a solution to the well-known problem that in conventional neural networks, correlated exposure to sequential task settings leads to interference (catastrophic forgetting)
    • The replay buffer in DQN is like a primitive hippocampus, permitting complementary learning in silico
    • Enhanced when replay of highly rewarding events is prioritized (hippocampal replay seems to favor events that lead to high levels of reinforcement)
  • DQN exhibits expert play on Atari video games by learning to transform image pixels to a policy

    • Experience replay is critical for maximizing data efficiency; it avoids the destabilizing effects of learning from consecutive correlated experiences and allows the network to learn a viable value function even in complex sequential environments (video games)
  • Episodic Control

    • Experiences stored in a memory buffer are not only useful for gradually adjusting the parameters of a deep network toward an optimal policy, as in DQN
    • They can also support rapid behavioral change based on a single experience. Neuroscience has argued for the potential benefits of episodic control, whereby rewarded action sequences can be internally re-enacted from a rapidly updatable memory store (the hippocampus). This is advantageous when only limited experience has been obtained
    • Recent AI research has drawn on these ideas to overcome the slow learning in deep RL
    • These networks store experiences (e.g., actions and reward outcomes associated with particular game screens) and select new actions based on the similarity between the current input and stored memories, taking the reward associated with previous events into account (see the nearest-neighbor sketch after this list)
    • This yields striking gains in performance over standard deep RL. Further, these systems achieve success on tasks that depend heavily on one-shot learning, where typical deep RL architectures fail
    • In the future, it will be interesting to combine the benefits of rapid episodic-like memory with those of more traditional incremental learning (see Imagination and Planning)
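
A toy sketch of the episodic-control scheme described above: store a state embedding and the return that followed each action, then act by nearest-neighbor lookup over those memories. Class and parameter names are illustrative, not taken from any specific paper's code.

```python
import numpy as np

class EpisodicMemory:
    """Toy episodic-control store: per action, remember (state embedding, return) pairs
    and value new states by averaging the returns of the k most similar memories."""
    def __init__(self, n_actions):
        self.keys = [[] for _ in range(n_actions)]     # state embeddings seen for each action
        self.returns = [[] for _ in range(n_actions)]  # returns that followed them

    def write(self, state_emb, action, episode_return):
        self.keys[action].append(np.asarray(state_emb))
        self.returns[action].append(episode_return)

    def value(self, state_emb, action, k=5):
        if not self.keys[action]:
            return 0.0
        dists = np.linalg.norm(np.stack(self.keys[action]) - state_emb, axis=1)
        nearest = np.argsort(dists)[:k]                # k most similar past situations
        return float(np.mean(np.array(self.returns[action])[nearest]))

    def act(self, state_emb):
        # Pick the action whose remembered outcomes look best for this state.
        return int(np.argmax([self.value(state_emb, a) for a in range(len(self.keys))]))
```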

SOTA 2025

  • Problems with Episodic Control

    • You can't store every experience.
    • Similar inputs don’t always lead to similar outcomes.
  • How world models solve them

    • Compression & Generalization: World models summarize and compress many experiences into learned patterns
    • Variance Reduction: world models learn structure and smooth out noise in the data
    • Predictive Imagination: World models allow simulation of counterfactuals: What if I try a different action in this situation?
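
A minimal sketch of predictive imagination with a learned world model: candidate action sequences are rolled forward in the model rather than the real environment and compared by predicted return. The dynamics and reward functions below are toy placeholders standing in for a trained model.

```python
import numpy as np

def imagine_return(dynamics, reward_fn, z0, action_sequence, gamma=0.99):
    """Roll an action sequence forward in the learned model (dynamics, reward_fn)
    and return its predicted discounted return; nothing touches the real environment."""
    z, total, discount = z0, 0.0, 1.0
    for a in action_sequence:
        total += discount * reward_fn(z, a)
        z = dynamics(z, a)
        discount *= gamma
    return total

# Counterfactual comparison: which candidate action sequence looks better in imagination?
toy_dynamics = lambda z, a: 0.9 * z + 0.1 * a        # placeholder for a learned latent transition model
toy_reward = lambda z, a: float(-np.abs(z).sum())    # placeholder for a learned reward predictor
best = max([[1.0, 1.0], [-1.0, 0.0]],
           key=lambda seq: imagine_return(toy_dynamics, toy_reward, np.zeros(3), seq))
```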

Working Memory

  • Human working memory

    • Thought to be instantiated within the prefrontal cortex and interconnected areas.
    • Classic cognitive theories: depends on interactions between a central controller and separate, domain-specific memory buffers
  • Began with RNNs displaying attractor dynamics and rich sequential behavior, directly inspired by neuroscience

  • One can see close parallels between the learning dynamics in these early, neuroscience-inspired networks and those in LSTM networks. LSTMs allow information to be gated into a fixed activity state and maintained until an appropriate output is required; the functions of sequence control and memory storage are closely intertwined rather than separate (see the gating sketch after this list)

  • Differentiable neural computer (DNC): a neural network controller that attends to, reads from, and writes to an external memory matrix.

    • This externalization allows the network controller to learn from scratch (i.e., via end-to-end optimization) to perform a wide range of complex memory and reasoning tasks that currently elude LSTMs, such as finding the shortest path through a graph-like structure
    • These types of problems were previously argued to depend exclusively on symbol processing and variable binding, and therefore to lie beyond the purview of neural networks
  • Although both LSTMs and the DNC are described here in the context of working memory, they have the potential to maintain information over many thousands of training cycles and may thus be suited to longer-term forms of memory, such as retaining and understanding the contents of a book.
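
A minimal NumPy sketch of the LSTM gating described above: input, forget, and output gates control what is written into, kept in, and read out of a maintained memory state. Weight shapes and initialization are illustrative.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h, c, W, U, b):
    """One LSTM step. W, U, b hold parameters for the input (i), forget (f),
    output (o) gates and the candidate content (g)."""
    i = sigmoid(W["i"] @ x + U["i"] @ h + b["i"])   # how much new input to write
    f = sigmoid(W["f"] @ x + U["f"] @ h + b["f"])   # how much of the old state to keep
    o = sigmoid(W["o"] @ x + U["o"] @ h + b["o"])   # how much of the state to expose
    g = np.tanh(W["g"] @ x + U["g"] @ h + b["g"])   # candidate content
    c_new = f * c + i * g                           # gated, maintained memory state
    h_new = o * np.tanh(c_new)                      # output read from the maintained state
    return h_new, c_new

# Illustrative sizes: 4-d input, 3-d hidden state
rng = np.random.default_rng(0)
d_in, d_h = 4, 3
W = {k: rng.normal(size=(d_h, d_in)) for k in "ifog"}
U = {k: rng.normal(size=(d_h, d_h)) for k in "ifog"}
b = {k: np.zeros(d_h) for k in "ifog"}
h, c = lstm_step(rng.normal(size=d_in), np.zeros(d_h), np.zeros(d_h), W, U, b)
```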

SOTA 2025

  • Transformers (implicit memory)

Continual Learning

  • Neuroscience

    • Decreased synaptic lability (lower rates of plasticity) in a proportion of strengthened synapses, mediated by enlargements to dendritic spines that persist despite learning of other tasks.
    • Theoretical models: memories can be protected from interference through synapses that transition between a cascade of states with different levels of plasticity.
  • Elastic Weight Consolidation (EWC): slows learning on weights identified as important for previously learned tasks, mirroring the reduced synaptic lability described above (a minimal sketch of the penalty term follows)
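
A minimal sketch of the EWC penalty, assuming a diagonal Fisher information estimate from the previous task; parameter names and the regularization strength are illustrative.

```python
import numpy as np

def ewc_penalty(params, old_params, fisher, lam=1000.0):
    """EWC regularizer: (lam / 2) * sum_i F_i * (theta_i - theta*_i)^2.
    fisher is a diagonal Fisher information estimate from the previous task; a large
    F_i means parameter i mattered there, so it is pulled back toward old_params harder."""
    return 0.5 * lam * float(np.sum(fisher * (params - old_params) ** 2))

def continual_loss(new_task_loss, params, old_params, fisher, lam=1000.0):
    # New-task objective plus the consolidation term protecting old-task weights.
    return new_task_loss + ewc_penalty(params, old_params, fisher, lam)
```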


Intuitive Understanding of the Physical World

SOTA 2025

  • 2021. UC Berkeley. Decision Transformer: Reinforcement Learning via Sequence Modeling

    • An architecture that casts the problem of RL as conditional sequence modeling. Unlike prior approaches to RL that fit value functions or compute policy gradients, Decision Transformer simply outputs the optimal actions by leveraging a causally masked Transformer (see the sequence-layout sketch after this list)
    • Matches or exceeds the performance of state-of-the-art model-free offline RL baselines on Atari, OpenAI Gym, and Key-to-Door tasks
  • 2023. Meta. Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture

    • Grounded in the fact that humans learn an enormous amount of background knowledge about the world just by passively observing it.
    • At a high level, the JEPA aims to predict the representation of part of an input (such as an image or piece of text) from the representation of other parts of the same input.
    • Because it does not involve collapsing representations from multiple views/augmentations of an image to a single point, the hope is for the JEPA to avoid the biases and issues associated with another widely used method called invariance-based pretraining.
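
A minimal sketch of the sequence layout the Decision Transformer framing relies on: each timestep contributes a (return-to-go, state, action) triple, and a causally masked Transformer is trained to predict the action token. The tokenization details below are illustrative.

```python
import numpy as np

def to_sequence(states, actions, rewards):
    """Lay a trajectory out the way Decision Transformer conditions on it:
    (return-to-go, state, action) at every timestep, so a causally masked model
    can predict the next action given a desired return."""
    returns_to_go = np.cumsum(rewards[::-1])[::-1]    # R_t = sum of rewards from t to the end
    tokens = []
    for g, s, a in zip(returns_to_go, states, actions):
        tokens.extend([("return_to_go", float(g)), ("state", s), ("action", a)])
    return tokens

# Illustrative three-step trajectory with a single terminal reward
seq = to_sequence(states=["s0", "s1", "s2"], actions=[0, 1, 0], rewards=[0.0, 0.0, 1.0])
```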

Efficient Learning

Transfer Learning

  • Progressive Neural Networks: an architecture for transfer learning that freezes columns trained on earlier tasks and adds lateral connections from them into a new column trained on each new task (see the sketch after this list)

  • Neuroscience

    • How humans or other animals achieve this sort of high-level transfer learning is unknown, and remains a relatively unexplored topic in neuroscience
    • At the level of neural coding, this kind of transfer of abstract structured knowledge may rely on the formation of conceptual representations that are invariant to the objects, individuals, or scene elements that populate a sensory domain but code instead for abstract, relational information among patterns of inputs (lack direct evidence)
    • One recent report: neural codes thought to be important in the representation of allocentric (map-like) spaces might be critical for abstract reasoning in more general domains
    • In the mammalian entorhinal cortex, cells encode the geometry of allocentric space with a periodic "grid" code, with receptive fields that tile the local space in a hexagonal pattern (Rowland et al., 2016)
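
A minimal sketch of one hidden layer of a progressive network: columns trained on earlier tasks are frozen, and lateral adapters feed their activations into the column being trained on the new task. Shapes and the plain-NumPy formulation are illustrative.

```python
import numpy as np

relu = lambda x: np.maximum(x, 0.0)

def progressive_forward(x, frozen_columns, new_column, laterals):
    """One hidden layer of a progressive network.
    frozen_columns: weight matrices of columns trained on earlier tasks (never updated).
    new_column:     weight matrix being trained on the current task.
    laterals:       one adapter matrix per frozen column, feeding its activations
                    into the new column so earlier-task features can be reused."""
    old_hiddens = [relu(W @ x) for W in frozen_columns]
    h_new = new_column @ x
    for U, h_old in zip(laterals, old_hiddens):
        h_new = h_new + U @ h_old          # lateral transfer connections
    return relu(h_new)

# Illustrative shapes: 8-d input, 16-d hidden units, one previously trained column
rng = np.random.default_rng(0)
out = progressive_forward(
    rng.normal(size=8),
    frozen_columns=[rng.normal(size=(16, 8))],
    new_column=rng.normal(size=(16, 8)),
    laterals=[rng.normal(size=(16, 16))],
)
```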

SOTA 2025

Imagination and Planning

SOTA 2025

Virtual Brain Analytics

SOTA 2025

Extra