Print Close

Historical Feedback Representations in the Brain Robustly Guide Learning

Poster No:

989

Submission Type:

Abstract Submission

Authors:

Sangsoo Jin¹, Juhyeon Lee¹, Jong-Hwan Lee¹

Institutions:

¹Department of Brain and Cognitive Engineering, Korea University, Seoul, Korea, Republic of

First Author:

Sangsoo Jin
Department of Brain and Cognitive Engineering, Korea University
Seoul, Korea, Republic of

Co-Author(s):

Juhyeon Lee
Department of Brain and Cognitive Engineering, Korea University
Seoul, Korea, Republic of

Jong-Hwan Lee
Department of Brain and Cognitive Engineering, Korea University
Seoul, Korea, Republic of

Introduction:

A developing child often needs to guess the internal structure of the environment only relying on received feedback [1]. Similarly, reinforcement learning models can construct representations of learning from the action-feedback loop without any instructions [2]. We identified robust neural representations during feedback-guided learning using a novel real-life task.

Methods:

We developed a novel feedback-guided learning task called the Photographer paradigm [3] (Fig. 1a). Participants virtually explored five city scenes in random order and captured eight photographs for each city or run. They were instructed to capture scenes with the highest feedback score. Internally, the feedback scores were computed as conceptual similarities between captured scenes and the target context (shared across the cities) via a Contrastive Language-Image Pretraining (CLIP) model [4]. Neither the target context nor internal mechanics of feedback were instructed to participants.

The fMRI data was collected using a 3-T Siemens Tim-Trio MRI scanner with a 12-channel head coil (TR = 2000 ms; TE = 30 ms; 3×3×4 mm3 voxel size). We independently analyzed [5] 32 participants as the Discovery (n = 16; acquired in 2022) and Validation (n = 16; acquired in 2023) groups. We first applied a standard preprocessing pipeline using fMRIPrep 23.0.2 [6]. Run-wise GLMs were fitted using 3dDeconvolve to identify trial-wise Feedback betas while controlling the systematic effect of head motions.

We investigated what kind of feedback information is robustly represented within each run using representational similarity analysis (RSA). Fig. 1b depicts model representation dissimilarity matrices (RDMs) with three levels of historical information. In counterpart, neural RDMs were defined from trial-wise Feedback betas within the 3-voxel radius searchlight spheres [7]. To focus on within-run learning components while minimizing systematic variations between runs, we performed RSA by each run separately and averaged run-wise RSA maps. Group inferences were conducted by a one-sample t-test for each historical model using 3dttest++. We identified robust RSA clusters (voxel-wise p < 0.005 and cluster-level α < 0.05) that were significant in both the Discovery and Validation groups. We further hypothesized that the strength of the within-run feedback representation in a cluster may predict the level of within-run learning [8]. Linear mixed-effect models were fitted to estimate mean feedback scores of the current or following runs from the median RSA strength in each cluster.

Results:

Fig. 2a-b represents the model RDMs and associated neural clusters that were significant across the Discovery and Validation groups. Only historical models (i.e., Recent-2/3 Trial) were robustly represented in two groups. Notably, the middle orbital gyrus (MOG) and inferior frontal gyrus encoded historical dynamics during feedback phases. The ventral striatum was associated with the Recent-3 Trial model, suggesting that the dopaminergic learning signal may incorporate the current, one-back, and two-back trials.

We found an association between the neural-model coupling and the level of within-run learning. Fig 2c elucidates the RSA strength of the Recent-3 Trial model in the MOG area predicted the mean feedback score of the following run rather than the current run. This effect remains significant after controlling the previous mean feedback and variability of individuals and runs.

Conclusions:

We identified historical feedback representations in the brain during real-life, feedback-guided learning. The ventromedial reward network [9] may encode the current feedback structure even when the goal is unknown, and the learned structure would help future learning. Our study may suggest critical neural signatures of feedback-guided learning that are generalizable for humans [6] and reinforcement learning models [10].

Higher Cognitive Functions:

Decision Making

Executive Function, Cognitive Control and Decision Making

Reasoning and Problem Solving ¹

Higher Cognitive Functions Other ²

Learning and Memory:

Learning and Memory Other

Keywords:

Cognition

Computational Neuroscience

FUNCTIONAL MRI

Learning

Modeling

Other - Context learning, Feedback-guided learning, Naturalistic paradigm, Representational Similarity Analysis.

^1|2Indicates the priority used for review

Provide references using author date format

[1] Nussenbaum K and Hartley C A 2019 Reinforcement learning across development: What insights can we draw from a decade of research? Dev. Cogn. Neurosci. 40 100733
[2] Cross L, Cockburn J, Yue Y and O’Doherty J P 2021 Using deep reinforcement learning to reveal how the brain encodes abstract state-space representations in high-dimensional environments Neuron 109 724-738.e7
[3] Jin S, Lee J and Lee J-H 2023 How to Be a Good Photographer: Multi-modal Learning In a Real-life Environment
[4] Radford A, Kim J W, Hallacy C, Ramesh A, Goh G, Agarwal S, Sastry G, Askell A, Mishkin P, Clark J, Krueger G and Sutskever I 2021 Learning Transferable Visual Models From Natural Language Supervision (arXiv)
[5] Kim H-C, Jang H and Lee J-H 2020 Test–retest reliability of spatial patterns from resting-state functional MRI using the restricted Boltzmann machine and hierarchically organized spatial patterns from the deep belief network J. Neurosci. Methods 330 108451
[6] Esteban O, Markiewicz C J, Blair R W, Moodie C A, Isik A I, Erramuzpe A, Kent J D, Goncalves M, DuPre E, Snyder M, Oya H, Ghosh S S, Wright J, Durnez J, Poldrack R A and Gorgolewski K J 2019 fMRIPrep: a robust preprocessing pipeline for functional MRI Nat. Methods 16 111–6
[7] Lee J, Jung M, Lustig N and Lee J-H 2023 Neural representations of the perception of handwritten digits and visual objects from a convolutional neural network compared to humans Hum. Brain Mapp. 44 2018–38
[8] Kim D-Y, Jung E K, Zhang J, Lee S-Y and Lee J-H 2020 Functional magnetic resonance imaging multivoxel pattern analysis reveals neuronal substrates for collaboration and competition with myopic and predictive strategic reasoning Hum. Brain Mapp. 41 4314–31
[9] Bartra O, McGuire J T and Kable J W 2013 The valuation system: A coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value NeuroImage 76 412–27
[10] Botvinick M, Ritter S, Wang J X, Kurth-Nelson Z, Blundell C and Hassabis D 2019 Reinforcement Learning, Fast and Slow Trends Cogn. Sci. 23 408–22

Acknowledgment: This work was supported by the National Research Foundation (NRF) grant funded by the Korea government (MSIT) (NRF-2021M3E5D2A01022515, No. RS-2023-00218987), and in part by the Electronics and Telecommunications Research Institute (ETRI) grant funded by the Korean government. [23ZS1100, Core Technology Research for Self-Improving Integrated Artificial Intelligence System].