Scientists Reveal Reward Learning Pathway Mediated by Somatostatin Neurons

Assessing the positive or negative valence of external sensory cues and translating them into appropriate behavioral responses is crucial for animal survival in complex or challenging environments. Negative valence signals lead to aversive/avoidance behaviors, while positive valence signals promote reward/approach behaviors. Much previous research has focused on the neural structures within the emotional processing network and how different cell populations influence aversive or reward behaviors, but there has been little research on how sensory signals are transmitted to these structures.

Currently, the lateral habenula (LHb) has been identified as a key brain region regulating reward/aversive responses in studies of the emotional processing network. The LHb mainly contains excitatory glutamatergic neurons that can modulate the dopamine (DA) and serotonin (5-HT) systems. Studies have found that these neurons can be activated by aversive signals and inhibited by reward signals, resulting in opposite responses. Several input structures upstream of the LHb, such as the ventral pallidum and lateral preoptic area, project excitatory and inhibitory axons to the LHb.Excitatory projections can activate the LHb, which then partially exerts negative regulation on the DA and 5-HT systems via GABAergic projections from the interpeduncular nucleus, leading to aversive or anti-reward effects. Conversely, inhibitory projections have positive valence, reducing LHb activity and resulting in reward-related emotional effects. Thus, by receiving converging positive and negative valence signals, the LHb can serve as a hub that integrates information from various sources.

The medial septum (MS) has been found to transmit bottom-up aversive sensory signals to the LHb through its glutamatergic neurons, leading to negative emotion-related behaviors^[1]. Although GABAergic neurons in the MS have also been found to influence LHb neurons, their functional roles in different behavioral contexts remain unclear. The MS contains different types of GABAergic neurons, including those expressing parvalbumin (PV) and somatostatin (SOM), which may receive different inputs and form intra-nuclear connections with glutamatergic neurons. One possibility is that certain specific types of MS GABAergic neurons may receive and convert reward information from different sensory pathways to regulate appetitive behaviors.

On March 7, 2022, a team led by Li I. Zhang and Huizhong W. Tao from the University of Southern California published a research paper titled “A bottom-up reward pathway mediated by somatostatin neurons in the medial septum complex underlying appetitive learning” in Nature Communications.This study combined behavioral analysis, in vivo and in vitro electrophysiology, opto/chemogenetics, and pharmacological manipulation to discover that MS GABAergic neurons receive various forms of bottom-up aversive sensory information from the pontine nuclei and mediate related sensory-induced avoidance behaviors. SOM neurons in the MS respond to reward predictive cues through cue-reward associative learning and regulate the acquisition of cue-reward associations by projecting to the LHb^[2]. In summary, these results emphasize the important role of the MS in the formation of reward associative memories.

Scientists Reveal Reward Learning Pathway Mediated by Somatostatin Neurons

The researchers used Vgat-Cre mice and activated GABAergic neurons in the MS by injecting Cre-dependent channelrhodopsin-2 (ChR2) adeno-associated virus (AAV). They first employed a bidirectional self-stimulation paradigm to assess the reinforcing value associated with activation. Whenever the animal’s nose poked at the designated LED on the port, LED stimulation would be triggered, while poking at another port would not trigger any light stimulation (Figure 1a). During a 30-minute testing period, experimental mice showed a significantly higher number of pokes at the LED port compared to the control port; however, control mice expressing only GFP showed no significant difference in pokes between the two ports (Figure 1b, c). This result indicates that acute optogenetic activation of MS GABAergic neurons enhances self-stimulation behavior, which is an important hallmark of reward-encoding neurons. Furthermore, in the light-dark box test, animals expressing ChR2 spent significantly more time in the light chamber paired with LED stimulation than GFP control mice (Figure 1d, e). In the real-time place preference (RTPP) test, researchers found that Vgat-Cre animals expressing ChR2 spent more time in the stimulation chamber (Figure 1f, g). These results indicate that MS GABAergic neurons encode positive motivational value.

Figure 1: Activation of MS GABAergic neurons can enhance and drive positive valence.

Next, the researchers used optoelectronic recording on head-fixed awake mice licking sugar water to directly detect whether MS GABAergic neurons could respond to reward signals (Figure 2a). For ChR2-labeled (GABAergic) neurons, an increased firing rate was observed in the majority of recorded neurons (79%) after licking, while 13% showed no change in firing rate, and 8% showed decreased firing rate (Figure 2b, c). Among all the recorded neurons that exhibited excitatory responses to sucrose, 72% were labeled with ChR2 (Figure 2d). These results indicate that licking sucrose preferentially activates the GABAergic population. Additionally, similar recordings were performed on mice that were forced to consume quinine (a bitter agent), with no significant changes in firing rate observed (Figure 2g). In contrast to GABAergic neurons, Vglut2-Cre mice labeled MS glutamatergic neurons showed no response to sucrose consumption but were activated by quinine (Figure 2e-g), consistent with previous findings that the activity of these neurons encodes aversive emotional outcomes. In summary, these results suggest that MS GABAergic neurons are preferentially activated by reward signals. More importantly, their response to sucrose was enhanced with increased intake (Figure 2h, i), indicating that activity levels can encode reward value.

Figure 2: GABAergic neurons encode reward signals.

Research on rodents has indicated that taste information from peripheral receptors is concentrated and transmitted to the nucleus of the solitary tract (NST), which then relays it to the parabrachial nucleus (PBN)^[3]. The PBN generates many projections related to taste, some of which convey information to the taste cortex via the taste thalamus (VPMpc) before reaching the taste cortex. To determine the input of MS GABAergic neurons, the researchers performed cell-type-specific monosynaptic tracing using pseudorabies virus (Figure 3a). Numerous retrogradely labeled cells were observed in the PBN, but none were found in the NST, VPMpc, or taste cortex (Figure 3b). In vivo electrophysiological recordings showed that PBN neurons responded to sucrose with an increase in firing frequency (Figure 3c, d). By injecting AAV into the PBN of Vglut2-Cre mice for anterograde tracing, it was confirmed that PBN glutamatergic neurons project their axons to the MS (Figure 3e). Whole-cell recordings of MS neurons from Vglut2-Cre mice further confirmed monosynaptic connections between PBN glutamatergic axons and MS neurons (Figure 3f). These results indicate that the MS receives taste input directly from the PBN.

To further test in vivo whether the PBN transmits taste signals to MS GABAergic neurons, the researchers silenced neurons in both sides of the PBN with muscimol (Figure 3g). Recordings showed that the response of MS GABAergic neurons significantly decreased (by approximately 80%) when mice passively received sucrose (Figure 3h-j). Similarly, silencing neurons in both sides of the NST (Figure 3k) almost completely blocked the sucrose response of MS GABAergic neurons (Figure 3l-n).

Previously, sucrose-induced inhibitory responses could be observed in LHb neurons. In vivo recordings showed that the inhibitory response was largely reversed by silencing the MS (Figure 3o-r). Thus, the MSGABAergic projections to the LHb mediate the reward-related inhibition in the LHb. These data reveal a bottom-up pathway for transmitting appetitive taste information: from NST to PBN, then to MS GABAergic neurons, which subsequently inhibit LHb neurons.

Figure 3: Bottom-up sensory pathway for transmitting taste reward signals via MS.

Since MS GABAergic neurons encode reward signals, they may play a role in reward associative learning. The researchers employed a spatial reward learning paradigm to train mice to associate spatial locations with reward success (Figure 4a-c). They then chemically inhibited MS GABAergic neurons and found that this severely impaired spatial reward learning (Figure 4b, right), with longer latencies in locating rewards compared to the control group (Figure 4c). This result indicates that MS GABAergic neurons play an important role in cue-reward association learning.

Subsequently, using a cue-reward associative learning paradigm, mice learned to lick sucrose (the reward) after hearing a sound. Since MS GABAergic neurons can encode reward value, they may provide the unconditioned stimulus (US) signal for this associative learning. After silencing MS GABAergic neurons in Vgat-Cre mice using optogenetics (Figure 4d), the researchers conducted training to couple the conditioned stimulus (CS) and US (Figure 4e). Monitoring revealed that licking behavior significantly decreased after inhibiting GABAergic neurons (Figure 4f, g), further indicating that the US response of MS GABAergic neurons is necessary for forming cue-reward associations.

Figure 4: Reward associative learning requires MS GABAergic neurons.

To test how the response of MS GABAergic neurons is formed through learning, the researchers conducted two days of electrode recordings before, during, or after training (Figure 5a). In mice prior to training, no significant response of GABAergic neurons to the CS (tone) was observed (Figure 5b, f). On the first day of training, neurons exhibited a strong response to sucrose (US) while showing a weak response to the CS (Figure 5c, f). On the second day, the response to the CS became strong (Figure 5d, f). After two days of training, a strong response was maintained only to the CS stimulus (Figure 5e, f). On average, the response to the CS increased by 5.8 times compared to the first day (Figure 5f). These results demonstrate the plasticity of MS GABAergic neurons’ responses to reward predictive cues induced by learning.

Figure 5: Plasticity of MS GABAergic neurons’ responses to reward predictive cues.

Next, the researchers further distinguished the roles of GABAergic neuron subtypes in reward learning. They expressed Cre-dependent ChR2 in the MS of PV-Cre or SOM-Cre mice and performed optoelectronic recordings (Figure 6a). The experiment found that most SOM neurons were activated by sucrose (Figure 6b, d, e), while most PV neurons showed no response to sucrose (Figure 6c-e). Similarly, the response of SOM neurons to sucrose was stronger than their response to water, as indicated by the Ca2+ signals recorded via fiber photometry (Figure 6f, g), suggesting that they can encode reward value. The researchers further confirmed that SOM neurons receive excitatory monosynaptic input from the PBN (Figure 6h, i). Additionally, in the RTPP test, activation of SOM neurons produced location preference (Figure 6j-l), while activation of PV neurons led to weak location avoidance (Figure 6m, n). These results indicate that SOM neurons encode positive valence and transmit reward information; conversely, PV neurons encode negative valence and are less likely to participate in reward learning.

Figure 6: SOM neurons encode reward signals.

In the Pavlovian conditioning paradigm, the researchers detected the spike activity of SOM neurons before and during reward associative learning. Prior to conditioning, SOM neurons showed no response to the CS tone (Figure 7a), while on the first and second day of conditioning, their response to the CS tone gradually appeared and strengthened (Figure 7b-e), indicating that the SOM subtype of MS GABAergic neurons participates in reward associative learning.

Figure 7: SOM neurons increase responses to reward predictive cues during learning.

Subsequently, the researchers expressed Cre-dependent ArchT in the MS of SOM-Cre mice to perform optogenetic inhibition during the US window (Figure 8a-d) or the CS+ delay window (Figure 8e-h). Compared to the control group, the inhibition of SOM neurons during CS or US-related activities largely impaired reward associative learning (Figure 8d, h). Interestingly, in mice post-training, optogenetic inhibition of SOM neurons during the CS+ delay window did not affect licking behavior(Figure 8i–l), indicating that after acquiring associative learning, the activity of SOM neurons is not required to express cue-reward associations. In summary, SOM neurons in the MS are essential for acquiring reward associative learning.

Figure 8: Reward associative learning requires SOM neurons.

By silencing MS to block inhibitory reward responses in the LHb (Figure 3o–r), it suggests that MS SOM neurons may mediate reward associative learning via the LHb. The researchers found a strong projection from MS SOM neurons to the LHb through anterograde tracing (Figure 9a). In the RTPP test, light activation induced location preference, indicating that the projection from MS SOM to LHb is rewarding (Figure 9c, d). Additionally, optogenetic inhibition of the MS SOM→LHb projection led to impaired reward associative learning (Figure 9e-h). These data suggest that MS SOM neurons indeed regulate reward associative learning through projections to the LHb.

Figure 9: MS SOM-LHb pathway mediates reward associative learning.

In this study, the researchers discovered a bottom-up ascending sensory pathway mediated by MS SOM neurons (Figure 9i) that transmits rewarding taste signals. The above results reveal the physiological characteristics of neuron-specific encoding reward pathways, which are the basis for specific reward associative learning and provide new insights for research on neurological disorders related to emotions and learning.

Author’s Biography

References

[1] XIAO X, DENG H, FURLAN A, et al. A Genetically Defined Compartmentalized Striatal Direct Pathway for Negative Reinforcement [J]. Cell, 2020, 183(1): 211-27 e20.

[2] SHEN L, ZHANG G W, TAO C, et al. A bottom-up reward pathway mediated by somatostatin neurons in the medial septum complex underlying appetitive learning [J]. Nature communications, 2022, 13(1): 1194.

[3] KARIMNAMAZI H, TRAVERS S P, TRAVERS J B. Oral and gastric input to the parabrachial nucleus of the rat [J]. Brain Res, 2002, 957(2): 193-206.

Leave a Comment Cancel reply