Bad habits–good goals? Meta-analysis and translation of the habit construct to alcoholism
F. Giannone
C. Ebrahimi
T. Endrass
A. C. Hansson
F. Schlagenhauf
SimpleOriginal

Summary

Chronic alcohol exposure alters brain systems governing habit and goal-directed behavior. This transition blurs behavioral categories and contributes to addiction, suggesting AUD interventions must address neurobehavioral dynamics.

2024

Bad habits–good goals? Meta-analysis and translation of the habit construct to alcoholism

Keywords Alcohol Use Disorder; Habitual Behavior; Goal-Directed Control; Decision-Making; Neurobiology; Compulsivity; Animal Models; Human Studies; Model-Based Learning; Relapse

Abstract

Excessive alcohol consumption remains a global public health crisis, with millions suffering from alcohol use disorder (AUD, or simply “alcoholism”), leading to significantly reduced life expectancy. This review examines the interplay between habitual and goal-directed behaviors and the associated neurobiological changes induced by chronic alcohol exposure. Contrary to a strict habit-goal dichotomy, our meta-analysis of the published animal experiments combined with a review of human studies reveals a nuanced transition between these behavioral control systems, emphasizing the need for refined terminology to capture the probabilistic nature of decision biases in individuals with a history of chronic alcohol exposure. Furthermore, we distinguish habitual responding from compulsivity, viewing them as separate entities with diverse roles throughout the stages of the addiction cycle. By addressing species-specific differences and translational challenges in habit research, we provide insights to enhance future investigations and inform strategies for combatting AUD.

Introduction

‘Old habits die hard’—this folklore seems to aptly describe the challenges faced in addiction therapy. Furthermore, colloquial language often equates addiction with ‘bad’ habits. Even as early as the original edition of Webster’s Dictionary in 1828 [1], it was written: “Frequent drinking of spirits leads to a habit of intemperance. We should endeavor to correct evil habits by a change of practice.” Throughout history, alcohol consumption has been accompanied by individual tragedies and public health disasters, contributing to its controversial and sometimes hypocritical portrayal [2].

Currently, excessive alcohol use constitutes an ongoing public health crisis, accounting for ~5% of the global disease burden [3]. Alcohol dependence affects 2.6% of people aged 15+ years worldwide with much higher prevalence rates in many developed countries and causes more harm than illicit drugs [4, 5]. Very heavy drinking (>100 or 60 g/day for males or females, respectively), which for example involves ~0.8% of the population aged 15-65 years in Europe, leads to severe health consequences and dramatically reduced life expectancy [6].Alcohol dependence (often equated with severe alcohol use disorder, AUD) is characterized by a systematic bias towards choosing alcohol over healthier alternatives, and individuals continue to use alcohol despite adverse consequences, displaying signs of “compulsivity”. This evident resistance to change a dysfunctional behavior demands deeper understanding beyond simply attributing it to individual choice [7]. The question arises: is AUD simply a bad habit? To explore this, we delve into the origin of the term “habit” and its definitions in experimental psychology. According to the recent web edition of Merriam & Webster’s Dictionary [8], the most common use of the term habit refers to “a settled tendency or usual manner of behavior” originating from Latin—Habitus, but it is also described as “an acquired mode of behavior that has become nearly or completely involuntary”. The latter aspect is also part of how experimental psychology defines habits as learned associations between a stimulus, context or internal state and behavioral responses that become nearly or completely involuntary, independent of the outcome. In contrast, ‘goal-directed’ behavior is motivated by consequences and requires knowledge of the specific response outcomes. This habit-goal construct is operationalized by testing an operant conditioned response after devaluation of the reward, with the assumption that under habitual control, the response remains unaffected, while under goal-directed control, the subject reduces responding (see Box 1). The traditional dichotomous perspective of the original stimulus-response theory, wherein habitual and goal-directed control are viewed as mutually exclusive, is subject to debate. Contemporary interpretations suggest more nuanced and graded interactions, as elucidated in a recent comprehensive primer on habit theory [9]. Regardless of the deterministic nature of a habit-goal dichotomy, the dominance of excessive or dysfunctional habits has become a common explanation for the transition into compulsivity in drug addiction. Accordingly, prominent theories posit that habit formation indicate diminished control over drug seeking and taking, contrasted with goal-directed responding as a sign of behavioral control [10,11,12,13,14]. Opposing the habit theory of addiction, Hogarth [15] placed excessive goal-directed behavior at the core of addiction development, but this explanation seems equally deterministic as the behavioral automaticity construct.

In this review, we present a contemporary definition of habits and related constructs, along with their experimental operationalization in animals and humans (Box 1). We also explain the main neurobiological concepts related to habitual and goal-directed responding (Box 2, Fig. 1). Against this backdrop, we provide a literature review on animal and human experiments that specifically examined habit and goal-directed behavior in the context of alcohol use and AUD. These studies are compiled in Tables 1 and 2, with a meta-analysis of the animal studies provided in Fig. 2. Most studies assume a competition between goal-directed and habitual control systems, and we conclude that both mechanisms are integral parts of a complex decision-making process. When this process is strongly biased towards automatic responding, it can contribute to the development and maintenance of AUD .

Fig. 1: Neurocircuitry and experimental paradigms of striatal learning.

Fig 1

A Left: Corticostriatal projections originate from distinct non-overlapping populations of neurons. Cortical neurons were retrogradely labeled by injection of viral tracers (ssAAV-retro/2-hSyn1-mCherry-WPRE-hGHp and ssAAV-retro/2-hSyn1-EGFP-WPRE-hGHp) within the posterior dorsomedial DMS and anterior DLS. Fluorescence-labeled neurons of the mPFC project to DMS (green), while M1 neurons project to DLS (purple). In the OFC distinct populations are found that project either to DMS or DLS. Scale bar: 1 mm. Middle: Simplified representation of the prefrontocortical input to the dorsal striatum. Neurons from M1 and SMC project to the DLS (purple), mPFC neurons to the DMS (green), and OFC neurons project to both regions (purple and green). Striatal dopaminergic input from the VTA and SNc are shown in blue. Right: Coronal sections of the rodent and human brain showing the main striatal regions with black arrows representing ventromedial to dorsolateral information transfer. The dorsal part of the striatum can be subdivided into the DMS (rodents) and caudate nucleus (humans) (green) and the DLS (rodents) and putamen (humans) (purple). Functional aspects of this circuitry are described in Box 2. B Typical instrumental (operant box), spatial navigation (T-maze), and skilled walking (horizontal ladder) tasks that are used to assess biases of goal-directed or automatic response tendencies in rodents. C Human sequential decision-making task (2-step task) to assess model-based and model-free learning (see Box 1). In each trial, participants perform two sequential decisions at two stages in order to obtain probabilistic monetary rewards. In this version [118], participants start from planet Earth (1st stage) and choose between one out of two rockets, in order to land on one out of two planets (2nd stage), each inhabited by two different aliens. Importantly, the transition from 1st stage choice to the 2nd stage underlies a probabilistic structure: while one rocket flies commonly (70% probability) to the yellow planet and only rarely to the red one (30% probability), the inverse structure is true for the other rocket. In the 2nd stage, participants chose between one out of two aliens in order to obtain a reward. The reward probabilities associated with each 2nd stage alien vary slowly across trials according to Gaussian random walks in order to foster continuous learning across the task. That way the 2-step task allows to dissociate model-based from model-free behavior: While pure model-free control simply increases the choice probability of actions rewarded in previous trials, model-based control additionally considers if rewards followed a common or a rare 2nd stage transition, i.e., takes into account the underlying task structure. D Schematic learning curve of a behavior. Early into training the behavior may be less accurate or efficient (purple) but will gradually improve. At later time points, performance has stabilized but is less flexible (green) and resistant to interference. Behavioral automaticity such as degree of goal-directed responding can be assessed in early or late phase. Neuroanatomical abbreviations were used according to the rat brain atlas [119]: DLS dorsolateral striatum, DMS dorsomedial striatum, mPFC medial prefrontal cortex, Acb nucleus accumbens, OFC orbitofrontal cortex, M1 primary motor cortex, SMC sensorimotor cortex, SNc substantia nigra pars compacta, VTA ventral tegmental area.

Box 1 Definition and assessment of habits

A Habit is a rapidly activated specific response in a specific context, which has been repeatedly performed previously, it is in itself inflexible and shows some resistance to change (in context, outcome, motivation). The term “habits” is used on different levels of description, ranging from self-report to experimentally controlled operationalizations. In established operationalization mainly in animal studies, habitual responding is characterized by continued responding despite devaluation of the outcome (probed with devaluation tests) and by an insensitivity towards the causal relation between the response and the outcome (probed with contingency degradation). However, it has been shown that it is difficult to induce habits in human laboratory tasks and that animal paradigms cannot easily be translated to humans. Therefore, computational approaches allow characterizing the degree to which participants flexibly use the causal structure of the environment in goal-directed control or rigidly respond towards certain environmental stimuli in habitual control. A specific response lies on a continuum between habitual and goal-directed control and cannot be easily classified as belonging to one of two dichotomous systems.

Goal-directed behavior, which is often contrasted to habits, is performed based on knowledge of the specific outcome and its current motivational value. Thus, the subject distinctly learns the consequences of its action (also termed action-outcome learning, A-O), and thus uses knowledge about the outcome when choosing the action.

Compulsivity also refers to automatic behavioral responses but needs to be distinguished from habitual behavior. Compulsivity refers to a resistance to aversive consequences and is modeled in animals as maintaining a certain behavior despite aversive consequences such as electric foot shocks or taste aversiveness (e.g., by the bitter substance quinine). In addiction research, compulsive responding is conceptualized as a severe form of loss of control over behavior, and commonly a transition from habitual to compulsive responding is posited as the disorder progresses [12, 13]. Given the different constructs and assessment methods, it is unclear how a continuum or transition from habitual to compulsive responding can be demonstrated. Indeed, robust correlations between measures of habitual and compulsive responding have not been found [47, 48] and there are some differences in the neurobiology underlying both processes [120] (see also Box 2).

For assessment of habits and to distinguish it from goal-directed responses, Dickinson et al. [121] introduced the response to outcome devaluation as a discriminative criterion. Thus, if an outcome such as food is devalued, often by satiety or conditioned poisoning in a separate session without the previously reinforced behavior, goal-directed responding will cease while habit-controlled behavior will continue in a test session. Similarly, if the connection between response and outcome is weakened (contingency degradation), for example by suddenly delivering the food independently of a response at the lever, a continued responding would be interpreted as habitual responding. These methods have been shown to work well in animals, especially with food rewards. As expected, following extended training in an operant lever-pressing task, the effect of outcome devaluation is diminished, indicating a heightened manifestation of habitual responding.

Training schedules can influence the preferred mode of responding indicating that simply amassing the number of repetitions is not the best training method to induce behavioral automaticity. Thus, habitual responding occurs more often after schedules of reinforcement in which the first response after some variable time (e.g., random interval of 30 seconds, RI-30 schedule) compared to schedules where the rate or number of responses is the relevant factor (e.g., random rate of 10 responses, RR-10), whereby the latter schedule is rather resistant to habit formation [122]. This has become a common way to experimentally induce biases towards one or the other type of behavior and to compare them directly in devaluation tests. Although it is clear from slot machine gambling that uncertainty between action and outcome is a strong driver of habitual behavior, the factors responsible for the different outcomes after RI and RR schedules are not well understood. Such difference may be explained by the action-outcome contingency [121], temporal uncertainty of reward availability [123, 124], or schedule-induced stress [125, 126].

Skills describe a learned ability that involves improved performance acquired after extensive training. In contrast to habits, the performance of a skill requires conscious effort to initiate and improve and may therefore be rather goal-directed, the goal being the nearly perfect execution of a more or less complex motor program, for example, riding a bicycle or playing an instrument. However, changing acquired skills, such as switching between automatic and manual transmission cars, can also lead to difficulties. In addition, skills and habits share a common striatal neurocircuitry (see Box 2), but skills emphasize on ‘how the behavior is performed’ whereas habits refer to ‘which stimuli elicit the behavior’ [9, 127]. Skill learning in rodents can be assessed by a variety of tests, for example the skilled walking task [128, 129] (Fig. 1).

Translation to humans is well established for the above-mentioned principles of outcome devaluation [57, 59, 60, 130, 131] and contingency degradation [132,133,134]. However, habit induction in humans has proven difficult: in a large study using various outcome devaluation procedures, de Wit et al. [135]. failed to replicate previous reports that habitual tendencies indeed increase with extended training [57], a negative finding that has been recently reported also in animals [124, 125].

Another approach is based on reinforcement learning theories, distinguishing between habitual and goal-directed behavior as model-free and model-based control, respectively [64, 65]. According to this idea, humans either possess a cognitive map, understanding the task’s rules and structure, or simply repeat previously rewarded actions without representing the underlying state transitions. This is exemplified in the 2-step task, where participants make choices in an initial stage and then again in a second stage, leading to potential rewards [136]; Fig. 1. Computational modeling of behavioral responses yields parameters describing model-based and model-free behavior, such as balance parameters, rates of first and second-stage options, and perseverative and prediction errors.

Self-reports: Individuals frequently characterize their addictive behavior as a “habit,” yet the precise meaning behind this term often remains ambiguous. To refine this self-reflection, instruments such as the Creature of Habit Scale (COHS) can be utilized [137]. Research on cocaine use disorder has demonstrated a slight but significant increase in automaticity over time, as measured by the COHS, lending support to the validity of this instrument [106]. However, the extent to which the COHS aligns with behavioral assessments of the habit construct, such as reward devaluation tests, is unclear. Notably, the application of the COHS to alcohol use disorder (AUD) remains unexplored.

Box 2 Striatal mechanisms of habit formation

The main neurobiological concepts related to habitual and goal-directed responding revolve around the basal ganglia circuitry and propose a shift from dorsomedial (DMS) to dorsolateral (DLS) striatal involvement during the formation and execution of habitual control.

The cortico-basal ganglia circuitry is a complex neural network that regulates the affective and motor components of behavior. At its center is the striatum, which receives inputs from various brain regions, including dopaminergic inputs from the ventral tegmental area (VTA) and substantia nigra pars compacta (SNc), as well as glutamatergic inputs from the cortex, hippocampus, and amygdala. The striatum consists mainly of GABAergic medium spiny neurons (MSNs), which play a crucial role in modulating behavior through interactions with these inputs [138].

The striatum can be divided into two parts: the dorsal striatum and the ventral striatum. The ventral striatum includes the nucleus accumbens (Acb) and receives inputs from the VTA and prefrontal association cortices, specifically the medial prefrontal and orbitofrontal areas (mPFC, OFC). In contrast, the dorsal striatum receives inputs from the SNc and primarily from motor cortical regions (primary motor and sensorimotor cortex) but also from association cortices.

Traditionally, dopaminergic projections from the VTA to the ventral striatum are associated with reward processing and reward prediction error, while those from the SNc to the dorsal striatum are implicated in habit formation and motor skill learning, but there is strong overlap between these projections and processes [127, 139]. The output stations of the basal ganglia are the substantia nigra pars reticulata (SNr) and parts of the VTA, which send GABAergic projections to the thalamus. From there excitatory projections regulate activity in cortical fields inputs, completing a feedback loop [138].

Structurally, the striatal circuitry features two important projection patterns generated by the MSNs. One population forms the direct pathway (dMSNs), sending monosynaptic projections to the SNr. The other population forms the indirect pathway (iMSNs), projecting indirectly to basal ganglia output nuclei via the external globus pallidus (GP), the subthalamic nucleus, and other stations. dMSNs and iMSNs are further distinguished by further characterized by expressing dopamine receptors of the D1 or D2 subtype, respectively. Direct and indirect pathways are involved in fine-tuning behavior by providing “drive” or “brake” signals to the thalamus and cortical fields [140, 141]. However, recent research suggests that both dMSNs and iMSNs are concurrently activated during action initiation, with dMSNs exhibiting shorter latency [13].

The striatal circuitry also comprises overlapping spiraling striatal-midbrain-striatal loops organized topologically from ventromedial to dorsolateral regions. This organization is observed not only within the striatal regions but also in the corresponding midbrain, thalamic, and cortical inputs and outputs [142]. This topological organization is exemplarily shown in Fig. 1, with selective projections from mPFC to DMS and from motor cortex to DLS. Intriguingly, the OFC seems to play an intermediary role with distinct populations projecting to either DMS or DLS. These anatomical features enable information transfer from ventromedial to dorsolateral structures during learning of various tasks, such as operant responding, maze navigation, or motor skill learning [9].

Role of DMS and DLS in the development of behavioral automaticity: An exemplary demonstration of the medial-to-lateral transition in information processing within the dorsal striatum was observed in mice learning a simple motor task [38]. During the early acquisition phase of learning to stay on a rotating rod, activity was dominated by D1-expressing MSNs located in the DMS. As task performance became consolidated, D2-MSNs of the DLS showed increased activity. This shift was associated with cell type-specific changes in the excitability of MSNs and alterations in regional D1 and D2 expression [38, 143]. Similar shifts in activity patterns were observed in instrumental learning. Inactivation of the DMS led to an accelerated emergence of habitual responding, while lesions of the DLS preserved goal-directed responding even after extended training periods [40, 144]. Mice over-trained in an easy navigation task showed distinct activity patterns in the DLS. Neurons exhibited high activity at the start of the T-maze and at the end when approaching the reward, while during the middle phase of the task, as the mice were crossing the runway, DLS neurons were mostly silent [13]. This pattern of “task-bracketing” activity of the MSNs is coordinated by fast-spiking interneurons and was not observed in the DMS. Instead, DMS neurons fired consistently throughout the performance of a new routine and became disengaged with over-training around the time when task-bracketing in the DLS emerged [38, 100]. These findings suggest that both DMS and DLS regions are involved in parallel processing during initial task learning and when behavioral automaticity is setting in [9, 145]. The shift from DMS to DLS involvement in information processing appears to be critical for the development of habitual control over behavior, highlighting the dynamic nature of the cortico-basal ganglia circuitry in regulating goal-directed versus habitual responses.

Effects of cortical inputs: Efforts to understand the distinct roles of cortical inputs in regulating goal-directed and habitual behaviors have primarily focused on mPFC and the OFC. It is commonly believed that prefrontal hypoactivity facilitates habitual responding while activating striatal projections from the OFC and mPFC can counteract this effect [9, 14]. But the picture is more complex. For instance, contrasting effects in the balance between goal-directed and habitual behaviors have been proposed for two subregions of the mPFC: the infralimbic (IL) and prelimbic (PrL) areas. According to some studies, the IL may support habitual behavior, while the PrL promotes goal-directed behavior [146, 147]. Nevertheless, this model appears difficult to reconcile with a similar construct that has emerged in the drug and fear extinction field. This other model suggests that “Go” signals emanate from the PrL, while “No-Go” signals originate from the IL [118, 148]. Combining these findings, one would expect the IL to promote both extinction learning in drug seeking and increased responding in reward devaluation paradigms, and the opposite for the PrL.

Task-specific neuronal ensembles may offer a resolution to the discrepancies discussed above. This concept, originally proposed by Hebb [119] to explain the encoding of memories, has now been further developed to elucidate the reactivity to cue-reward associations. According to this concept, specific functions or tasks are encoded by discrete populations of neurons with distinct cell type identity, connectivity, and temporal coactivation patterns [149,150,151]. Local ensembles are topographically dispersed across the circuitry and form rapidly shifting meta-ensemble networks that support efficient and flexible on-demand decision-making [96]. The observation of response-specific dynamic network configurations, encompassing sparsely distributed neuronal populations from various brain regions, adds an additional layer of complexity to the encoding of different types of response probabilities. This complexity argues against models of dichotomous control, not only between DMS and DLS but also among various prefrontal inputs.

Brain mechanisms of compulsivity: Neural substrates of persistent responding despite negative consequences have been identified in circuits that are involved in emotional, social, and stress processing, such as insular cortex, amygdala, and the midbrain origins of the serotonergic and noradrenergic system [50, 152, 153]. Activation of these circuits influences the striatal circuitry. Additionally, prefrontal hypoactivity is widely believed to facilitate compulsive responding, while activating mPFC or OFC projections to the striatum can counteract it [12, 154]. An alternative role of the OFC in compulsivity has been proposed by Pascoli et al. [155]. In a new mouse model of addiction based on excessive optogenetically mediated self-stimulation of the VTA, the authors demonstrated that potentiation of synapses from the lateral OFC onto the dorsal striatum was associated with compulsive (punishment-resistant) responding. They concluded that an overactive OFC-dorsal striatal pathway could lead to an overestimation of the value of drug experience relative to punishment, biasing instrumental behavior towards drug-taking. Thus, the OFC appears to play a critical role in either facilitating or counteracting automaticity, likely depending on interactions of these projections with further inputs from motor or associative cortical areas.

Thus, distinct neural mechanisms have been identified that facilitate compulsive responding for rewards. Some of these mechanisms may overlap with the substrates underlying habitual response biases, but specific evidence for a mechanistic continuum from habitual-biased to persistent compulsive responding, as proposed by some addiction theories [12, 13] has not been presented so far.

Human brain data: In humans, examining experimental habit formation has been shown to be challenging e.g., de Wit et al. [135] and Tricomi et al. [57] were the first to demonstrate habitual behavior after extensive training in a free operant learning task, showing increasing activity in the dorsolateral posterior putamen as a neural correlate of habit formation. Interestingly, magnetic resonance spectroscopy studies revealed reduced glutamate turnover in the putamen of patients with cocaine use disorder, suggesting dysregulation of glutamatergic transmission in this region caused by chronic cocaine use [106]. This neurochemical deficit was related to increased habitual responding in a contingency degradation test. Additionally, investigations employing the 2-step task and its computational framework of reinforcement learning [136]; Box 1 and Fig. 1, have revealed specific roles for the ventral striatum during model-free learning and the ventral mPFC during model-based behavior [for meta-analysis], see [156].

Overall, the neurobiological data from both animal and human paradigms show some differences in processing between DMS and DLS during the transition from goal-directed to habitual control of responding. However, there is no consistent support for the common notion that the DLS is universally promoting automaticity, while the DMS may oppose it and instead promote goal-directed behavior. More likely are parallel information processing modes that allow rapid reorganization of behavioral strategies upon demand [9].

Animal studies

Laboratory animals easily acquire lever-pressing behavior for rewards (instrumental learning), typically food but also alcohol and other addictive drugs used by humans. After multiple self-administration sessions, animals may increase alcohol intake, which is likely due to the fact that alcohol is initially aversive to most rodents. Animals will reach a stable level of alcohol-self-administration, though this basal level of alcohol consumption is deemed not to reflect any addiction-like behavioral feature [16]. Also, such initial escalation is absent when food or other non-drug rewards drive motivation. However, animals can substantially escalate their alcohol intake in response to specific experimental manipulations (e.g., scheduled access, distinct cues or contexts, stress), which prompts inquiries about whether this increase reflects a loss of control over seeking and taking behavior, and to what extent habitual responding contributes to this phenomenon.

Rodent studies have mainly explored two key facets of alcohol’s impact on habitual responding after reward devaluation (for assessment of habitual or goal-directed responding see Box 1). One line of research investigates whether alcohol reinforcement exhibits more robust habit-forming properties than the consumption of food or other natural rewards. Another critical focus revolves around the consequences of prolonged or excessive alcohol exposure on habitual control and its potential generalization to other rewarding non-drug stimuli.

In a pioneering study, Dickinson and colleagues trained rats to press different levers for reinforcement by either alcohol solution or food pellets [17]. Results showed resistance to outcome devaluation (by lithium chloride poisoning) in the alcohol but not the food condition. In a similar study examining self-administration of sweetened alcohol and sugar solution in rats, findings showed after short training persistent responding following reward devaluation only for the alcohol group at, while after extended self-administration training both groups showed persistent responding in the devaluation test [18].

In a seminal study, Corbit et al. [19] examined the effects of training duration on rats self-administering alcohol. After 4 weeks of training, alcohol self-administering rats showed reduced sensitivity to satiety devaluation, while rats self-administering sucrose remained sensitive even after 8 weeks of training. However, sensitivity to satiety devaluation was not maintained in rats with additional non-contingent access to alcohol in their home cage. Moderate levels of regular alcohol consumption, below 0.5 g/kg/day, correlated with the degree of habitual responding, even without noticeable intoxication. Interestingly, inactivation of the dorsomedial striatum (DMS) led to faster development of habitual responding, while animals receiving lesions of the dorsolateral striatum (DLS) after 8 weeks of training restored goal-directed responding (for a brief primer on neurobiological mechanisms see Box 2). Follow-up experiments demonstrated that habitual alcohol self-administration was driven by dopamine D2 receptor and ionotropic (AMPA) glutamate receptor activation within the DLS [20]. Moreover, alcohol also induced increased AMPA receptor activity and dendritic branching in the DMS, specifically in D1-expressing medium spiny neurons (D1-MSNs) [21]. Simultaneous recordings from DMS and DLS neurons in rats self-administering alcohol by Fanelli et al. [22] demonstrated concomitant but specialized phasic firing patterns, where DMS neurons fired mostly time-locked to reinforcement or reward-predicting cues, while DLS activity was associated with lever-pressing, with minor differences under different schedules of reinforcement. Thus, on a structural or functional level there is little support for two opposing systems supporting either habitual and goal-directed control. Instead, distinct populations of neurons in the DMS and DLS may serve specific aspects of a behavioral control, and these populations respond differently to chronic alcohol exposure.

Collectively, these studies provide evidence supporting alcohol’s greater and faster potential to induce habit-forming effects compared to other non-drug rewards. It is worth noting that, except for Dickinson et al. [17], most studies used test paradigms with only one instrumental response. Such a limited decision space may not challenge cognitive resources to an extent that favors reliance on an automatic control system. In a notable experiment by Ostlund et al. [23], rats presented with two simultaneous food rewards exhibited reduced goal-directed control over instrumental responding in a context previously associated with alcohol (i.p. injections), but not with saline cues, highlighting the disruptive influence of alcohol-paired cues on decision-making and goal-directed actions.Also to mention, Shillinglaw et al. [24] failed to find evidence of habitual responding for alcohol or sucrose. The experiment assessed satiety devaluation and contingency degradation (the latter only at the end of the experiment). Besides, rats trained on low-caloric alternatives, such as 1.5% sucrose or non-sweet 10 mM monosodium glutamate, displayed insensitivity to reward devaluation by satiety but not by contingency degradation (reward delivery independent of the lever response). This dissociation between the two main reward devaluation methods suggests potential differences in the underlying neural mechanisms, although the study did not explore this aspect further.

Recent studies have focused on whether alcohol’s long-lasting (i.e., non-pharmacological) effects may affect decision-making systems underlying goal-directed actions, potentially leading to habitual tendencies. Pre-exposure to alcohol can occur through voluntary home cage access, scheduled operant self-administration or passive exposure via various administration routes. Among the latter, chronic intermittent alcohol vapor exposure (CIE) has become a popular rodent model of AUD, ensuring clinically relevant high blood alcohol levels (>1.5 g/l) and safely administered over weeks [25,26,27].

The Gremel lab conducted three studies using the CIE paradigm to investigate alcohol dependence’s effects on orbitofrontal cortex (OFC) function in goal-directed behavior. Mice with CIE treatment showed insensitivity to satiety devaluation, associated with alterations in OFC top-down control on striatal circuits. CIE reduced OFC excitability, and artificially increasing activity of OFC projection neurons during protracted abstinence restored sensitivity to outcome devaluation. In vivo extracellular recordings during the operant task revealed a long-lasting disruption in OFC function due to CIE, leading to enhanced activity associated with actions (lever response) but diminished activity during outcome-related information (reward collection) [28,29,30]. Thus, chronic alcohol exposure alters OFC activity critical for decision-making processes biasing for habitual responding, but the specific contributions of OFC need further study.Barker et al. [31] found that mice with CIE treatment prior to operant training displayed habitual tendencies for alcohol self-administration, but not for sucrose. Pre-exposure to chronic alcohol appeared to impair goal-directed alcohol-seeking more than sucrose-seeking behavior. However, another study using a high-alcohol diet for about 3 weeks found no effect on responding in a devaluation test [32]. Similarly, two other studies with chronic voluntary alcohol consumption prior to instrumental training and testing under conditions of simultaneously available devalued and non-devalued options reported no effects on outcome devaluation [33, 34]. The lack of effect on outcome devaluation may be due to lower alcohol exposure compared to the CIE paradigm. In the study by Ma et al. [34], alcohol-drinking rats displayed insensitivity to outcome devaluation when cognitive load was increased by contingency reversal (rewards were switched relative to the levers), suggesting that in rats with a chronic drinking, history engagement of habitual response strategies may occur with higher cognitive demands. This behavioral shift was associated with compromised function of cholinergic interneurons in the DMS, which regulate the activity of D1 and D2 receptors-containing MSNs, influencing behavioral flexibility. Optogenetic enhancement of thalamic input to these interneurons reduced the bias towards habitual responding in chronically alcohol-drinking rats.

Three studies explored age and sex influences on alcohol-induced habitual tendencies. Barker et al. [35] found sex differences after CIE. Operant responding for sucrose after CIE was less sensitive to satiety devaluation in adults compared to younger males, and only adolescent female rats showed habitual tendencies, indicating reduced behavioral control in younger females. On the other hand, chronic high alcohol exposure during late adolescence increased habitual responding in adulthood regardless of sex [36]. Even without a history of alcohol dependence, developmental differences in habitual tendencies towards alcohol reward were observed, with higher susceptibility in adults compared to adolescent rats [37]. The results suggest that susceptibility to alcohol-induced habit formation increases during the transition from adolescence to adulthood, particularly in male rats. However, more research is needed to better understand the effects of sex and age.

The question arises whether similar striatal learning processes, involving information transfer from medial to lateral structures (Box 2, [9, 38]), are mediated by the same striatal cell population. We conducted experiments with rats trained on a T-maze and an instrumental task and found that prior CIE treatment led to increased automatic responding in both tasks [39]. In addition to reduced sensitivity to reward devaluation, CIE-rats made more errors in a well-learned spatial navigation task, indicating the impact of chronic alcohol dependence on various aspects of action control beyond instrumental learning. These behavioral changes were strongly dependent on DMS function. Chemogenetic inhibition of this region increased habitual bias in normal rats, aligning with previous findings that suggest a key role of the DMS in both tasks [40, 41]. These experiments suggest overlapping cell populations controlling different behaviors beyond instrumental performance, implying that alcohol’s detrimental effects on these cells may not only affect reward-seeking but also other behaviors relying on striatal learning.

Certain pharmacologically targetable mechanisms have been investigated to understand their role in habitual biases. Notably, increased endocannabinoid signaling via CB1 receptors in the DLS appears to be crucial for habitual tendencies. Studies have shown that inhibitors of endocannabinoid synthesis or transport, as well as CB1 blockade, reduced responding for alcohol cues after contingency degradation or lithium devaluation, while CB1 agonists enhanced habitual responding [42]. The higher abundance of CB1 receptors in the DLS compared to the DMS allows for target specificity in systemic pharmacological approaches. Additionally, chronic alcohol exposure has been associated with neuroadaptations, including increased CB1 signaling, enhancing DLS control in learning [43]. These findings suggest the possibility of pharmacological interventions targeting habitual biases.

Moreover, injection of rapamycin, a specific mTORC1 inhibitor, into the OFC of chronically drinking rats reduced habitual responding for alcohol [44]. This effect is attributed to mTORC1’s role in local dendritic translation of synaptic proteins. Notably, mTORC1 is activated via phosphorylation by GluN2B, a subunit of the NMDA-type glutamate receptor complex, which is upregulated after prolonged alcohol exposure in the corticostriatal circuitry [45, 46]. Thus, mTORC1 signaling appears to be a critical mediator of alcohol-induced synaptic plasticity, and inhibition of this pathway may offer the potential to reverse these neuroadaptations and improve control over alcohol intake.

Table 1 Animal studies investigating the effects of alcohol on goal-directed vs. habitual control.

Table 1

Two studies examined whether insensitivity to outcome devaluation could be linked to resistance to punishment (compulsivity) and serve as a predictor of addiction progression. In a large cohort of male rats assessed over 60 weeks of alcohol access, an addiction severity score was computed based on various measures related to AUD [47]. This score identified a small group (5 out of 47 rats) displaying higher alcohol intake, increased motivation under a progressive ratio schedule, and reduced sensitivity to quinine adulteration, indicative of compulsivity. Surprisingly, these AUD-like rats did not differ from non-addicted rats in the satiety devaluation test after long-term operant self-administration training. In contrast, Giuliano et al. [48] found that individual differences in habitual control over alcohol seeking predicted the development of compulsive alcohol intake. Rats trained in an operant seeking-taking chain for alcohol self-administration displayed habitual tendencies, with the majority (17 out of 26) becoming resistant to outcome devaluation. Subsequent tests for compulsive behavior, including footshock during seeking and adulterated alcohol drinking, showed that a minority (7 out of 24) exhibited signs of compulsive intake, with six previously identified as habitual responders in the outcome devaluation test. It is worth noting that although the study suggests a connection between habitual seeking and compulsive intake, the majority of rats displaying habitual tendencies (10 out of 17) did not progress into compulsive alcohol intake.

Importantly, both experiments showed that despite long-term alcohol access, only a minority of rats developed addiction-like behavior characterized by resistance to negative consequences. This is consistent with other studies indicating that around 15–30% of outbred rats spontaneously exhibit persistent ethanol intake despite quinine adulteration or foot shock punishment, likely due to distinct genetic factors [49, 50]. Whether these factors influence the development of habitual responding remains uncertain.

Meta-analysis of rodent studies

We conducted a meta-analysis to address the diverse experimental variations in the aforementioned reports and draw robust conclusions. Therefore, we calculated standard effect sizes for each experiment by normalizing the difference in responding between reward-devalued and non-devalued conditions, irrespective of the specific reward devaluation used (satiety or contingency degradation). The meta-analysis included 10 studies with 17 independent experiments, totaling 404 animals. The 10 studies are marked in Table 1 and discussed in the section above.

Method

We conducted a PubMed search in December 2022 using the keywords: “(alcohol OR “alcohol addiction” OR “alcohol dependence”) AND (habits OR “habitual behavior”) AND (rats OR mice OR rodents)”. Initially, we screened 202 studies based on their abstracts to exclude non-relevant studies. The remaining papers were then assessed based on their full content. For studies deemed relevant after the full-text screening, we further examined their bibliographies to identify additional pertinent studies. From the refined selection, 10 studies were identified that compared alcohol pre-exposure versus control condition. We measured the effect size of devaluation/contingency degradation for both groups in each experiment from the selected studies, resulting in 25 total comparisons with 203 exposed and 201 non-exposed animals. Means and standard deviations (SD) from pre-test/test conditions were extracted from the graphs using WebPlotDigitizer 4.6. Given that the effect of interest was, for all experiments, derived by comparing a pre-test (non-devalued–non-degraded) to a test (devalued–degraded) condition from the same subjects in a repeated or matched design, we first calculated the Cohen’s dav [51], which is the ideal choice when the correlation coefficient “r” between the dependent measures is not available [52]. This was then converted to Hedges’ gav to correct for positive bias arising from small sample size [51, 53]. In situations where the same animals underwent two different satiety devaluation or contingency degradation tests under different instrumental conditions (e.g., varied schedules or rewards) resulting in two effect size measures from the same set of animals, such effects were averaged into one representative effect size to avoid inflation of the sample size [52, 54], resulting in 17 final ethanol versus control group comparisons. The variance for each individual effect size was calculated assuming r = 0, in order to avoid overestimation of the confidence of the effect size [53]. The variance of the representative averaged effect sizes derived from two dependent effects were calculated assuming r = 1, for the same purpose [53]. The methodology that we used for estimating the variance associated with the effect sizes was intentionally conservative: our primary objective was to rigorously test the robustness of the observed difference between control and ethanol-treated conditions. If an effect is found to be significant under these conditions, it is safe to assume that it would likely become more significant under less conservative (and more realistic) variance assumptions. Finally, we performed a subgroup meta-analysis comparing ethanol-treated/non-treated conditions using SPSS 29 with a random-effect model/REML estimator to account for between-study variability. Publication bias was assessed via Egger’s test, while heterogeneity across experiments was evaluated via I2. We additionally tested the robustness of our findings by employing different sensitivity analyses. For detailed explanation of the effect size and variance calculations, and of the sensitivity analyses used, see the Supplementary information.

Results

Our meta-analysis shows that alcohol treatment significantly affects animal behavior when tested for habitual responding as indicated by the forest plot (Fig. 2) and confirmed via meta-regression analysis (effect of Treatment: t = −3.891; p < 0.001 [CI = −0.903; −0.283]). Specifically, a zero difference between the pre-test and test condition indicates complete habitual behavior, while a decrease after devaluation signifies the degree of goal-directedness. The forest plot revealed a highly significant effect, indicating reduced goal-directed behavior in alcohol-exposed groups compared to controls. Notably, the alcohol-treated animals also significantly differed from zero, showing that chronic alcohol exposure didn’t trigger an all-or-nothing shift to habitual behavior but rather led to a gradual reduction in goal-directed responding. Meta-regression analysis indicated that various experimental factors, such as animal characteristics, alcohol exposure details, training parameters, and reward type, had no significant influence. There was no clear evidence of publication bias (Egger’s test—Ctrl: p = 0.800; EtOH: p = 0.196; Overall: = 0.012; Supplementary Figs. 1, 2), and heterogeneity across experiments was low [55] with I2 = 0.07. Finally, we confirmed our meta-analytical results by employing both “leave-one-out” and correlation coefficient sensitivity analyses (see Supplementary Information). Overall, our meta-analysis suggests a dimensional relationship between habitual and goal-directed control, which is compromised by prolonged or chronic alcohol exposure, rather than supporting a clear dichotomy between the two.

Fig. 2: Meta-analysis on the effect of chronic alcohol pre-exposure on responding in reward devaluation tests in rodents.

Fig 2

The forest plot shows the standardized effect size (Hedges’ gav) representing the difference in responding between reward-devalued and non-devalued conditions of the alcohol and control groups (17 comparisons) from 10 published studies. The experiments included testing different operant schedules (experiments Nr. 3–7), rewards (exp. Nr. 10), reward-lever contingencies (exp. Nr. 16), or time points (exp. Nr. 17). Blue and red squares represent the control and alcohol conditions, respectively, with their position relative to the x-axis indicating the effect size and their area representing their percent weight within the meta-analysis, based on the variance of the effect size. Horizontal lines indicate the confidence intervals (CI), with the values given in the adjacent table. Colored vertical bars represent the CI of the overall effect of control (blue) and alcohol (red) conditions, respectively, also displayed at the base of the plot. The vertical dashed line represents the zero effect, i.e., no devaluation or full habitual behavior. Study variables are shown to the right and include sex and age (early adolescent—EA, late adolescent—LA, adult—AD), route of alcohol administration (oral intake, intra-peritoneal injection, CIE vapor, shown by symbols), and type of reward (white pellets for sucrose pellets, brown pellets for food pellets, blue drops for sweet solution, yellow drops for alcohol solution, shown in symbols), and test condition (satiety devaluation—SD, contingency degradation—CD). Reward symbols beneath the left lever indicate that a single reward was tested (no choice), while those beneath both levers signify that two rewards were tested simultaneously (choice). In exps. Nr. 7 and 10, two rewards were tested separately (no choice).

Human studies

We found 9 human studies exploring the balance between goal-directed and habitual choice tendencies either in AUD and high-risk populations, or associations with AUD severity in large community samples (Table 2). Among these studies, seven employed a sequential decision-making task to distinguish between goal-directed (model-based) and model-free learning systems, while two used an outcome devaluation procedure. The focus of these investigations was primarily on instrumental habitual versus goal-directed decision-making related to non-drug rewards and contexts. In other words, they examined generalized habitual response tendencies for newly learned instrumental contingencies within a single session.

Table 2 Human studies investigating goal-directed vs. habitual control in AUD and at-risk populations.

Table 2

Currently, there is a lack of human studies investigating contingency degradation sensitivity in AUD or related conditions. Sjoerds and colleagues [56] conducted a study using an outcome devaluation procedure, which included instrumental learning and an outcome devaluation test. The results revealed that abstinent participants with AUD showed impaired action-outcome knowledge compared to healthy controls, indicating a greater reliance on habitual stimulus-response associations rather than goal-directed associations when learning new instrumental contingencies. Interestingly, participants with AUD exhibited increased posterior putamen activity during habitual learning, while control participants showed stronger BOLD-responses in vmPFC and anterior putamen during goal-directed learning. These findings are consistent with animal and human evidence highlighting the distinct roles of vmPFC and anterior putamen in supporting goal-directed behavior, while the posterior putamen plays a key role in habitual behavior (Fig. 1, Box 2) [57, 58]. However, AUD duration did not significantly correlate with behavioral or neural indices of goal-directed or habitual control [56]. It is important to note that the task utilized by Sjoerds et al. [56] lacked the slips-of-action test phase introduced in later task versions (e.g., de Wit et al. [59]), where instrumental responses for devalued vs. non-devalued outcomes are tested in extinction, providing a more direct translation from classical animal paradigms.

An interesting aspect of the study by Sjoerds et al. [56] is the use of two task versions, one with drug-unrelated stimuli (fruits) and the other with pictures of alcohol. Both behavioral and neural results did not differ between the two versions, suggesting that the alcohol binge context did not differentially influence habitual choice tendencies in individuals with AUD. Van Timmeren et al. [60] employed a different variant of the contingency degradation paradigm, incorporating a Pavlovian-to-Instrumental Transfer test [61] along with fMRI to compare abstinent AUD participants with a control group. During instrumental training, participants were trained to respond with left or right button presses, each associated with a different food snack. Subsequently, one of the outcomes was devalued using magnesium sulfate solution to induce a bitter taste and a video displaying waxworm-infested food. Both AUD and control participants showed significant devaluation effects, suggesting intact goal-directed control in AUD. Neuroimaging analysis comparing choices for devalued and non-devalued outcomes revealed no group differences or main task effects. It is essential to note that the two outcome devaluation studies differ considerably in their approach. Sjoerds et al. [56] assessed explicit response-outcome (R-O) knowledge after instructed devaluation, while van Timmeren et al. [60] investigated free instrumental responding after taste aversion-induced devaluation.

Indirect support for a shift from ventral to dorsal striatal activity was shown by Vollstädt‐Klein et al. [62], with reduced neural alcohol cue reactivity in heavy drinkers compared to social drinkers in the ventral striatum. In a follow-up study by Hornoiu and colleagues [63], self-reported automated alcohol craving and habitual alcohol consumption correlated with increased activation in dorsal striatal, pallidal, and prefrontal regions during the alcohol cue-reactivity task.

Besides the translational attempts from animal models, another line of human habit research has formalized habitual and goal-directed processes within a reinforcement learning framework in terms of model-free and model-based control, respectively (see Box 1, Fig. 1) [64, 65]. Sebold et al. [66] compared abstinent AUD participants and controls on performance in the 2-step task, finding reduced model-based, but unchanged model-free control in the AUD group. Model-based control specifically impaired in the non-reward condition, was attributable to cognitive speed differences between groups, highlighting the need to consider potential confounding factors. Further studies by Voon et al. [67], and Sebold et al. [68], did not find direct evidence of reduced model-based control in AUD participants. Nevertheless, model-based control predicted relapse status during a follow-up assessment, and prospective relapsers showed attenuated neural signatures of model-based control in the mPFC compared to controls and abstainers [68]. Additionally, the balance parameter ω scaled positively with abstinence duration in AUD participants [67]. Overall, these findings suggest that reduced model-based control may mediate relapse risk in AUD, but this impairment can recover with prolonged abstinence.

Doñamayor et al. [69] studied young severe binge-drinkers and controls, finding reduced model-based control in binge-drinking participants. Additionally, binge drinkers showed lower learning rates for first-stage options and increased perseverative errors in the 2-step task. However, Nebe et al. [70]., using a less strict criterion for binge-drinking in a community sample of 188 young male social drinkers (i.e., at least one-lifetime binge-drinking episode), found no differences in behavioral model-based vs. model-free control or associated neural reward-prediction error signals. They also found no correlations between these control measures and average alcohol consumption or age at drinking onset. A 3-year follow-up of the same cohort revealed that lower behavioral model-based control was associated with the development of binge-drinking over time, while increased model-free reward prediction error signals in ventral striatum and vmPFC were linked to increased alcohol consumption [71]. These findings complement Sebold et al. [68]. by suggesting the predictive power of the model-based and model-free learning balance for treatment outcomes and drinking trajectories.

Two online studies explored symptom dimensions across diagnostic categories related to goal-directed control. Gillan et al. [72]. found in a population sample of nearly 2000 participants a weak but significant negative association between model-based control and alcohol use severity assessed by AUDIT [73], specifically related to compulsive behavior and intrusive thoughts. Another online study found alcohol use to be unrelated to model-based control in a non-patient population of more than 800 participants using a simplified 2-step task [74].

Overall, human evidence for increased habitual tendencies in AUD is limited, and methodological differences between studies complicate direct comparisons. However, the 2-step studies highlight the predictive power of model-based control for relapse risk and drinking trajectories.

Discussion

The review highlights that rodent studies consistently show a decrease in goal-directed control and an increase in habitual tendencies after prolonged excessive alcohol experience. Our meta-analysis from more than 400 animals challenges the dichotomous view of habitual and goal-directed responding and provides evidence for a continuum, with chronic alcohol experience shifting the balance towards more habitual responding. Based on the amalgamated findings of published studies, assessing habitual tendencies emerges as a potential indicator of an AUD-like phenotype in animals. Importantly, our meta-analysis offers a framework exemplifying how to address the reproducibility crisis in preclinical research [75, 76], potentially leading to the adoption of more rigorous experimental designs. Ultimately, this may enhance the successful translation of animal findings, fostering a better understanding of human AUD.

The observed response bias in the meta-analysis seems independent of the manner of chronic alcohol experience. Given the substantial differences in experimental protocols regarding alcohol amount, duration, and administration mode, questions arise regarding neuroadaptations associated with habitual responding in these studies, differing quantitatively, qualitatively, or both. While direct comparisons between paradigms are lacking, recent studies shed light. Smith et al. [77]. investigated voluntary alcohol consumption’s effects with or without CIE exposure on brain-wide cFos expression. Regardless of CIE, a history of alcohol drinking induced significant neuroadaptive changes persisting into prolonged abstinence in the PFC and dorsal striatum. CIE and re-access to alcohol compounded the altered cFos response, particularly in the DMS. Additionally, Roland et al. [78]. identified brain regions, notably the dorsal striatum and amygdala, affected by drinking history, showing increased numbers of cFos-positive cells in high drinking compared to low drinking mice. Similarly, Lagström et al. [79]. conducted electrophysiological recordings in brain slices from rats with a 2-month history of intermittent alcohol access versus water-drinking controls. They found enhanced glutamatergic excitability in the DMS, with the opposite effect in the DLS, more pronounced in high compared to low-alcohol drinkers. Neuroadaptations in the DLS returned to control levels after 48 h of abstinence, while the DMS continued to show hyperglutamatergic excitability.

These results suggest varying alcohol exposure or consumption levels can induce similar neuroadaptive changes in the brain, with specific regions showing increased vulnerability to higher doses. Several independent reports implicate the DMS as a critical area, especially sensitive to higher alcohol doses and exhibiting long-term neuroadaptations persisting during prolonged abstinence. Such dose-dependent long-term neuroadaptations in rodents provide insights into the dose-dependent reduction in cognitive control by chronic drinking in humans, as evidenced by analyses of UK Biobank population data [80, 81] and AUD patients [82].

Understanding how chronic alcohol exposure leads to habitual response biases remains a challenge. Numerous aberrant neuroadaptations, culminating in the progressive reprogramming of the striatocortical circuitry, have been documented [83, 84]. This discussion focuses on two crucial pathological mechanisms associated with chronic alcohol exposure: loss of metabotropic glutamate receptor 2 (mGluR2) function and withdrawal-induced neuroinflammation.

Both rodent and human studies have identified a reduction in mGluR2 levels in the mPFC following chronic alcohol exposure [46]. This reduction diminishes long-term plasticity at corticostriatal synapses, leading to impaired executive control and heightened craving [85]. Notably, the mGluR2 deficit affects long-term depression (LTD), a synaptic plasticity form crucial for learning, and may contribute to increased activity of D1-MSN found post-chronic alcohol exposure [21, 83]. Additionally, chronic alcohol enhances output from DLS to substantia nigra pars reticulata and external globus pallidus, suggesting a preference for strengthening the sensorimotor circuit pathway. This disinhibition of DLS output allows for SMC control over behavior, indicating profound functional and structural plasticity alterations in distinct MSN subpopulations that govern reinforcement-related learning.

Another pathological mechanism observed in both humans and rats during early abstinence is progressive neuroinflammation. The microglia-mediated neuroinflammation affects the local diffusion dynamics of neuromodulators [86], potentially contributing to aberrant dopamine level fluctuations observed during protracted abstinence [87]. These fluctuations, characterized by hypo- or hyperdopaminergic states, may serve as vulnerability factors for diminished cognitive control, leading to craving and relapse [88, 89]. Moreover, alcohol-induced neuroinflammation damages white matter tract integrity [90], impairing effective communication in the brain as, for example from the hippocampus to the mPFC [91]. This impedes memory updating processes, such as the extinction of maladaptive memories, thereby decreasing cognitive flexibility.

The discussed alcohol-induced molecular and cellular pathologies, whether specific, as exemplified by mGluR2 alterations, or more general, through neuroinflammatory reactions, may systematically diminish efficiency or speed of communication within the brain. Consequently, less demanding information processing modes may be utilized, resulting in observed biases towards habitual response tendencies. Importantly, these alcohol-induced pathologies are reversible and represent promising targets for novel treatment approaches aimed at enhancing cognitive processing [85, 92]. The potential effects of these interventions on habitual response biases are currently under investigation [93].

Specific stimulus-response associations are encoded by discrete neuron populations known as neuronal ensembles. The existence of ensembles has been demonstrated for alcohol memories using activity-dependent silencing of task-engaged cell populations [94]. Intriguingly, the activity of a specific ensemble in the infralimbic cortex linked to cues signaling drug non-availability could suppress habitual responding for both alcohol and cocaine [95]. This means that even under conditions linked to habitual drug taking and seeking the animal still maintains its ability to regain control over behavior by responding to a different set of cues, and this control is mediated by a discrete set of neurons. Further, functional ensembles are dispersed across the circuitry and form dynamic meta-ensembles (networks of ensembles) encoding information temporarily according to demands thereby allowing efficient and flexible decision-making [96, 97]. The observation by Giannone et al. [39] of overlapping cell populations controlling different behaviors (i.e., instrumental responding and maze navigation) should be explored in the context of dynamic meta-ensembles using recently developed task- or time-specific cellular resolution monitoring techniques [e.g [98, 99]]. Also, the findings that distinct sets of DMS and OFC neurons are active during outcome revaluation and their activity correlates with the degree of goal-directedness but not with habit execution [100], emphasize the need for detailed exploration of the meta-ensembles associated with habitual or goal-directed responding in AUD models. Encouragingly, methods to identify similar types of sparse code in neuronal populations of humans by fMRI are currently being developed [101].

In humans, mixed results have been obtained, but some studies suggest reduced goal-directed control in individuals with AUD, which may be associated with increased relapse risk and alcohol use severity [68, 71, 72]. However, habitual responding in devaluation tests have proven difficult to establish in humans. A potential reason for this discrepancy between human and animal studies could be that the former typically employ secondary rewards such as money or points. Paradigms using oral or intravenous delivery of primary rewards (e.g., juice or alcohol) in human conditioning tasks have been recently established [102, 103]. We suggest adapting these for instrumental responding to improve comparability with animal studies. Additionally, simple motor learning tasks in humans may reveal response biases in AUD subjects, and if so such tasks should be easy to back-translate into animal experiments (e.g., skilled walking task, Fig. 1). Additionally, the 2-step task for assessing model-based versus model-free learning strategies shows promise in predicting drinking behavior or relapse in humans. The successful back-translation of this paradigm to rats and mice [104, 105] will strongly facilitate research on the neurobiological mechanism underlying biased decision-making in AUD.

The relatively modest outcomes of behavioral tasks aimed at uncovering habitual control sharply contrast with the widespread self-description of addictive behaviors, including those related to alcohol, as habitual, whereby the specific interpretation of this term by a subject remains ambiguous. This discrepancy is also evident in weak correlations between self-reports and behavioral measures of the same construct, as observed in patients with substance use disorder assessed with questionnaires evaluating automaticity and devaluation tasks [106]. Similar findings are frequently observed in many fields of experimental psychology, but the underlying reasons for this divergence are not well elucidated [107]. In part, it could be attributed to the disparity between controlled laboratory settings and the complex nature of real-life experiences and may also account for the very limited predictive power (low percentage range) of specific laboratory tasks in predicting alcohol and drug-taking behaviors in humans [106, 108]. Despite the practical challenges associated with their assessment, habitual or automatic response biases have been effectively addressed in the treatment of AUD. Training schedules designed to specifically diminish automatic approach biases towards alcoholic beverages have repeatedly demonstrated effectiveness in enhancing the long-term drinking outcomes of recovery programs [109, 110].

Conclusions and further directions

How can we integrate habitual response biases into the circle of addiction [11]? A bias towards habitual responding may be particularly important in the protracted withdrawal and anticipation stages, increasing the risk of relapse. However, in the intoxication stage, once a relapse has occurred, mechanisms of compulsivity may be more influential. In this context, we want to stress the distinction between compulsivity and habitual tendencies. Compulsivity is defined as persistent behavior despite adverse consequences, while habitual control is a momentary process that is context and cue-dependent. In animal experiments compulsive responding will persist over a long time or throughout an experimental session and is not strongly influenced by the settings. On the other hand, habitual control is observed only over brief periods after a stimulus and changes to a more adaptive mode often within minutes. Animal studies on alcohol behavior provide only weak support for a direct link between habitual and compulsive control [47, 48]. Although there is some overlap within corticostriatal circuits, compulsive drinking is strongly associated with stress, and emotional regulation, and particularly involves insula circuits [111,112,113]. Thus, in contrast to common beliefs [12, 13] habitual and compulsive responding are not likely to form a continuum. In our view, habitual biases act as moderators rather than mediators of the relationship between chronic alcohol use and the development of compulsivity. This holds true regardless of whether these biases are pre-existing or acquired through drug use, a question that warrants further investigation in future research.

Taken together, there is limited support for a strict habit-goal dichotomy, particularly in terms of habits being seen as a principal sign of impaired decision-making and loss of control in AUD, or goal-directed behavior being the key to preventing dysfunctional drinking. Indeed, the same behavior, such as animals pressing a lever, can arise from different control systems and potentially different neural circuits. Both goal-directed and automatic decision-making are essential for behavioral flexibility: the automatic system enables quick decisions with minimal cognitive resources, which can be allocated to the goal-directed system when executive control is needed in novel or critical situations. As a result, these two systems may work in parallel and interact in various ways, making it challenging to determine their relative contributions to the control of behavior.

The intricacies of this relationship are inadequately represented by a terminology rooted in a habit-goal dichotomy. Moreover, within the context of addiction the term “habit” carries negative connotations and might exacerbate the stigmatization of individuals affected by it [114]. Instead we suggest adopting a more precise terminology in the context of test paradigms, that describes the probabilistic nature of observed response biases. Phrases like “level of goal-directedness” or “degree of automaticity”, better capture the temporary and dimensional allocation of cognitive resources in complex decision-making processes.

Moving forward, future research should delve into the concepts of model-free and model-based decision-making, especially in rodent models, to address fundamental neurobiological questions about learning, behavioral control, and addiction. Detailed investigations into the molecular and cellular representation of dynamic decision-making, focusing on ensembles and meta-ensembles associated with different degrees of goal-directed responding in AUD models, will offer valuable insights, especially if also considering factors such as sex and age.

On the clinical front, while a lower degree of goal-directedness is consistently observed in reward devaluation tasks in animals with chronic alcohol exposure, the predictive power of similar tests in human studies, including the 2-step task, is limited. Consequently, the utility of these laboratory tests as p clinical markers for AUD severity, progression, or treatment response seems limited. As discussed, individuals with AUD tend to initially rely on less demanding cognitive response strategies, but these systematic biases seem insufficient to fully explain compulsive drug taking. Whether and to what extent individual habitual biases moderate the development of compulsivity remains a question that requires further exploration within theoretical frameworks of addiction.

In conclusion, the available data strongly support the biopsychological model of addiction [115] and a gradual rather than categorical distinction between more goal-directed versus habitual decision-making. External factors including stress may shift this balance. Refining our understanding of decision-making processes and response biases offers promising avenues for both basic research and clinical interventions in AUD.

Open Article as PDF

Abstract

Excessive alcohol consumption remains a global public health crisis, with millions suffering from alcohol use disorder (AUD, or simply “alcoholism”), leading to significantly reduced life expectancy. This review examines the interplay between habitual and goal-directed behaviors and the associated neurobiological changes induced by chronic alcohol exposure. Contrary to a strict habit-goal dichotomy, our meta-analysis of the published animal experiments combined with a review of human studies reveals a nuanced transition between these behavioral control systems, emphasizing the need for refined terminology to capture the probabilistic nature of decision biases in individuals with a history of chronic alcohol exposure. Furthermore, we distinguish habitual responding from compulsivity, viewing them as separate entities with diverse roles throughout the stages of the addiction cycle. By addressing species-specific differences and translational challenges in habit research, we provide insights to enhance future investigations and inform strategies for combatting AUD.

Introduction

The idea that "old habits die hard" accurately describes the challenges in treating addiction. Common language often links addiction to "bad" habits. Historically, excessive alcohol use has led to individual tragedies and public health crises.

Currently, excessive alcohol consumption is a major global health issue, contributing to about 5% of the worldwide disease burden. Alcohol dependence affects 2.6% of people aged 15 and older globally, with higher rates in many developed countries, causing more harm than illegal drugs. Very heavy drinking, which involves a small percentage of the population in Europe, leads to severe health problems and a significantly shorter life expectancy. Alcohol dependence, often called severe alcohol use disorder (AUD), involves a strong preference for alcohol over healthier options, with individuals continuing to drink despite harmful outcomes, showing "compulsive" behavior. This resistance to change dysfunctional behavior requires a deeper understanding beyond simply calling it a personal choice.

A key question is whether AUD is simply a "bad habit." To explore this, it is helpful to examine the term "habit" in experimental psychology. A common definition of "habit" refers to a "settled tendency or usual manner of behavior." It also describes "an acquired behavior that has become nearly or completely involuntary." In experimental psychology, habits are learned links between a stimulus, situation, or internal state and behavioral responses that become almost entirely involuntary, regardless of the outcome. In contrast, "goal-directed" behavior is driven by expected results and requires knowledge of specific outcomes.

The traditional view saw habits and goal-directed control as completely separate. However, current interpretations suggest a more complex, gradual interaction between them. Regardless of this distinction, the idea that excessive or dysfunctional habits explain the shift to compulsivity in drug addiction is widespread. Many theories suggest that habit formation means less control over drug seeking and use, while goal-directed behavior indicates control. Conversely, some theories propose that addiction is driven by excessive goal-directed behavior, which seems equally rigid.

This review defines habits and related concepts, including how they are studied in animals and humans. It also explains the main brain mechanisms involved in habitual and goal-directed behavior. The review then examines research on habit and goal-directed behavior in alcohol use and AUD, focusing on studies that assume these control systems compete. The conclusion is that both mechanisms are vital parts of a complex decision-making process. When this process strongly favors automatic responses, it can contribute to the development and continuation of AUD.

Animal studies

Laboratory animals easily learn to press levers for rewards, such as food, alcohol, or other drugs used by humans. After many self-administration sessions, animals may increase their alcohol intake. While initial escalation is not seen with non-drug rewards, animals can significantly increase alcohol consumption due to specific experimental factors like scheduled access or stress. This raises questions about whether this increase reflects a loss of control and how much habitual responding contributes.

Rodent studies have primarily investigated two aspects of alcohol's effect on habitual behavior after a reward's value is reduced. One area examines whether alcohol reinforcement leads to stronger habit formation than food or other natural rewards. Another key focus is how prolonged or excessive alcohol exposure affects habitual control and if this generalizes to other non-drug rewards.

Early research showed that rats trained to press levers for alcohol or food reacted differently to reward devaluation. Alcohol-seeking behavior was resistant to devaluation, unlike food-seeking behavior. Similar studies found that after short training, only alcohol-seeking showed persistent responding after devaluation, but after extended training, both alcohol and sugar solution self-administration became persistent.

A significant study explored the impact of training duration on rats self-administering alcohol. After four weeks, alcohol-administering rats were less sensitive to satiety devaluation, while sucrose-administering rats remained sensitive even after eight weeks. Interestingly, moderate regular alcohol consumption, even without intoxication, correlated with the degree of habitual responding. Brain imaging studies showed that inactivating a specific brain region, the dorsomedial striatum (DMS), led to faster habit development. In contrast, damage to the dorsolateral striatum (DLS) after eight weeks of training restored goal-directed behavior. Subsequent experiments found that habitual alcohol self-administration involved specific brain receptors in the DLS. Alcohol also increased activity and structural changes in the DMS. Simultaneous recordings showed DMS neurons primarily responded to rewards or cues predicting them, while DLS activity was linked to lever pressing. This suggests that different groups of neurons in the DMS and DLS may manage specific aspects of behavioral control, and these groups react differently to chronic alcohol exposure.

Collectively, these studies suggest alcohol has a greater and faster potential to induce habit-forming effects compared to other non-drug rewards. Most studies used test designs with only one instrumental response, which might not fully challenge cognitive resources in a way that reveals automatic control. One experiment showed that alcohol-associated cues reduced goal-directed control in rats choosing between two food rewards, indicating alcohol's disruptive effect on decision-making. However, some studies failed to find evidence of habitual responding for alcohol or sucrose. A distinction was observed between two main reward devaluation methods, suggesting different underlying brain mechanisms.

Recent research has focused on whether alcohol's long-term effects (beyond its immediate pharmacological impact) influence decision-making systems responsible for goal-directed actions, potentially leading to habitual tendencies. Alcohol pre-exposure can occur through voluntary access, scheduled self-administration, or passive exposure. Chronic intermittent alcohol vapor exposure (CIE) is a common rodent model for AUD, providing clinically relevant blood alcohol levels over weeks.

Studies using the CIE model have investigated alcohol dependence's effects on a brain region called the orbitofrontal cortex (OFC) in goal-directed behavior. Mice with CIE treatment showed insensitivity to satiety devaluation, linked to changes in the OFC's top-down control over other brain circuits. CIE reduced OFC activity, and artificially increasing the activity of OFC neurons during abstinence restored sensitivity to reward devaluation. Chronic alcohol exposure thus alters OFC activity vital for decision-making, favoring habitual responding, though more research is needed to understand OFC's specific contributions. Other studies found that pre-exposure to chronic alcohol impaired goal-directed alcohol-seeking more than sucrose-seeking behavior. However, other studies using different alcohol exposure methods found no effect on responding in devaluation tests, possibly due to lower alcohol exposure compared to CIE. Some research suggests that increased cognitive demands can trigger habitual response strategies in rats with a history of chronic drinking, associated with impaired function of specific brain cells in the DMS.

Three studies explored how age and sex influence alcohol-induced habitual tendencies. Sex differences were observed after CIE, with adolescent female rats showing reduced behavioral control. Chronic high alcohol exposure during late adolescence increased habitual responding in adulthood regardless of sex. Even without a history of alcohol dependence, adults showed higher susceptibility to alcohol-induced habit formation compared to adolescent rats, particularly males. More research is needed to fully understand sex and age effects.

The question arises whether similar brain learning processes, involving information transfer within the striatum, are mediated by the same brain cell populations. Experiments showed that prior CIE treatment led to increased automatic responding in both maze navigation and instrumental tasks. These behavioral changes depended heavily on DMS function, suggesting that alcohol's harmful effects on these cells may impact various action control behaviors beyond just reward-seeking.

Certain mechanisms that can be targeted by drugs have been studied for their role in habitual biases. Increased signaling in the DLS appears crucial for habitual tendencies. Studies show that blocking specific receptors reduces habitual responding, while activating them enhances it. Chronic alcohol exposure is linked to changes that increase DLS control in learning. These findings suggest potential drug interventions for habitual biases.

Additionally, a drug that inhibits a specific protein (mTORC1) in the OFC of chronically drinking rats reduced habitual alcohol-seeking. This effect is linked to mTORC1's role in creating new synaptic proteins, which is activated after prolonged alcohol exposure. Thus, this signaling pathway appears to be a critical mediator of alcohol-induced brain changes, and inhibiting it might reverse these changes and improve control over alcohol intake.

Two studies examined whether reduced sensitivity to reward devaluation could predict addiction progression. One study found that rats with an AUD-like severity score did not differ from non-addicted rats in a satiety devaluation test. In contrast, another study found that individual differences in habitual control over alcohol seeking predicted the development of compulsive alcohol intake. It is important to note that only a minority of rats in both experiments developed addiction-like behavior, despite long-term alcohol access, consistent with other studies. Whether genetic factors influence the development of habitual responding remains unclear.

Meta-analysis of rodent studies

A meta-analysis was conducted to address the varied experimental designs in previous reports and draw solid conclusions. Standardized effect sizes were calculated for each experiment by normalizing the difference in responding between reward-devalued and non-devalued conditions. The analysis included 10 studies with 17 independent experiments, totaling 404 animals.

The meta-analysis showed that alcohol treatment significantly affects animal behavior in tests for habitual responding. A zero difference between the pre-test and test condition indicates complete habitual behavior, while a decrease after devaluation signifies goal-directedness. The analysis revealed a highly significant effect, showing reduced goal-directed behavior in alcohol-exposed groups compared to controls. Notably, alcohol-treated animals still showed some goal-directed behavior, meaning chronic alcohol exposure did not cause a complete shift to habitual behavior but a gradual reduction in goal-directed responding. The analysis also indicated that various experimental factors, such as animal characteristics, alcohol exposure details, training parameters, and reward type, had no significant influence. There was no clear evidence of publication bias, and heterogeneity across experiments was low. These results were confirmed by sensitivity analyses. Overall, the meta-analysis suggests a continuous relationship between habitual and goal-directed control, which is compromised by prolonged or chronic alcohol exposure, rather than a clear separation between the two.

Human studies

Nine human studies explored the balance between goal-directed and habitual choice tendencies in individuals with AUD, high-risk populations, or large community samples. Most of these studies used a sequential decision-making task to distinguish between goal-directed (model-based) and model-free learning, while two used an outcome devaluation procedure. The focus was mainly on instrumental habitual versus goal-directed decision-making related to non-drug rewards and contexts, examining generalized habitual response tendencies for newly learned behaviors within a single session.

Currently, there is a lack of human studies investigating sensitivity to changes in reward delivery (contingency degradation) in AUD. One study using an outcome devaluation procedure found that abstinent individuals with AUD showed impaired knowledge of action-outcome relationships compared to healthy controls, suggesting a greater reliance on habitual associations rather than goal-directed ones when learning new instrumental behaviors. Interestingly, AUD participants showed increased activity in a specific brain region (posterior putamen) during habitual learning, while control participants showed stronger activity in other regions (vmPFC and anterior putamen) during goal-directed learning. These findings align with animal and human evidence highlighting the distinct roles of these brain areas in supporting goal-directed versus habitual behavior. However, AUD duration did not correlate significantly with behavioral or brain measures of control. It is important to note that this task lacked a "slips-of-action" test phase that directly tests responses for devalued outcomes, which is common in animal paradigms.

The same study used two task versions, one with non-drug stimuli and one with alcohol pictures. Behavioral and brain results did not differ between versions, suggesting the alcohol context did not uniquely influence habitual choice in AUD individuals. Another study used a different version of the devaluation paradigm, incorporating a test where Pavlovian cues influence instrumental actions, and found that both AUD and control participants showed significant devaluation effects, suggesting intact goal-directed control in AUD. Brain imaging showed no group differences. The two devaluation studies used considerably different methods, making direct comparisons difficult.

Indirect support for a shift in brain activity from ventral to dorsal striatum was shown in heavy drinkers compared to social drinkers, with reduced brain response to alcohol cues in the ventral striatum. A follow-up study found that self-reported automatic alcohol craving and habitual alcohol consumption correlated with increased activation in dorsal striatal, pallidal, and prefrontal regions during alcohol cue-reactivity tasks.

Beyond direct animal-to-human translations, another area of human habit research formalizes habitual and goal-directed processes within a reinforcement learning framework as model-free and model-based control. One study compared abstinent AUD participants and controls on a decision-making task, finding reduced model-based but unchanged model-free control in the AUD group. The impairment in model-based control was linked to cognitive speed differences between groups, emphasizing the need to consider other factors. Further studies did not find direct evidence of reduced model-based control in AUD participants. Nevertheless, model-based control predicted relapse status during follow-up, and individuals likely to relapse showed weaker brain signals of model-based control. Additionally, a balance parameter related to model-based control increased with abstinence duration in AUD participants. These findings suggest that reduced model-based control may mediate relapse risk in AUD, but this impairment can improve with longer abstinence.

Studies on young severe binge-drinkers found reduced model-based control and increased errors. However, another study of young male social drinkers with less strict criteria for binge-drinking found no differences in model-based vs. model-free control or associated brain signals. A follow-up of this group revealed that lower behavioral model-based control was associated with developing binge-drinking over time, while increased model-free reward signals in the brain were linked to increased alcohol consumption. These findings support the idea that the balance of model-based and model-free learning can predict treatment outcomes and drinking patterns.

Two online studies explored symptom dimensions related to goal-directed control. One found a weak but significant negative link between model-based control and alcohol use severity in a large population sample, specifically related to compulsive behavior and intrusive thoughts. Another online study found alcohol use was unrelated to model-based control in a non-patient population.

Overall, human evidence for increased habitual tendencies in AUD is limited, and methodological differences make direct comparisons challenging. However, studies using decision-making tasks highlight the predictive power of model-based control for relapse risk and drinking patterns.

Discussion

This review consistently shows that rodent studies indicate a decrease in goal-directed control and an increase in habitual tendencies after prolonged, excessive alcohol exposure. The meta-analysis, involving over 400 animals, challenges the idea of a strict separation between habitual and goal-directed responding, providing evidence for a continuum where chronic alcohol experience shifts the balance towards more habitual responses. Based on the combined findings, assessing habitual tendencies appears to be a potential indicator of an AUD-like state in animals. Importantly, this meta-analysis offers a framework for improving research rigor, potentially leading to better translation of animal findings and a deeper understanding of human AUD.

The observed shift towards habitual responses in the meta-analysis seems independent of the specific method of chronic alcohol exposure. Given the significant variations in experimental protocols regarding alcohol amount, duration, and administration, questions arise about whether the brain changes associated with habitual responding differ quantitatively, qualitatively, or both. While direct comparisons between paradigms are scarce, recent studies offer some insights. Research on voluntary alcohol consumption, with or without chronic intermittent alcohol vapor exposure, showed that a history of drinking induced significant brain changes, particularly in areas related to control and reward. Higher alcohol consumption levels also led to increased activity in specific brain regions. Similarly, electrophysiological recordings from rats with a history of intermittent alcohol access showed altered brain cell excitability, with some changes lasting even during prolonged abstinence.

These results suggest that different levels of alcohol exposure or consumption can induce similar brain changes, with certain regions being more vulnerable to higher doses. Several independent reports identify the DMS as a critical area, especially sensitive to higher alcohol doses and showing long-term brain changes that persist during abstinence. Such dose-dependent long-term brain adaptations in rodents provide insight into how chronic drinking reduces cognitive control in humans, as seen in population data and AUD patients.

Understanding how chronic alcohol exposure leads to habitual response biases remains a challenge. Many abnormal brain changes, resulting in the progressive reprogramming of brain circuits, have been documented. Two crucial pathological mechanisms linked to chronic alcohol exposure are the loss of function in specific brain receptors (mGluR2) and inflammation in the brain caused by withdrawal.

Both rodent and human studies have found reduced levels of mGluR2 in the mPFC after chronic alcohol exposure. This reduction impairs long-term brain plasticity, leading to reduced executive control and increased craving. This mGluR2 deficit affects a type of synaptic plasticity crucial for learning and may contribute to increased activity of certain neurons observed after chronic alcohol exposure. Additionally, chronic alcohol enhances output from the DLS to other brain regions, suggesting a preference for strengthening sensory-motor pathways. This indicates significant functional and structural changes in distinct neuron populations that control learning related to reinforcement.

Another pathological mechanism observed in both humans and rats during early abstinence is progressive brain inflammation. This inflammation, mediated by specific brain cells (microglia), affects the local movement of brain chemicals, potentially contributing to abnormal dopamine fluctuations seen during prolonged abstinence. These fluctuations, characterized by low or high dopamine states, may make individuals more vulnerable to reduced cognitive control, leading to craving and relapse. Furthermore, alcohol-induced brain inflammation damages nerve fibers, impairing effective communication in the brain, such as between memory regions and the prefrontal cortex. This hinders memory updating processes, like extinguishing maladaptive memories, thereby decreasing mental flexibility.

The described alcohol-induced molecular and cellular pathologies, whether specific (like mGluR2 changes) or more general (like brain inflammation), may systematically reduce the efficiency or speed of communication within the brain. As a result, less demanding information processing modes may be used, leading to the observed biases towards habitual responses. Importantly, these alcohol-induced pathologies are reversible and represent promising targets for new treatment approaches aimed at improving cognitive processing. The potential effects of these interventions on habitual response biases are currently under investigation.

Specific stimulus-response associations are stored by distinct groups of neurons called neuronal ensembles. The existence of these ensembles has been shown for alcohol memories. Interestingly, the activity of a specific ensemble in the infralimbic cortex, linked to cues signaling drug unavailability, could suppress habitual responding for both alcohol and cocaine. This means that even in situations linked to habitual drug use, the animal can still regain control over behavior by responding to different cues, and this control is mediated by a distinct set of neurons. Furthermore, functional ensembles are spread across brain circuits and form dynamic "meta-ensembles" (networks of ensembles) that temporarily encode information as needed, allowing for efficient and flexible decision-making. The observation of overlapping cell populations controlling different behaviors (e.g., instrumental responding and maze navigation) should be explored in the context of dynamic meta-ensembles using newer monitoring techniques. Also, findings that distinct sets of neurons are active during outcome re-evaluation and their activity correlates with the degree of goal-directedness, but not with habit performance, highlight the need to explore meta-ensembles associated with habitual or goal-directed responding in AUD models. Methods to identify similar patterns in human brain activity are currently being developed.

In humans, mixed results have been obtained, but some studies suggest reduced goal-directed control in individuals with AUD, which may be linked to increased relapse risk and alcohol use severity. However, establishing habitual responding in devaluation tests has proven difficult in humans. A possible reason for this difference between human and animal studies is that human studies typically use secondary rewards like money or points. Paradigms using direct delivery of primary rewards (e.g., juice or alcohol) in human conditioning tasks have recently been developed. Adapting these for instrumental responding could improve comparability with animal studies. Additionally, simple motor learning tasks in humans might reveal response biases in AUD subjects, and such tasks should be easy to translate back to animal experiments. The two-step task for assessing model-based versus model-free learning strategies also shows promise in predicting drinking behavior or relapse in humans. The successful translation of this paradigm to rats and mice will greatly facilitate research on the brain mechanisms underlying biased decision-making in AUD.

The relatively modest outcomes of behavioral tasks designed to uncover habitual control sharply contrast with the widespread self-description of addictive behaviors, including those related to alcohol, as habitual. This discrepancy is also evident in weak correlations between self-reports and behavioral measures of the same concept. Similar findings are often observed in many areas of experimental psychology, but the underlying reasons are not well understood. In part, this could be due to the difference between controlled laboratory settings and the complex nature of real-life experiences, and may also explain the very limited predictive power of specific laboratory tasks in predicting alcohol and drug-taking behaviors in humans. Despite the practical challenges in assessing them, habitual or automatic response biases have been effectively addressed in AUD treatment. Training programs specifically designed to reduce automatic approach biases towards alcoholic beverages have repeatedly shown effectiveness in improving long-term drinking outcomes in recovery programs.

Conclusions and further directions

This review integrates habitual response biases into the understanding of addiction. A bias towards habitual responding may be particularly important during the prolonged withdrawal and anticipation stages, increasing relapse risk. However, during the intoxication stage, once a relapse occurs, mechanisms of compulsivity may be more influential. It is important to distinguish between compulsivity and habitual tendencies. Compulsivity is defined as persistent behavior despite negative consequences, while habitual control is a momentary process dependent on context and cues. In animal experiments, compulsive responding persists over long periods and is not strongly influenced by settings. On the other hand, habitual control is observed only for brief periods after a stimulus and often shifts to a more adaptive mode within minutes. Animal studies on alcohol behavior provide weak support for a direct link between habitual and compulsive control. While there is some overlap in brain circuits, compulsive drinking is strongly associated with stress and emotional regulation, particularly involving insula circuits. Therefore, contrary to common beliefs, habitual and compulsive responding are not likely to form a single continuum. Instead, habitual biases act as moderators rather than direct causes of the relationship between chronic alcohol use and the development of compulsivity. Whether these biases are pre-existing or acquired through drug use requires further investigation.

Overall, there is limited support for a strict division between habit and goal, particularly regarding habits being seen as a primary sign of impaired decision-making and loss of control in AUD, or goal-directed behavior being the key to preventing dysfunctional drinking. Indeed, the same behavior can arise from different control systems and potentially different neural circuits. Both goal-directed and automatic decision-making are essential for behavioral flexibility: the automatic system allows quick decisions with minimal mental effort, freeing up cognitive resources for the goal-directed system when executive control is needed in new or critical situations. As a result, these two systems may work in parallel and interact in various ways, making it challenging to determine their relative contributions to behavior control.

The complexities of this relationship are not fully captured by terminology based on a strict habit-goal division. Furthermore, within the context of addiction, the term "habit" carries negative connotations and might worsen the stigmatization of affected individuals. Instead, it is suggested to adopt more precise terminology when describing test paradigms, using phrases like "level of goal-directedness" or "degree of automaticity." These terms better capture the temporary and dimensional allocation of mental resources in complex decision-making processes.

Moving forward, future research should delve into the concepts of model-free and model-based decision-making, especially in rodent models, to address fundamental brain questions about learning, behavioral control, and addiction. Detailed investigations into the molecular and cellular representation of dynamic decision-making, focusing on specific neuron groups associated with different degrees of goal-directed responding in AUD models, will offer valuable insights, particularly when considering factors such as sex and age.

On the clinical front, while a lower degree of goal-directedness is consistently observed in reward devaluation tasks in animals with chronic alcohol exposure, the predictive power of similar tests in human studies, including the two-step task, is limited. Consequently, the usefulness of these laboratory tests as clinical markers for AUD severity, progression, or treatment response appears limited. As discussed, individuals with AUD tend to initially rely on less demanding cognitive response strategies, but these systematic biases seem insufficient to fully explain compulsive drug taking. Whether and to what extent individual habitual biases influence the development of compulsivity remains a question that requires further exploration within theoretical frameworks of addiction.

In conclusion, the available data strongly support a biopsychological model of addiction and a gradual rather than categorical distinction between more goal-directed versus habitual decision-making. External factors, including stress, may shift this balance. Refining the understanding of decision-making processes and response biases offers promising avenues for both basic research and clinical interventions in AUD.

Open Article as PDF

Abstract

Excessive alcohol consumption remains a global public health crisis, with millions suffering from alcohol use disorder (AUD, or simply “alcoholism”), leading to significantly reduced life expectancy. This review examines the interplay between habitual and goal-directed behaviors and the associated neurobiological changes induced by chronic alcohol exposure. Contrary to a strict habit-goal dichotomy, our meta-analysis of the published animal experiments combined with a review of human studies reveals a nuanced transition between these behavioral control systems, emphasizing the need for refined terminology to capture the probabilistic nature of decision biases in individuals with a history of chronic alcohol exposure. Furthermore, we distinguish habitual responding from compulsivity, viewing them as separate entities with diverse roles throughout the stages of the addiction cycle. By addressing species-specific differences and translational challenges in habit research, we provide insights to enhance future investigations and inform strategies for combatting AUD.

Introduction

The saying "Old habits die hard" accurately describes the difficulties in treating addiction. Everyday language often links addiction to "bad" habits. For example, an early dictionary from 1828 noted that frequent drinking leads to a "habit of intemperance," suggesting that harmful habits should be corrected by changing one's behavior. Historically, alcohol use has been linked to personal suffering and widespread health problems, leading to its complex and sometimes contradictory image.

Currently, excessive alcohol use is a serious global public health crisis, contributing to about 5% of worldwide disease burden. Alcohol dependence affects 2.6% of people aged 15 and older globally, with much higher rates in many developed nations, and it causes more harm than illegal drugs. Very heavy drinking, defined as over 100 grams per day for males or 60 grams per day for females (which affects about 0.8% of Europeans aged 15-65), leads to severe health issues and significantly reduces life expectancy.

Alcohol dependence, often known as severe alcohol use disorder (AUD), involves a consistent preference for alcohol over healthier options, with individuals continuing to drink despite negative effects, showing signs of "compulsivity." This resistance to changing harmful behavior needs more understanding than simply labeling it an individual choice. This raises the question: is AUD just a bad habit? To explore this, one can examine the term "habit" and its definitions in experimental psychology. A common definition of "habit" is a "settled tendency or usual manner of behavior," but it also refers to an "acquired mode of behavior that has become nearly or completely involuntary." In experimental psychology, habits are defined as learned connections between a stimulus, context, or internal state and behavioral responses that become almost or entirely automatic, regardless of the outcome. In contrast, "goal-directed" behavior is driven by expected results and requires knowing the specific outcomes of actions. This distinction between habit and goal-directed behavior is tested by observing a learned response after a reward's value is reduced. It is assumed that if a behavior is habitual, it will not change even if the reward is devalued, but if it is goal-directed, the behavior will decrease. The traditional idea that habitual and goal-directed controls are entirely separate is debated; newer views suggest more complex interactions. Nonetheless, the idea that excessive habits explain the shift to compulsive drug addiction is common. Leading theories suggest that forming habits means losing control over drug-seeking and taking, while goal-directed behavior indicates behavioral control. However, another perspective argues that excessive goal-directed behavior is central to addiction's development, which also presents a fixed view of behavior.

This review defines habits and related ideas, explaining how they are studied in animals and humans. It also covers the key brain concepts connected to habitual and goal-directed responses. The review then examines animal and human studies specifically looking at habit and goal-directed behavior in alcohol use and AUD. Most of these studies assume that goal-directed and habitual control systems compete. The conclusion is that both mechanisms are crucial parts of a complex decision-making process. When this process strongly favors automatic responses, it can play a role in the development and persistence of AUD.

Animal studies

Laboratory animals readily learn to press levers for rewards, including food, alcohol, and other substances used by humans. While animals initially find alcohol unappealing, they can increase their intake over time, especially under specific experimental conditions like scheduled access or stress. Researchers then investigate if this increased intake indicates a loss of control and how much habitual behavior contributes. Rodent studies have primarily focused on two aspects of alcohol's effect on habits after a reward's value is reduced: whether alcohol is more habit-forming than natural rewards, and how long-term alcohol exposure affects habitual control and its impact on other non-drug rewards.

Early studies demonstrated that alcohol consumption in rats led to behaviors resistant to reward devaluation, unlike food or sugar solutions, especially after shorter training periods for alcohol. One significant study found that rats given alcohol for four weeks showed less sensitivity to feeling full after alcohol intake, while sucrose-fed rats remained sensitive even after eight weeks. However, this sensitivity was lost if rats also had non-scheduled access to alcohol. Even moderate alcohol intake correlated with habitual behavior. Importantly, brain regions like the dorsomedial striatum (DMS) and dorsolateral striatum (DLS) play distinct roles: inactivating the DMS accelerated habit formation, while DLS lesions restored goal-directed behavior. Research indicates that habitual alcohol self-administration involves dopamine D2 and AMPA glutamate receptors in the DLS. Alcohol also affected AMPA receptor activity in the DMS. Simultaneous recordings showed DMS neurons primarily activate with reward cues, while DLS activity is linked to lever-pressing, suggesting that these regions contribute differently to behavioral control and respond uniquely to chronic alcohol exposure. Overall, these findings suggest alcohol has a greater and faster potential to create habits than non-drug rewards. Some studies, however, did not find strong evidence of habitual responding for alcohol or sucrose, possibly due to task limitations or differences in devaluation methods.

Recent research investigates how alcohol's long-term effects influence decision-making for goal-directed actions, potentially leading to habitual patterns. Animals can be exposed to alcohol through voluntary drinking or passive methods like chronic intermittent alcohol vapor exposure (CIE), a common model for AUD that creates high blood alcohol levels. Studies using CIE have shown that mice become less sensitive to satiety devaluation, which is linked to changes in the orbitofrontal cortex (OFC), a brain region critical for top-down control. CIE reduces OFC activity, and boosting its activity can restore sensitivity to outcome devaluation. Chronic alcohol exposure alters OFC function, favoring habitual responses, though more research on its specific role is needed. Some studies found CIE-treated mice developed habitual alcohol-seeking but not sucrose-seeking, suggesting alcohol may impair goal-directed alcohol behavior specifically. However, other studies with lower alcohol exposure did not find changes in devaluation tests. One study showed that rats with a history of chronic drinking became insensitive to devaluation when cognitive demands increased, indicating habitual strategies might be engaged under higher cognitive load. This shift was linked to impaired cholinergic neurons in the DMS, which affect behavioral flexibility, and enhancing their input could reduce habitual bias.

Research has also investigated how age and sex influence alcohol-induced habits. One study found sex differences after CIE, with adult male rats showing less sensitivity to satiety devaluation for sucrose than younger males, and only adolescent female rats displaying habitual tendencies. Conversely, chronic high alcohol exposure during late adolescence increased habitual responding in adulthood for both sexes. Even without a history of alcohol dependence, adult rats showed higher susceptibility to alcohol-induced habits than adolescents, suggesting that habit formation risk increases from adolescence to adulthood, particularly in male rats. Further research is needed on these age and sex effects. Experiments with rats undergoing CIE treatment showed increased automatic responses in both T-maze spatial navigation and instrumental tasks. CIE-exposed rats also made more errors in navigation, indicating chronic alcohol dependence impacts action control beyond simple learned behaviors. These changes depended heavily on DMS function, as inhibiting this area in normal rats increased habitual bias. These findings suggest that alcohol's harmful effects on these cells can impact various behaviors relying on striatal learning, not just reward-seeking.

Researchers have also examined specific brain mechanisms that can be targeted with drugs to understand their role in habitual biases. Increased endocannabinoid signaling through CB1 receptors in the DLS appears vital for habitual tendencies. Studies show that blocking or inhibiting endocannabinoid effects reduces habitual responses to alcohol cues, while enhancing them increases habitual behavior. Since CB1 receptors are more abundant in the DLS than the DMS, this allows for more specific drug targeting. Chronic alcohol exposure also increases CB1 signaling, strengthening DLS control in learning, which suggests potential drug interventions for habitual biases. Furthermore, injecting rapamycin, a specific inhibitor, into the OFC of rats with chronic alcohol use reduced their habitual alcohol responses. This is because rapamycin affects mTORC1, a protein involved in building synaptic connections, which is overactive after long-term alcohol exposure. Thus, controlling mTORC1 signaling might reverse alcohol-induced brain changes and help improve control over alcohol intake. Two studies investigated whether insensitivity to reward devaluation could predict addiction progression and link to compulsive behavior (resistance to punishment). One study with a large group of male rats found that a small subset (5 out of 47) developed addiction-like behaviors, showing higher alcohol intake, increased motivation, and resistance to negative consequences like quinine-adulterated alcohol. However, these rats did not show differences in a satiety devaluation test compared to non-addicted rats. In contrast, another study found that individual differences in habitual control over alcohol seeking did predict the development of compulsive alcohol intake. Most rats in this study developed habitual tendencies, but only a minority of these then progressed to compulsive intake, suggesting a partial but not complete link. Both studies highlight that despite long-term alcohol access, only a minority of rats develop addiction-like behavior with resistance to negative consequences, which aligns with findings that some rats naturally show persistent alcohol intake despite punishment, possibly due to genetic factors. Whether these genetic factors affect habitual responding is still unknown.

Meta-analysis of rodent studies

A meta-analysis was performed to address the varying experimental conditions in previous reports and to draw more reliable conclusions. Standard effect sizes were calculated for each experiment by normalizing the difference in responses between reward-devalued and non-devalued conditions, regardless of the specific devaluation method used. This meta-analysis included 10 studies with 17 independent experiments, involving a total of 404 animals. These 10 studies are listed in Table 1 and discussed earlier.

Method

A PubMed search was conducted in December 2022 using keywords such as "alcohol addiction," "habits," "rats," and "mice." From an initial screening of 202 abstracts, relevant studies were identified, and their bibliographies were reviewed for additional papers. This process yielded 10 studies comparing alcohol pre-exposure to a control condition. The effect size of devaluation or contingency degradation was measured for both groups in each experiment, resulting in 25 comparisons involving 203 exposed and 201 non-exposed animals. Means and standard deviations from pre-test and test conditions were extracted from graphs. Since the effect of interest was derived by comparing pre-test to test conditions within the same subjects, Cohen’s dav was initially calculated, then converted to Hedges’ gav to correct for small sample size bias. When animals underwent multiple tests, their effect sizes were averaged into a single representative effect size to prevent sample size inflation, resulting in 17 final comparisons between ethanol and control groups. Variance for individual effect sizes was calculated assuming a correlation coefficient of 0 to avoid overestimating confidence. The method for estimating variance was conservative to rigorously test the robustness of findings. A subgroup meta-analysis compared alcohol-treated and non-treated conditions using a random-effect model. Publication bias was assessed using Egger’s test, and heterogeneity across experiments was evaluated using I2. The robustness of the findings was further confirmed through various sensitivity analyses.

Results

The meta-analysis demonstrates that alcohol treatment significantly impacts animal behavior related to habitual responding. A zero difference between the pre-test and test condition indicates fully habitual behavior, while a decrease after reward devaluation suggests goal-directed behavior. The findings show a significant reduction in goal-directed behavior in alcohol-exposed groups compared to controls. Alcohol-treated animals also showed a significant difference from zero, indicating that chronic alcohol exposure led to a gradual decrease in goal-directed responding rather than a complete shift to habitual behavior. Meta-regression analysis revealed that experimental factors like animal characteristics, alcohol exposure details, training parameters, and reward type did not significantly influence the results. There was no strong evidence of publication bias, and heterogeneity across experiments was low. The meta-analytical results were further confirmed by sensitivity analyses. Overall, the meta-analysis suggests a continuous relationship between habitual and goal-directed control, which is impaired by prolonged or chronic alcohol exposure, rather than a clear separation.

Human studies

Nine human studies have explored the balance between goal-directed and habitual choices in individuals with Alcohol Use Disorder (AUD) or those at high risk, as well as in broader community samples to assess links with AUD severity. Most of these studies used tasks to differentiate between model-based (goal-directed) and model-free learning systems, while two used outcome devaluation procedures. These investigations primarily focused on how humans make decisions for non-drug rewards in new learning situations, essentially examining generalized habitual responses for new learned behaviors within a single session. There is currently a shortage of human studies examining sensitivity to contingency degradation in AUD. One study using an outcome devaluation procedure found that abstinent AUD participants had poorer understanding of action-outcome relationships compared to healthy individuals, indicating a greater reliance on habitual stimulus-response links rather than goal-directed ones during new learning. AUD participants showed increased activity in the posterior putamen during habitual learning, while control participants showed stronger activity in the ventromedial prefrontal cortex (vmPFC) and anterior putamen during goal-directed learning. These findings align with animal and human evidence showing distinct brain roles in supporting goal-directed versus habitual behavior. However, the duration of AUD did not correlate with behavioral or neural indicators of control.

An interesting finding from one study was that using either drug-related (alcohol pictures) or drug-unrelated (fruit pictures) stimuli did not change behavioral or neural results, suggesting that the alcohol context itself did not uniquely affect habitual choices in AUD individuals. Another study used a different version of the contingency degradation paradigm, incorporating a Pavlovian-to-Instrumental Transfer test and fMRI to compare abstinent AUD participants with controls. Participants were trained to press buttons for different food snacks, then one snack was devalued by making it taste bitter and showing it as unappetizing. Both AUD and control groups showed significant devaluation effects, suggesting that goal-directed control remained intact in AUD. Brain imaging analysis found no group differences or major task effects when comparing choices for devalued versus non-devalued outcomes. It is important to note that these two devaluation studies used quite different methods: one assessed explicit knowledge after direct instruction, while the other examined spontaneous responses after creating a taste aversion.

Indirect evidence suggests a shift in brain activity from the ventral to the dorsal striatum. One study found that heavy drinkers had reduced neural reactivity to alcohol cues in the ventral striatum compared to social drinkers. A follow-up study showed that self-reported automatic alcohol craving and habitual alcohol use were linked to increased activity in dorsal striatal, pallidal, and prefrontal brain regions during an alcohol cue-reactivity task.

Beyond studies directly translating from animal models, another area of human habit research has described habitual and goal-directed processes within a reinforcement learning framework, termed model-free and model-based control. One study comparing abstinent AUD participants and controls on a "2-step task" found that AUD individuals showed reduced model-based control, but their model-free control remained unchanged. This impairment in model-based control, particularly in non-reward conditions, appeared linked to differences in cognitive speed between groups, emphasizing the need to account for such factors. While other studies did not find direct evidence of reduced model-based control in AUD participants, model-based control did predict relapse during follow-up assessments. Individuals who relapsed showed weaker neural signs of model-based control in the medial prefrontal cortex (mPFC). Furthermore, the balance between these two types of control improved with longer periods of abstinence in AUD participants. These results suggest that reduced model-based control might contribute to relapse risk in AUD, but this impairment could lessen with sustained abstinence.

Another study examined young severe binge-drinkers and found they had reduced model-based control, along with slower learning and more repetitive errors in decision-making tasks. However, a study of young male social drinkers with a looser definition of binge drinking found no differences in model-based or model-free control, nor correlations with alcohol consumption or age of drinking onset. A three-year follow-up of this group, however, showed that lower model-based control was linked to developing binge drinking over time, while increased model-free reward signals in certain brain areas correlated with higher alcohol consumption. These results suggest that the balance between model-based and model-free learning can predict treatment outcomes and drinking patterns. Two online studies also explored goal-directed control in the general population. One found a weak but significant link between lower model-based control and higher alcohol use severity, specifically related to compulsive behavior. Another online study, using a simpler task with over 800 participants, found no relationship between alcohol use and model-based control in a non-patient group.

Discussion

This review indicates that rodent studies consistently show reduced goal-directed control and increased habitual tendencies following long-term excessive alcohol exposure. A meta-analysis of over 400 animals supports a spectrum of behavior rather than a strict division between habitual and goal-directed responses, with chronic alcohol shifting this balance towards more habitual actions. Evaluating habitual tendencies could potentially serve as an indicator of an AUD-like state in animals. This meta-analysis also provides a model for improving research rigor and translating animal findings to better understand human AUD. The observed bias in responses in the meta-analysis appears unaffected by the specific method of chronic alcohol exposure, despite variations in alcohol amount, duration, and administration. Recent studies offer insights into brain changes associated with habitual responding. Research on voluntary alcohol consumption, with or without chronic intermittent alcohol vapor exposure (CIE), shows that a history of drinking causes significant, lasting brain adaptations in areas like the prefrontal cortex and dorsal striatum. CIE and renewed alcohol access intensify these changes, especially in the DMS. Studies have also identified specific brain regions, such as the dorsal striatum and amygdala, that are affected by drinking history, showing increased cell activity in high-drinking animals. Furthermore, electrophysiological studies found enhanced glutamatergic activity in the DMS and opposite effects in the DLS after intermittent alcohol access, particularly in high alcohol drinkers. While DLS adaptations returned to normal after 48 hours of abstinence, the DMS continued to show heightened excitability.

These findings indicate that different levels of alcohol exposure or consumption can lead to similar brain adaptations, with certain regions showing greater susceptibility to higher doses. Multiple studies point to the dorsomedial striatum (DMS) as a key area, particularly sensitive to higher alcohol doses and displaying long-lasting brain changes that persist even during extended abstinence. These dose-dependent, long-term neuroadaptations in rodents offer insights into how chronic drinking reduces cognitive control in humans, as observed in large population data and AUD patients.

Understanding how chronic alcohol exposure leads to habitual response biases remains a complex challenge. Numerous abnormal brain adaptations, leading to changes in brain circuits, have been documented. This discussion highlights two key mechanisms: the loss of metabotropic glutamate receptor 2 (mGluR2) function and neuroinflammation caused by withdrawal. Both animal and human studies show reduced mGluR2 levels in the mPFC after chronic alcohol exposure. This reduction impairs long-term synaptic changes critical for learning, leading to poor executive control and increased craving. Chronic alcohol also strengthens connections from the DLS, suggesting a preference for sensorimotor pathways and indicating significant changes in how different neurons control learning. Another mechanism seen in both humans and rats during early abstinence is increasing neuroinflammation, driven by microglia. This inflammation affects the spread of brain chemicals, possibly contributing to abnormal dopamine levels during prolonged abstinence. Such dopamine fluctuations can make individuals vulnerable to reduced cognitive control, leading to craving and relapse. Additionally, alcohol-induced neuroinflammation damages white matter, hindering brain communication and memory processes, which reduces cognitive flexibility. These alcohol-induced brain changes, whether specific (like mGluR2 alterations) or general (like neuroinflammation), can systematically reduce the efficiency of brain communication. This may lead the brain to use less demanding information processing methods, resulting in observed biases towards habitual responses. Importantly, these alcohol-induced changes are reversible and are promising targets for new treatments to improve cognitive function, with their effects on habitual biases currently being researched.

Specific learned associations are stored in groups of neurons called neuronal ensembles. Research shows that certain ensembles related to drug non-availability cues can suppress habitual responses to alcohol and cocaine, meaning animals can regain control over their behavior by responding to different cues, mediated by specific neurons. These ensembles form dynamic "meta-ensembles" across brain circuits, processing information as needed for efficient and flexible decision-making. The observation of overlapping cell populations controlling different behaviors, like lever pressing and maze navigation, should be further explored in the context of these dynamic meta-ensembles. Findings that distinct sets of DMS and OFC neurons are active during outcome revaluation, correlating with goal-directedness but not habit execution, highlight the need to study meta-ensembles in AUD models more closely. Encouragingly, methods to identify similar neural codes in human brains using fMRI are being developed. In humans, results are mixed, but some studies link reduced goal-directed control in AUD individuals to higher relapse risk and alcohol use severity. However, showing habitual responses in devaluation tests has been challenging in humans, possibly because human studies often use secondary rewards like money. Adapting tasks with primary rewards (e.g., juice or alcohol) could improve comparability with animal studies. Simple motor learning tasks might also reveal response biases in AUD subjects and could be easily translated to animal experiments. The "2-step task" for assessing model-based versus model-free learning also shows promise in predicting drinking behavior or relapse in humans, and its successful adaptation for rats and mice will greatly advance understanding of biased decision-making in AUD.

The limited effectiveness of laboratory tasks in revealing habitual control contrasts sharply with how commonly individuals describe their addictive behaviors, including alcohol use, as habitual. This mismatch is also seen in the weak links between self-reports and behavioral measures of the same concept in patients with substance use disorder. This divergence is common in experimental psychology, but its reasons are not fully clear. It may be partly due to the differences between controlled lab settings and the complexity of real-life experiences, which could also explain why specific lab tasks have very limited power in predicting real-world alcohol and drug-taking behaviors in humans. Despite the practical difficulties in assessing them, habitual or automatic response biases have been successfully addressed in AUD treatment. Training programs specifically designed to reduce automatic urges towards alcoholic drinks have consistently shown effectiveness in improving long-term recovery outcomes.

Conclusions and further directions

How can habitual response biases be integrated into the addiction cycle? A bias toward habitual responding might be particularly important during prolonged withdrawal and anticipation stages, raising the risk of relapse. However, during intoxication, once a relapse has occurred, compulsive mechanisms may be more significant. It is important to distinguish between compulsivity and habitual tendencies. Compulsivity refers to behavior that persists despite negative consequences, while habitual control is a temporary process dependent on context and cues. In animal studies, compulsive responding lasts longer and is less affected by settings, whereas habitual control is brief and can shift to a more adaptive mode quickly. Animal studies on alcohol behavior offer only weak support for a direct link between habitual and compulsive control. Although some overlap exists in brain circuits, compulsive drinking is strongly linked to stress and emotional regulation, particularly involving insula circuits. Therefore, contrary to common beliefs, habitual and compulsive responding are likely not on a continuum. Habitual biases may act as moderators, rather than direct causes, in the relationship between chronic alcohol use and the development of compulsivity, regardless of whether these biases are pre-existing or acquired through drug use. This question requires further investigation.

In summary, there is limited evidence for a strict division between habit and goal-directed behavior, especially regarding habits as a primary sign of impaired decision-making and loss of control in AUD, or goal-directed behavior as the solution to preventing problematic drinking. The same behavior, like an animal pressing a lever, can be driven by different control systems and potentially different brain circuits. Both goal-directed and automatic decision-making are crucial for flexible behavior: the automatic system allows for fast decisions with minimal mental effort, freeing up resources for the goal-directed system when executive control is needed in new or important situations. Consequently, these two systems likely operate in parallel and interact in complex ways, making it difficult to pinpoint their individual contributions to behavioral control.

The complexities of this relationship are not fully captured by a terminology based on a strict habit-goal division. Furthermore, in the context of addiction, the term "habit" can have negative associations and might increase the stigma for affected individuals. Instead, a more precise terminology is suggested for research, one that describes the probabilistic nature of observed response biases. Phrases such as "level of goal-directedness" or "degree of automaticity" better reflect the temporary and varying allocation of mental resources during complex decision-making processes.

Future research should explore model-free and model-based decision-making concepts, particularly in rodent models, to answer basic neurobiological questions about learning, behavioral control, and addiction. Detailed studies on the molecular and cellular aspects of dynamic decision-making, focusing on neuron groups and networks related to different levels of goal-directed responding in AUD models, will provide important insights, especially when considering factors like sex and age. Clinically, while animals with chronic alcohol exposure consistently show a lower degree of goal-directedness in reward devaluation tasks, the predictive ability of similar tests in humans, including the 2-step task, is limited. Therefore, these laboratory tests may have limited use as clinical indicators for AUD severity, progression, or treatment response. As discussed, individuals with AUD often initially use less demanding cognitive strategies, but these systematic biases alone do not fully explain compulsive drug use. The question of whether and how much individual habitual biases influence the development of compulsivity still needs further investigation within addiction theories.

Open Article as PDF

Abstract

Excessive alcohol consumption remains a global public health crisis, with millions suffering from alcohol use disorder (AUD, or simply “alcoholism”), leading to significantly reduced life expectancy. This review examines the interplay between habitual and goal-directed behaviors and the associated neurobiological changes induced by chronic alcohol exposure. Contrary to a strict habit-goal dichotomy, our meta-analysis of the published animal experiments combined with a review of human studies reveals a nuanced transition between these behavioral control systems, emphasizing the need for refined terminology to capture the probabilistic nature of decision biases in individuals with a history of chronic alcohol exposure. Furthermore, we distinguish habitual responding from compulsivity, viewing them as separate entities with diverse roles throughout the stages of the addiction cycle. By addressing species-specific differences and translational challenges in habit research, we provide insights to enhance future investigations and inform strategies for combatting AUD.

Introduction

The common saying, "old habits die hard," often describes the challenges in treating addiction, where addiction is sometimes seen as just a "bad habit." Alcohol abuse has long been a source of personal suffering and public health concerns. Currently, excessive alcohol use is a major global health crisis, affecting many people with alcohol dependence (AUD). Individuals with AUD continue to use alcohol despite harmful effects, showing a strong pull towards it. This behavior suggests a deeper issue than just a simple choice. In psychology, a "habit" is a learned behavior that becomes nearly automatic, happening without much thought about its outcome. This is different from "goal-directed" behavior, which is done specifically to achieve a desired result. This review explores the role of habits in alcohol use disorder (AUD), looking at how these behaviors are defined and studied in animals and humans, and the brain processes involved.

Animal studies

Laboratory animals can easily learn to seek rewards like food or alcohol. Alcohol, however, seems to create habits more strongly and quickly than other rewards. Studies have shown that after long-term alcohol use, animals may continue to seek alcohol even when the reward is no longer appealing, a sign of habitual behavior. This suggests that alcohol use can lead to stronger habits.

Further research explored how extended alcohol exposure affects these habits. For example, using a method called chronic intermittent alcohol vapor exposure (CIE), scientists found that alcohol made animals less sensitive to changes in reward value, meaning their behavior became more habitual. This effect was linked to changes in brain areas like the orbitofrontal cortex (OFC) and the dorsal striatum (DMS and DLS), which are important for decision-making and learning. Alcohol exposure can alter how these brain regions function, shifting behavior towards automatic responses.

Some studies found that chronic alcohol use can impair an animal's ability to engage in goal-directed actions, particularly when tasks are more complex. The age and sex of the animals also played a role, with older animals and some female adolescents showing greater tendencies towards alcohol-induced habits. This suggests that the brain's vulnerability to habit formation from alcohol can change over time.

Research also identified specific brain pathways involved in alcohol-driven habits. For instance, increased activity in certain chemical systems, such as endocannabinoid signaling in the DLS, appears to contribute to habitual responses. Blocking or changing these pathways, as well as another pathway involving mTORC1 in the OFC, has been shown to reduce habitual alcohol-seeking. These findings point to potential targets for medications that could help reverse these brain changes and improve control over alcohol use.

Finally, studies have looked at whether habitual alcohol seeking in animals predicts more severe, compulsive alcohol use. While some links were found, only a small number of animals showed truly compulsive drinking despite long-term alcohol exposure. This indicates that while alcohol can lead to habits, it doesn't always result in the most severe, uncontrollable forms of addiction.

Meta-analysis of rodent studies

A broad analysis was conducted to combine findings from multiple rodent studies, involving over 400 animals, to draw clearer conclusions about how alcohol affects habitual behavior. This analysis focused on measuring how much animals' responses changed after rewards were devalued, which indicates the level of goal-directed behavior versus habitual control.

The findings showed that alcohol treatment significantly affected how animals behaved. Specifically, animals exposed to alcohol showed less goal-directed behavior compared to those not exposed. This means their actions became more driven by habit rather than by the current value of the reward.

Importantly, the analysis suggested that chronic alcohol exposure does not cause a sudden, complete shift to habitual behavior. Instead, it leads to a gradual move towards less goal-directed actions. Various factors like the animal's characteristics, the details of alcohol exposure, training methods, or the type of reward used did not significantly alter this main finding. The results were consistent across different experiments, showing little variation between studies.

This large-scale review supports the idea that habitual and goal-directed controls exist on a spectrum. It demonstrates that prolonged or repeated alcohol exposure pushes this balance towards more habitual responses, rather than supporting a clear separation between the two types of control.

Human studies

Research on how alcohol use disorder (AUD) affects goal-directed versus habitual choices in humans is more limited. Most studies have looked at how people with AUD make decisions about everyday rewards, not specifically alcohol-related ones. These studies often use tasks that test how well a person can adjust their actions based on new information about rewards, which indicates goal-directed behavior.

Some studies suggest that individuals with AUD show impaired ability to use new information to guide their actions, indicating a greater reliance on habits. This behavioral shift is sometimes linked to different patterns of brain activity, with increased activity in areas associated with habitual behavior and less activity in areas related to goal-directed control. However, other research has found that goal-directed control remains largely intact in people with AUD.

A common task used in human studies is the "2-step task," which helps distinguish between two types of learning: "model-based" (goal-directed) and "model-free" (habitual) control. While some initial studies found reduced model-based control in AUD, later studies did not always confirm this directly. Despite mixed findings on immediate impairment, lower model-based control has been shown to predict a higher risk of relapse for people in recovery from AUD. Also, the balance between model-based and model-free control can predict future drinking habits.

Studies conducted online with large groups of people have yielded varied results. Some found a weak link between lower model-based control and more severe alcohol use, especially related to compulsive behaviors. Others found no connection between model-based control and alcohol use in the general population.

Overall, the human research provides mixed evidence for increased habitual tendencies in AUD. Differences in study methods make direct comparisons challenging. However, the tasks that assess model-based control show promise in predicting relapse risk and drinking patterns in individuals with AUD. There is a need for more research using tasks that are more comparable to animal studies, such as those involving primary rewards like alcohol itself.

Discussion

Studies in rodents consistently show that long-term, excessive alcohol use leads to less goal-directed behavior and more habitual actions. This supports the idea that habitual and goal-directed control exist on a sliding scale, with chronic alcohol pushing the balance towards habits. This shift in behavior appears regardless of how the alcohol was given, suggesting that certain brain changes are common across different types of alcohol exposure.

Chronic alcohol use causes several important changes in the brain. For example, it reduces the function of a specific receptor (mGluR2) in the brain area responsible for decision-making, leading to poorer control and increased cravings. Alcohol also causes brain inflammation, which can affect dopamine levels and damage the brain's communication pathways. These changes can make the brain less efficient at processing information, leading to a greater reliance on less demanding, habitual ways of behaving. These brain changes, however, may be reversible, offering new targets for treatment.

Neurons in the brain can form specific groups, or "ensembles," that are active during certain behaviors. Research has shown that even when an animal is showing habitual drug-seeking, another group of neurons in a different brain area can regain control over behavior, allowing for a more flexible response. This suggests that the ability to control behavior might still be present, even if hidden, and encourages more detailed study of how these brain cell groups work together in addiction.

In humans, the evidence for increased habitual tendencies in AUD is less clear and often shows mixed results. This might be because human studies often use money or points as rewards, which are different from primary rewards like alcohol itself. There is also a gap between what lab tasks show and how people describe their addiction as "habitual" in everyday life. This suggests that current lab tests may not fully capture the complex reality of addiction.

Despite the challenges in measuring habitual control in the lab, approaches that aim to reduce automatic responses to alcohol cues have proven effective in AUD treatment. This means that even if the science is still developing, practical treatments can help people gain more control over their drinking behavior.

Conclusions and further directions

Habitual response biases appear to be particularly important during the recovery and anticipation phases of addiction, as they can increase the risk of relapse. It is important to distinguish between "compulsivity," which means continuing a behavior despite negative outcomes, and "habitual tendencies," which refer to momentary, automatic responses often linked to specific situations or cues. These two behaviors are likely separate, rather than existing on a single spectrum. Habitual biases may influence, but do not directly cause, the development of compulsive drinking.

The idea of a strict separation between habitual and goal-directed control may not fully capture the complexity of decision-making in addiction. Both types of control are necessary for flexible behavior: the automatic system allows for quick decisions, freeing up mental resources for the goal-directed system when more thought is needed. These two systems likely work together, making it hard to pinpoint their exact contributions.

To better describe these processes, it is suggested that researchers use terms like "level of goal-directedness" or "degree of automaticity." These phrases more accurately reflect how mental resources are temporarily used in complex decisions, avoiding the negative connotations often associated with the word "habit" in addiction.

Future research should explore "model-free" and "model-based" decision-making in both animal and human studies to better understand how learning, behavioral control, and addiction work at a brain level. It is also crucial to study how specific groups of brain cells are involved and to consider factors like sex and age.

Currently, the ability of laboratory tests to predict the severity, progression, or treatment response of AUD in humans is limited. While individuals with AUD may initially use simpler, less demanding thought processes, these tendencies alone may not fully explain compulsive drug use. Overall, the evidence supports a gradual view of addiction, where external factors like stress can shift the balance of decision-making. Improving our understanding of these processes can lead to more effective research and treatments for AUD.

Open Article as PDF

Abstract

Excessive alcohol consumption remains a global public health crisis, with millions suffering from alcohol use disorder (AUD, or simply “alcoholism”), leading to significantly reduced life expectancy. This review examines the interplay between habitual and goal-directed behaviors and the associated neurobiological changes induced by chronic alcohol exposure. Contrary to a strict habit-goal dichotomy, our meta-analysis of the published animal experiments combined with a review of human studies reveals a nuanced transition between these behavioral control systems, emphasizing the need for refined terminology to capture the probabilistic nature of decision biases in individuals with a history of chronic alcohol exposure. Furthermore, we distinguish habitual responding from compulsivity, viewing them as separate entities with diverse roles throughout the stages of the addiction cycle. By addressing species-specific differences and translational challenges in habit research, we provide insights to enhance future investigations and inform strategies for combatting AUD.

Introduction

The idea that 'old habits are hard to break' seems true for treating addiction. Many people think of addiction as simply a 'bad' habit. Even long ago, people believed that drinking too much could lead to a habit that was hard to stop. Throughout history, heavy drinking has caused sadness for individuals and big problems for public health.

Today, drinking too much alcohol is still a major public health issue, causing about 5 out of every 100 diseases worldwide. Alcohol dependence affects about 2.6 out of every 100 people aged 15 and older around the world. In many richer countries, this number is even higher. Heavy drinking can cause serious health problems and greatly shorten a person's life.

Alcohol dependence often means a person keeps choosing alcohol even when it causes bad results. It is like they cannot stop, even when they know it hurts them. This strong urge to keep drinking suggests that it is more than just a simple choice. The question is, is alcohol dependence just a bad habit?

A habit is often seen as a settled way of acting or a usual behavior. It can also mean a learned behavior that happens almost or completely without thinking. In simple science, habits are seen as learned links between a trigger (like a sight or feeling) and an action, where the action happens without much thought about the outcome. On the other hand, 'goal-directed' behavior is done on purpose to get a certain result. Many ideas about addiction say that when bad habits take over, a person loses control over seeking and using drugs.

This review explains what habits are and how they are studied in animals and people. It also talks about what is known about how the brain handles habits and goal-directed actions. The review then looks at studies on how alcohol use affects habits and goal-directed behavior. Most studies suggest that both automatic and thoughtful ways of making choices are part of how people decide things. When the brain leans too much towards automatic actions, it can lead to and keep alcohol dependence going.

Animal studies

Lab animals can easily learn to press a lever to get rewards, like food or alcohol. After many tries, animals may drink more alcohol. Scientists want to know if this increase means they are losing control and if habits play a part.

Studies with animals have looked at two main things: Does alcohol create habits more strongly than food? And, what happens to habit control after long-term or heavy alcohol use?

One early study found that alcohol-seeking habits were harder to break than food-seeking habits in rats. Other studies also showed that alcohol could create strong habits faster than other rewards. Long-term alcohol use was found to make animals less sensitive to changes in the value of their reward, meaning their habits were stronger. Moderate alcohol use, even without being drunk, was linked to stronger habits.

Some studies showed that certain parts of the brain are important for these changes. For example, damage to one part of the brain (the dorsomedial striatum or DMS) made habits form faster. Damage to another part (the dorsolateral striatum or DLS) helped bring back goal-directed behavior. Alcohol also changed how brain cells work in these areas.

Different studies sometimes had different results. For example, some studies found that past alcohol use made it harder for rats to make goal-directed choices, especially when things were tricky. Other studies found no change. The amount of alcohol animals drank might explain these differences. Also, a history of heavy drinking could make animals use habit-based actions more when their minds were under more stress.

Scientists have also looked at how age and sex affect alcohol-related habits. Some studies found that older male animals and younger female animals might be more likely to form habits after alcohol exposure. Alcohol's effects on the brain seem to make different types of behaviors more automatic, not just actions for rewards.

Scientists are also looking for ways to stop these strong habits. Some brain chemicals and pathways seem important. For example, blocking certain signals in a brain area (DLS) reduced habitual alcohol seeking. Another substance (rapamycin) injected into a different brain area (OFC) also reduced habits in rats that drank a lot. These findings suggest new ways to treat alcohol dependence.

Some studies tried to see if strong habits could predict addiction-like problems. One study found that only a few rats showed signs of addiction, and strong habits did not always predict this. Another study suggested that habits might predict problems with compulsive drinking, but again, not for most animals. This means that even with long-term alcohol use, only a small number of animals develop severe addiction-like behaviors.

Meta-analysis of rodent studies

Scientists combined results from many animal studies to get a clearer picture of how alcohol affects habits. This is called a meta-analysis. They looked at 10 studies with 17 separate experiments, involving over 400 animals.

The meta-analysis showed that alcohol use significantly changed animal behavior, making them more likely to act out of habit. When animals were given alcohol, their goal-directed behavior, which means acting with a specific purpose in mind, went down. This was not an all-or-nothing change, but a gradual shift toward more automatic actions.

This combined research suggests that when animals have long-term alcohol exposure, they shift toward more habitual ways of acting, rather than showing a clear switch between habit and goal-directed control. This finding remained strong even when considering different factors like the type of animal, how much alcohol was given, or the kind of reward used.

Human studies

Nine studies looked at how goal-directed and habitual choices are balanced in people with alcohol dependence or those at high risk. Most of these studies used tasks where people made choices in steps to see if they were learning from the outcome (goal-directed) or just repeating actions (habitual).

One study found that people with alcohol dependence had trouble with goal-directed thinking. This meant they relied more on habits when learning new tasks. Brain scans showed different brain parts were active in people with alcohol dependence compared to healthy people. However, how long a person had alcohol dependence did not seem to be linked to these brain or behavior changes.

Some studies used tasks that did not involve alcohol directly but used other rewards like money. The results were mixed. Some studies found that people with alcohol dependence had less goal-directed control. This lack of control was sometimes linked to a higher risk of relapsing (starting to drink again). Other studies did not find these clear differences.

One study found that young people who drank a lot showed less goal-directed control. But another study with many young male social drinkers found no difference in their habit or goal-directed control. However, a follow-up of these young men showed that those with less goal-directed control were more likely to develop heavy drinking over time.

Online studies also had mixed results. One study linked less goal-directed control to more severe alcohol use, especially related to compulsive behaviors. Another online study found no link between alcohol use and goal-directed control in people who were not patients. Overall, human studies on habits and alcohol dependence are not always consistent, partly due to different ways of testing. However, some studies suggest that a lack of goal-directed control could predict if someone will relapse.

Discussion

Studies on animals consistently show that long-term, heavy alcohol use reduces goal-directed control and increases habitual actions. The combined analysis of these studies confirms this, showing a gradual shift towards more habitual ways of acting, not a sudden change. This suggests that how strong habits are could be a sign of alcohol dependence in animals.

The bias towards habitual actions seems to happen regardless of how animals are exposed to alcohol (amount, time, or way of getting it). This means that different ways of heavy drinking can cause similar brain changes that lead to habits. Specific brain changes, called neuroadaptations, are believed to cause these problems.

Two key brain problems linked to long-term alcohol use are a loss of function in a certain brain signal (mGluR2) and brain swelling or inflammation during withdrawal. These changes make it harder for the brain to learn and control actions. They can make the brain rely more on simpler, faster ways of processing information, which leads to more habitual responses. The good news is that these alcohol-caused brain problems can be reversed, offering new ways to treat alcohol dependence.

Scientists are also looking at how groups of brain cells work together to control habits. They are finding that even when habits are strong, the brain can still regain control if given the right signals. More research is needed to understand how these brain cell groups change with alcohol use.

In humans, the results are less clear. Some studies suggest less goal-directed control in people with alcohol dependence, which might be linked to relapse. However, it is hard to show strong habits in humans in the lab using certain tests. This might be because human studies often use money as a reward, not real alcohol. Also, there can be a big difference between what people say about their habits and what they do in lab tests. Even so, treatments that help people change their automatic leanings towards alcohol have been successful in helping them stay sober long-term.

Conclusions and further directions

A tendency towards habitual responses can be very important in alcohol dependence, especially during withdrawal and when a person is tempted to drink again, increasing the risk of relapse. It is important to know that simply having a habit is different from compulsivity. Compulsivity means doing something repeatedly despite bad results, while habits are more about automatic actions that happen in certain situations. While some brain areas overlap, compulsivity is strongly linked to stress and feelings, not just habits. Habits might make compulsivity worse, but they are not the same thing.

There is not strong support for a strict "habit versus goal" idea where one is always bad and the other always good. Both goal-directed and automatic decision-making are needed for a person to act flexibly. The automatic system helps with quick choices, saving brain power for harder situations where careful thought is needed. These two systems work together in complex ways, making it hard to tell exactly how much each contributes to behavior.

The word "habit" for addiction can sometimes make people feel bad about themselves. It might be better to use terms like "level of goal-directedness" or "degree of automaticity" to describe how the brain uses its resources when making choices.

Looking ahead, future research should explore how the brain makes automatic versus thoughtful decisions, especially in animal models, to understand how learning and addiction work. Studies should also look at how these decision-making processes happen in groups of brain cells and consider factors like sex and age.

For people, while lab tests show that alcohol can reduce goal-directedness in animals, these tests are not always good at predicting how severe a person's alcohol dependence is or how they will respond to treatment. However, treatments that target and change automatic urges for alcohol have shown good results. The available information supports the idea that addiction is a complex problem involving biology and psychology, and that decision-making is a gradual process, not just one or the other. Understanding how people make choices and why they might lean towards automatic responses offers new hope for research and treatment of alcohol dependence.

Open Article as PDF

Footnotes and Citation

Cite

Giannone, F., Ebrahimi, C., Endrass, T., Hansson, A. C., Schlagenhauf, F., & Sommer, W. H. (2024). Bad habits–good goals? Meta-analysis and translation of the habit construct to alcoholism. Translational psychiatry, 14(1), 298.

    Highlights