Abstract
Keywords
Introduction
Myotonic dystrophy type 1 (DM1) is an autosomal inherited, multisystemic disorder exhibiting abnormal muscular, cardiac and brain phenotypes. It is caused by a (CTG⋅CAG)n-repeat length expansion within the non-coding region of
Despite the progress made in understanding of mechanisms underlying muscle dysfunction in DM1, neuromuscular complications are not the only symptoms impacting on the patient's quality of life. Cognitive symptoms (including reduced motivation, decreased IQ (Intelligence Quotient and cognitive inflexibility) allied to insomnia and fatigue have a considerably more pronounced clinical impact in DM1. Congenital and childhood-onset DM1 cases exhibit high severity of symptoms, as such, mild to severe mental retardation is often reported in affected children. 3 DM1 neuropsychological profile is characterized by increased anxiety,4,5 depression,4,6 and anhedonia,7,8 while obsessive-compulsive (OCD) and autistic traits also feature prominently in this disease.9,10
To model cognitive changes in DM1, we studied the DMSXL mouse model. DMSXL mouse is the only animal DM1 model which carries more than 1300 CTG repeats in a human
In order to assess behavioral flexibility and rule-reward conditioned learning in DMSXL mice, a series of visual discrimination and reversal learning experiments were conducted, followed by a paradigm where stimulus-reward associations were extinguished (appetitive extinction). The experiments were performed in a touchscreen-based operant chamber which mimics human tablet-based cognitive testing methods, such as CANTAB,13,14 and enables translational comparisons relevant to rule learning impairments such as in OCD and autism. Reversal learning evaluates the ability to switch behavioral strategy in response to a shift in stimulus-reward relationships. In this case – coupling of the reinforcing reward with a previously non-rewarded stimulus. It is expected that the DMSXL mouse model displays reversal learning abilities analogous to how patients with OCD symptoms perform worse than healthy individuals in reversal learning tasks.15–17
In the current study, we examined the link between DM1 and compulsivity-like behavior using the DMSXL mouse model. In short, we sought to test male and female DMSXL mice and their wild type controls on 2 operant behavioral tests for cognitive flexibility and perseveration of behavior (namely reversal learning and the extinction). Breeding of these mice is associated with high mortality so we tried to assess equal number of each sex but this was not possible. As such, we examined both sexes and statistically checked to see if sex resulted in different behavioral outcomes. This study aimed to assess the differences in reversal learning and extinction performances between DMSXL and WT mice and to interpret these findings in the light of the reported behavioral inflexibility in human DM1.
Materials and methods
Mice
The DMSXL mice (>90% C57BL/6 background) carried 45 kb of human genomic DNA cloned from a DM1 patient as previously described (Seznec et al., 2000). DMSXL mice and control wild type (WT) mice on C57/BL6 background (n = 12 DMSXL mice, 8 male and 4 females and n = 12 wildtype mice, 8 male and 4 female) derived from breeding pairs obtained in collaboration with the group of Dr Gourdon (Centre de recherche en myologie, Paris, France). Equal numbers (
Genotyping DMSXL mice
DNA was isolated from mouse ear clips using a Proteinase K treatment, followed by a isopropanol precipitation and a 70% ethanol wash. Two PCR analysis for genotyping were performed, i) one specific PCR for the human DMPK exon 3, to distinguish WT and heterozygous mice, ii) one PCR for the mouse Fbx7 gene to distinguish homozygous and heterozygous mice. Since earlier research (not yet published) showed integration from the total 45 kb insert in the human Fbx7 gene. When the insert is incorporated at both alleles in homozygous mice, using primers flanking the fragment, this will deliver no PCR product due to the large size of the insert. For the human DMPK exon 3 PCR we used next primers: Waltgour forward: 5′-GACAGTCCTAGGGTGAAGC-3′, and Waltgour reverse: 5′-TACC-TGAGGTCGAGATAGTGA-‘3. Amplification was performed using Taq2000 polymerase. The Fbx7 gene PCR was performed with the following primers: Fbx7 forward: 5′-CACCTATCCTGCCATCCC-TGTC-3′ and Fbx7 reverse: 5′-GGGACATCAGAATGG-GACATCTGC-3′.
Food restriction and reward habituation
After the mice had completed the weaning period, they were transferred to cages for single housing and allowed 2 weeks of acclimatization to the new environment and were daily handled prior to the start of the experiments. Animals were put on a mildly restricted but not harmful diet, the animal weight loss was under strict control (85–95% of their free-feeding body weight). Prior to the start of the experiment, the food reward (strawberry milkshake, Inex N.V., Bavegem, Belgium) was introduced to the home cages in a small bowl for 2 consecutive days.
Apparatus
Experiments were conducted in a bright room, with dim light in the boxes. The order of mice chosen for testing was randomized, however, the animals were tested in the same order during all experiments. All training was performed in Bussey-Saksida Mouse Touch Screen Chambers (Campden Instruments Ltd, Loughborough, UK) using Whisker Server (Cardinal and Aitken, 2010) and ABET II Touch software (Lafayette Instruments, Lafayette, IN, USA). These sound-attenuating, trapezoid chambers consist of fibreboard walls, a perforated stainless steel floor and a touchscreen behind a black perspex mask containing 2 – in the case of visual discrimination and reversal (VDR) – or 3 – in the case of extinction – horizontal apertures. Opposite to the touchscreen is a reward collection magazine connected to a liquid dispenser pump containing the same strawberry milkshake food reward (described above).
Pre-training
Mice underwent daily (5–7 times per week) training sessions, with each session lasting for a maximum of 60 min or 30 sessions (whichever was reached first). Pre-training of the animals consisted of five different stages: habituation to the touchscreen chambers, initial touch training (ITT), must touch training (MTT), must initiate training (MIT) and punish incorrect training (PIT), see Figure 1. 18 This pre-training phase gradually shaped mice's behavior to respond to a stimulus presented in either one of the 2 response windows, which was reinforced by delivery of a liquid reward to the food magazine. The stimuli were randomly selected from an image library of 40 varied black and white shapes.

Schematic diagram of the experimental workflow to delineate the pretraining.
Visual discrimination and reversal (VDR) learning paradigm
After pre-training, mice were transferred to the visual discrimination and reversal-learning task. During this task, a typical session was initiated by a nose poke into the reward collection magazine (after magazine illumination), following which a stimulus is presented within one of the response windows. Correct response to a stimulus caused reward delivery (800 ms pump activation), stimulus offset, activation of the magazine light and a brief (1 s, 3 kHz) audio stimulus (conditioned reinforcers). After reward was retrieved, a 20-s inter session interval took place. Touching the unrewarding response window during stimulus presentation was discouraged using correction sessions, which encompassed representation of the same stimulus in the same location. 19
During the acquisition phase of the visual discrimination task, either a picture with the “Fan” or the “Marbles” functioned as a conditioned stimulus (CS+) whereas the other one was the unconditioned stimulus (CS-). Both the DMSXL and WT mice tested were able to distinguish between the 2 stimuli (Figure 2).

During the acquisition phase of the visual discrimination task, either a correct (conditioned rewarded) image with “fan” is on the right side or the incorrect (non-rewarded) image “marbles” is on the left side. “Fan” functioned as a conditioned stimulus (CS+) whereas the “Marbles” functioned as an unconditioned stimulus (CS-). For the Reversal learning the side on which the images were shown was randomized and the previous CS- now became the CS + and vice versa,
Appetitive extinction
After conclusion of the VDR paradigm, animals underwent an acquisition phase of the extinction paradigm (Figure 3). During this stage, animals were trained to respond to a rewarding white stimulus (presented in the center location of the touchscreen using a mask with three horizontal apertures). Once an animal reached the performance criterion (performing 30 sessions within 13 min for 5 consecutive days) it was moved to extinction phase of the experiment, in which the previously rewarded stimulus becomes unrewarded. A stimulus was presented for ten seconds or shorter (if the animal responded to it). The mice's performances during this phase were recorded for at least 10 consecutive days.

(A) in the acquisition phase, the animals are trained to collect a reward (strawberry milkshake) after correctly placing a nose poke to a stimulus (white square) and reach a stable performance criterion: 80% correct responses. (B) In the extinction phase, the stimulus (white square) is no longer rewarded.
Data analyses
During the acquisition phase of extinction paradigm, the total number of responses and number of correct responses were recorded, whereas during the extinction phase the number of omissions was recorded. Latencies for correct and incorrect responses, as well as reward collection were analyzed throughout the reversal and extinction sessions after outlier correction (where applicable). For outlier correction the Grubbs’ test was applied.
20
The data was tested for normality using the Shapiro-Wilk test, after which the respective parametric or non-parametric analyses were performed. For all outcomes from mice performances in VDR behavioral tests and extinction paradigms, we analysed significance with a repeated measures ANOVA including genotype, time and the genotype*time interaction as factors and sex as covariant. We report the p-values for differences between the wild-type (WT mice) and DMSXL knock-in genotypes (DMSXL mice). The data in bar diagrams are displayed as Area Under the Curve (AUC) with mean ± SEM with
Results
Phenotype
DMSXL mice are known to have a developmental disturbance resulting in decreased body weight in animals at the same age / developmental time point as compared to their littermate controls, in this research body starting body weights were not measured. Due to the high mortality rate observed in homozygous mice shortly after birth, the use of males and females were not in the same proportion (males n = 8; females n = 4), since this amount of animal sexes were available from the breeding. We checked if the results could be combined across sex and since the data from the females was not significantly different from the males, we sought to report the combined groups. Unfortunately, during the behavioral performance in acquisition of extinction, for measuring the number of blank touches, one DMSXL female mouse was taken out the test (right after session 12) due to healthy problems, later this mouse died. As consequence the mouse number for the DMSXL group became n = 11, this is mentioned in the behavioral performance tests; acquisition of extinction and extinction (Figures 4 and 5 respectively).

Percentage of correct touches to rewarded visual stimuli over a series of sessions in reversal phase of VDR, each bar represents an n = 12 per strain. (A) VDR reversal task is subdivided into an early phase – < 50% of correct touches, and late phase – ≥ 80% of correct responses, following reversal. No significant difference between DMSXL mice compared to WT mice (69.63 ± 7.80 vs 67.40 ± 7.32), (F = 1.007 (1,21),

Latencies for reward collection in DMSXL and WT mice throughout VDR reversal phase. DMSXL mice display increased reward collection latencies across sessions compared to WT in a consistent manner with AUC values (mean ± SEM of 1.309 s ± 0.058 s vs 1.100 s ± 0.031 s), (F = 16.217 (1, 20),
Pretraining
During pretraining, the DMSXL and WT mice needed a similar number of sessions to reach the criterion for each stage, indicating that they learned to correctly operate the touchscreen at a similar rate.
Visual discrimination and reversal learning (VDR)
In the experiment both WT and DMSXL mice were able to distinguish between the 2 visual stimuli and displayed a consistent increase in proportion of the correct touches (reaching 90% accuracy) during the acquisition phase of the VDR. Both groups showed performance improvements over time with no significant difference between the two. Once all the mice have demonstrated sufficient acquisition to learn the paradigm (demonstration of >80% correct in >5 subsequent sessions), the experiment moved into the reversal phase. In the VDR trials 3 phases can be defined, namely the entirety phase consisting of session 1–16, the early phase consisting of session 1–4, and late which consist of session 8–16. Unfortunately, some recordings from the VDR, like the last 4 sessions of the extinction, for measuring the number of omissions, were lost due to technical failure, due to lower power. In the corresponding figure legend (Figure 4C) the appropriate number of trials is explicitly mentioned.
During reversal, both animal groups clearly exhibited the ability to learn and switch to a new reinforced stimulus with no consistent difference between the two groups (Figure 4A). Although both DMSXL and WT mice were able to switch strategy and learn the reversal paradigm (as shown by no difference in % of correct responses in the early phase of reversal (< 50% correct responses)), in the late reversal phase (at >80% correct responses) – DMSXL mice displayed a higher % of correct responses compared to WT (Figure 4B).
No statistical difference was noted either between genotypes or across sex in the number of correction sessions in both the acquisition and reversal phase. As shown in Figure 4E, the DMSXL mice made less mistakes, as shown by a decreased number of total errors in reversal phase in DMSXL mice compared to WT mice.
The latencies of mouse responses to stimuli related to correct touches, blank touches as well as reward collection were determined throughout both acquisition and reversal phases. Following reversal, DMSXL mice required more time to correctly respond to a stimulus in comparison to WT (Figure 4D), in the early phase of the reversal learning (Figure 1F) but not seen in the late phase (Figure 4H), suggesting that the DMSXL mice are initially slower in learning, following the switch of the rewarded stimulus. Since there is a significant effect associated with sex during early and overall reversal learning phase, we checked if there was any statistical difference between sexes underlying the difference between WT and DMSXL mice in other metrics by including sex as covariate in the analysis. No significant difference was found between WT male versus female and DMSXL male versus female in the other metrics reported.
From Figure 4A, it appears that the late phase (session 8–16) is significant different, therefore, a post hoc analysis for comparison between DMSXL and WT within the sessions is performed with an independent t-test, see Table 1 in the supplementary materials.
In the reversal learning test, after multiple testing, there were no significant differences between WT and DMSXL mice in correct responses in each session. However there is a visible trend showing that the DMSXL mice perform better compared to the WT mice.
Throughout the reversal, DMSXL mice displayed considerably higher reward collection latencies than WT (Figure 5), possibly implying a generally decreased motivation to obtaining the reward.
Appetitive extinction
During acquisition phase within the appetitive extinction paradigm both mouse groups reached the criterion in an equivalent number of sessions. However, DMSXL group was observed to perform more blank touches than the control group (Figure 6A). Moreover, DMSXL mice compared to WT mice, displayed considerably higher correct touch latencies (Figure 6B) as well as reward collection latencies (Figure 6C).

Measures of behavioral performance in acquisition of extinction in DMSXL and WT mice. Statistics, for Figures A, B and C, was performed with mixed model ANOVA with sex as covariates. The bars for WT mice represent means ± SEM with n = 12 per group and for DMSXL mice n = 11 per group. (A) Number of blank touches made by DMSXL mice in extinction acquisition phase is consistently significantly higher, results with AUC values (mean ± SEM of 12.680 ± 2.310 vs 3.404 ± 0.485), (F = 9.872 (1,20),
Following acquisition phase, in extinction DMSXL mice produced significantly less correct touches than WT (Figure 7B). An opposite trend was observed in omissions (Figure 4C) and correct touch latencies (Figure 7D), where DMSXL showed an overall lack of interest in performing the task.

Percentage of correct responses to non-rewarded visual stimuli, number of omissions and correct touch latencies over a series of sessions within the extinction task. For all 4 panels, WT mice represent means ± SEM with n = 12 per group and for DMSXL mice n = 11 per group. For panel 7A, statistics was performed with repeated measures ANOVA. For panels 7B-7D, statistics was performed with mixed model ANOVA with sex as covariates. The bars for WT mice represent means ± SEM with n = 12 per group and for DMSXL mice n = 11 per group. (A) Temporal view of the percentage of correct touches performed by DMSXL and WT mice during the extinction task. DMSXL consistently performed less correct touches than WT in each of the consecutive session. (B) Area under the curve (AUC) bar-chart representation of graph A, displaying a significant difference in performance averages between DMSXL mice and WT mice, results with AUC values (mean ± SEM of 26.04 ± 1.750 vs 36.09 ± 1.614), (F = 17.669 (1,20),
For the breakdown of the different types of responses and percentages, including the total number of sessions, is presented in Table 2 in the supplementary materials.
Discussion
Behavioral flexibility of DMSXL mice was analyzed in order to evaluate the presence of compulsive-related behaviours which are reported to be altered in human DM1.
12
The DMSXL mouse model does have the limitation that the site of transgene insertion (reported by unpublished data to be at the Fbx7 locus) which may impact the findings. This possibility is not controlled for in the design used here given the difficulty in breeding sufficient numbers of DMSXL mice and as such this is a pilot behavioural study. Despite these caveats, the current study sought to extend our knowledge of the DMSXL phenotype using two different behavioural tasks which evaluate different aspects of compulsive behaviour namely the ability to learn and switch strategy (reversal learning) and the ability to inhibit action following a change in contingency from rewarded to non-rewarded (extinction). The abundance of expanded (CTG)n repeats in
Our data suggest that both DMSXL and WT mice can switch and learn a new behavioral strategy, suggesting that cognitive flexibility is intact in both groups, as shown by similar correct response performances between DMSXL and WT mice in the early phase of reversal learning. It appears, however, that while both groups can successfully acquire the rule, DMSXL mice were able to automate it to a higher degree as shown by an increased percentage of correct touches in the late phase of reversal learning compared to WT mice. This has been suggested to indicate increased habit formation
21
and may be interpreted as perseverative-like behaviour in DMSXL mice. If the mice make a mistake in their response, they have the opportunity to perform a correction session,
The DMSXL mice post-reversal demonstrate a high level of behavioral perseveration compared to WT controls and suggest deficits in rule learning and switching. Perseverative behaviours are one of the prominent clinical features of OCD and autism which are also present in DM1 patients. 22 As such they reflect cognitive inflexibility in affected individuals and validates these aspects of the clinical phenotype as also being present in DMSXL mice. To be clear our data suggest that DMSXL mice shown no difference in the early phase of reversal learning but perseverative behaviours in the late phase of reversal learning which may reflect reliance on habits formed.
Furthermore, the correct touch latencies after reversal differ between DMSXL and WT mice with DMSXL group taking significantly more time to respond overall and in the early phase of the reversal learning (sessions 1–4), whereas in the late phase (sessions 8–16) no such difference is observed. This may imply that DMSXL mice are more careful learning the task, therefore taking them increased amounts of time to make a choice. Once the task has been learned however, DMSXL mice are able to perform automated responses at similar latencies to WT, thereby indicating that the difference in the early phase is not related to motor impairment. (Moreover no gross motor deficits were seen in the DMSXL mice compared to WT controls.). There are no differences across sex in the correct touch latencies following reversal overall while in the early phase (sessions 1–4), females tend to be slower in making correct responses than their genotype matched male equivalent. Whether this represents more a greater degree of care in the females before making the response or a relatively higher degree of impulsivity in the males is not clear, but this is short-lived as no sex associated differences are seen when the reversal learning is sufficiently acquired. An alternative explanation for the data seen could be that the DMSXL mice are trading latency for accuracy on these tasks in the early phase; a strategy that may be an adaptation or even beneficial.
Interestingly, the latencies to collect the reward following reversal are higher in DMSXL mice compared to WT. This may imply a decrease in motivation to obtain the reward in DMSXL mice, as we have previously concluded this increase in latency is unlikely to be related to locomotion. Alternatively, DMSXL mice may be slower to turn: since the reward chamber is on the opposite side to the touchscreen of the chamber, mice must make an effort to turn 180 degrees away from touchscreen to collect the reward. It is possible that given the cerebellar synaptic plasticity deficits known to occur in DMSXL mice, 23 that these mice have balance and motor coordination deficits which may impair their reward collection latency which was not observed on a gross motor coordination level. Clinically, it is known that impaired balance, stumbles and falls happen daily in DM1 patients, which may be associated with impaired cerebellum functioning.
In summary, DMSXL mice tend to perform just as well or better than WT controls in reversal learning (increased percentage of correct touches, reduced number of errors, similar correct touch latencies). The only differences in reversal learning were in slightly higher number of correct touches number in the late phase and a slower ability to collect rewards (which may reflect apathy-like behaviour) in DMSXL mice compared to their WT controls.
With regards to the appetitive extinction task, it is clear that the DMSXL mice show an increased number of blank touches and increased correct touch and rewards latencies. This may reflect that the DMSXL mice make more mistakes in the initial learning of this task and are also slower to make correct responses. Similar to the reversal learning task, they also demonstrate increased reward collection latency which may reflect a lack of motivation (apathy-like behaviour) to collect the reward. This lack of selectivity for the stimulus - reward relationship may also underlie the increased number of non-rewarded blank touches in DMSXL mice compared to controls.
The extinction task examines the act of decoupling a previously established stimulus - reward relationship, such that the stimulus is no longer rewarded. During extinction, DMSXL mice make fewer correct touches than WT mice but also make fewer correct touches very rapidly (already within the first few sessions) which means they rapidly stop a non-rewarded action. This suggests heightened sensitivity to the value of a reward.
The reduction in the number of correct touches made by the DMSXL mice compared to WT controls may also suggest that DMSXL mice are less motivated to make a correct response in the absence of a reward and can successfully inhibit a non-rewarded strategy. The increase in number of omissions in DMSXL mice also indicates that the DMSXL mice are unwilling to make any extra effort in the absence of reward, further supporting the idea of decreased motivation in these mice. A more thorough investigation of the motivation for reward in DMSXL mice could involve their testing on a progressive ratio schedule task in the future. This decrease in motivation, if confirmed, is similar to the apathy reported in DM1 patients. 24 Moreover, no effect of sex was observed in any of the extinction learning paradigm metrics suggesting any differences are related to the genotype alone.
In our behavioral analysis, DMSXL mice are evidently slower in learning novel tasks, as observed by the latency increase to perform a correct action each time a new contingency is administered, compared to WT mice. The DMSXL mice show an increased latency to perform an action following a change in contingency. This has also been reported by others in the context of novelty-induced inhibition in the open field in DMSXL mice, suggestive of increased anxiety. 12 This freezing / slower response behaviour when challenged with an unfamiliar environment / stimulus is also seen in the visual discrimination reversal and extinction paradigms as well as in the first few sessions of all new contingencies.
In both reversal learning and extinction tasks, it is clear that DMSXL mice show increased reward collection latencies across both tasks suggestive of decreased motivation compared to their WT controls. This is important as it shows a decreased incentive to perform an action for a sweetened reward. This is in concurrence with earlier data reporting differences in the saccharine preference test between DMSXL and WT mice in which DMSXL mice consumed less saccharine compared to WT mice which is a proxy of anhedonia like behaviour. The sensitivity for saccharine sweetened reward was less in the DMSXL mice compared to WT controls.
12
With regard to rule learning and rule switching, DMSXL mice are slower to learn but perform better in reversal learning and appetitive extinction compared to WT controls. Only an increased number of correct touches in the late phase of reversal learning may reflect perseverative behaviour. Taken together, this suggests that the DMSXL mouse does not necessarily reflect the reported OCD and autistic traits seen in clinical DM1 groups but may reflect the apathy seen in these patients. Initial learning deficits observed in the DMSXL mouse model may reflect similar behavioral phenotypes seen in DM1 such as cognition and motor learning impairments. Homozygous DMSXL mice reflect more the congenital form of DM1 than the adult form.
25
The further validation of the DMSXL model is warranted in particular to evaluate the motivational aspects of the DM1 phenotype Additional behavioural testing with confirmatory experiments to extend the findings of this pilot study would be prudent. In particular, expanding the studies with an increased number of animals (to fully assess sex differences), and explore the motivation and other cognitive findings in these studies with confirmatory tests such as the progressive ratio schedule task. Ideally an interventional test using an anti-sense oligonucleotide against DMPK would also be included in order to evaluate whether these cognition and motivational aspects change with decreased DPMK expression. The inclusion of additional controls including the parental lines with smaller CTG repeat numbers in mutant DMPK would be helpful as well to help build confidence that the findings are attributable to the CTG repeat expansions and not the insertion site of the transgene
Furthermore, it would be interesting to characterize the molecular mechanisms underlying the motivational changes in the phenotype of the DMSXL mice in order to evaluate whether strategies targeting them could rescue the ‘apathy-like’ behavioral phenotype documented here. In this regard, the study to identify transcriptomic and protein markers in brain regions involved in motivation and reward valuation,
Abbreviations
Myotonic dystrophy type I
dystrophia myotonica-protein kinase
obsessive compulsive disorder;
visual discrimination and reversal;
human
wild type
area under the curve
central nerve system
not significant
seconds
Supplemental Material
sj-docx-1-jnd-10.1177_22143602251339350 - Supplemental material for Altered reversal and extinction learning in the DMSXL mouse model of type I myotonic dystrophy (DM1): An exploratory study
Supplemental material, sj-docx-1-jnd-10.1177_22143602251339350 for Altered reversal and extinction learning in the DMSXL mouse model of type I myotonic dystrophy (DM1): An exploratory study by Sylvia Nieuwenhuis, Denys Kozakov, Kasia Kapusta, Geneviève Gourdon and Jeffrey C Glennon in Journal of Neuromuscular Diseases
Footnotes
Acknowledgements
Funding
Declaration of conflicting interests
Data availability statement
Supplemental material
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
