For some years we have puzzled over the low working memory (WM) performance in some undergraduate participants. In search of an underlying cause, we noted frequent anecdotal reports of ski accidents, horseback falls, cheerleading catastrophes, and other stories of concussion, or mild traumatic brain injury (mTBI). In the United States, TBI causes 235,000 hospitalizations annually (Cameron, Marshall, Sturdivant, & Lincoln, 2012; Corrigan, Selassie, & Orman, 2010; Larrabee, 2012; Taylor, Greenspan, Xu, & Kresnow, 2015), with > 85% considered mild (Bazarian et al., 2005; Faul, Xu, Wald, & Coronado, 2010). Barring the development of postconcussive syndrome, recovery is assumed to occur within a few months (Cooper et al., 2015). Early recovery reveals deficits in processing speed (De Monte & Geffen, 2005; Shumskaya, Andriessen, Norris, & Vos, 2012), attention (Catale, Marique, Closset, & Meulemans, 2008; Konrad et al., 2011; Mayer et al., 2012), and episodic memory (Wammes, Good, & Fernandes, 2017), and few have assessed cognitive performance in the chronic mTBI population, who are more than 3 months postinjury. In the chronic mTBI population, behavioral deficits are reported in a majority of participants (McInnes, Friesen, MacKenzie, Westwood, & Boe, 2017), accompanied by neural differences (Eierud et al., 2014; Ham et al., 2014; Pan et al., 2016; Sharp & Ham, 2011; Sharp, Scott, & Leech, 2014; Shenton et al., 2012; Tate, Shenton, & Bigler, 2012) detectible even years after an mTBI (Bajaj, Dailey, Rosso, Rauch, & Killgore, 2018; Dall’Acqua et al., 2017).

Why might we see effects of a history of mTBI on WM? WM engages broad frontoparietal networks that are vulnerable to heterogeneous impacts. Thus, across individuals, diverse injuries could lead to behavioral deficits that affect WM. Yet others investigating this issue have reported mixed results. For instance, one earlier meta-analysis reported no residual deficits in WM after the acute stage of mTBI (> 7 days postinjury; Belanger & Vanderploeg, 2005); others found WM deficits exclusively in those with postconcussive syndrome, but not in the chronic mTBI sample (Dean & Sterr, 2013), whereas others reported low WM accuracy accompanied by event-related potential differences (Gosselin et al., 2012; Hudac, Cortesa, Ledwidge, & Molfese, 2018) or slightly greater proactive interference (Vanderploeg, Curtiss, & Belanger, 2005). Thus, in addition to the heterogeneity of injuries, there are data reporting heterogeneous behavioral effects.

To be clear, the students we tested fell into the chronic mTBI category, because their injuries were > 3 months before their participation (e.g., Fino et al., 2018; Goetzl et al., 2019; Lotan et al., 2018; McNerney et al., 2019). Furthermore, these undergraduates reported no sustained symptoms (e.g., headache, mental fog, etc.) that could be attributed to the mTBI itself. In other words, the only factor that distinguished the groups we tested was that one group reported having had an mTBI; thus, we use the term “history of mTBI” to characterize our participants. With regard to our questions, it is important to note that a history of mTBI is high in our undergraduates. In one 400-level course, 31% reported an mTBI, as did 37% of our Disability Resource Center clients. We conducted these experiments to test whether undergraduates with or without a history of mTBI were impaired on simple change detection WM tasks, and consequently, whether we should continue to include them in our ongoing studies of “neurotypical” participants. Across four behavioral experiments, we observed a consistent pattern: Participants with a history of mTBI performed worse at the group level than did those without a history of mTBI. These data are available at https://wolfweb.unr.edu/~mberryhill/, and we note that none of the following experiments were preregistered.

Experiment 1

Materials and method

Participants

Undergraduates participated in groups of those with self-reported mTBI or controls (see Table 1 for demographics). All participants were right-handed throughout all experiments. In this and all other experiments, participants reported, in written format, whether they had a history of mTBI, how many they had had, and when the injury or injuries took place. University of Nevada, Reno’s, institutional review board approved all protocols. Participants provided written consent and received their choice of $15/h or bonus credit.

Table 1 Demographics across experiments

Apparatus

The task was presented on a 19-in. NEC MultiSync CRT monitor (75 Hz, 1,024 × 768) in MATLAB (The MathWorks, Natick, MA) with the Psychophysics Toolbox 3.0 extension (Brainard, 1997; Pelli, 1997), using a Mac mini 2.5-GHz dual-core Intel Core i5.

Stimulus and procedure

In the change detection visual WM task (Fukuda, Awh, & Vogel, 2010; Ikkai, McCollough, & Vogel, 2010; Luck & Vogel, 2013; Luria, Balaban, Awh, & Vogel, 2016; McCollough, Machizawa, & Vogel, 2007; Vogel & Machizawa, 2004), one to three colored squares (0.7° × 0.7°) were presented, chosen from a set of seven colors (cyan, white, red, blue, yellow, green, and magenta); see Fig. 1a. Trials began with a fixation cross (0.4° × 0.4°, 300 ms), followed by a left or right white arrowhead (2.1° × 0.4°, 200 ms) cueing the hemifield to covertly attend. After a delay (300–400 ms), the stimuli were presented (100 ms) in two rectangular areas (7.1° × 12.2°) 4.6° from fixation. After a delay (900 ms), the probe appeared (3 s). Participants indicated whether the stimulus and probe item matched (“o” key; 50%) or not (“n” key). These self-paced trials included three breaks. Before testing, participants completed 24 practice trials. There were 576 trials, or 192 trials per set size (one to three items). Participants were instructed to maintain fixation, and eye movements were monitored by HD-EEG. The primary performance measure was WM capacity: K = Set size * (Hit rate – False alarm rate) (Cowan, 2001; Pashler, 1988). The median correct reaction time was also recorded. Four controls performed 2 + standard deviations below the mean and were removed.

Fig. 1
figure 1

Task paradigm, stimulus configurations, and behavioral results for Experiments 13. (a) Trial sequence. (b) Experiment 1: A Group × Set Size interaction emerges at set size 3. (c) Experiment 2: A replication of the Set Size × Group interaction, for which feedback made no difference. (d) Experiment 3: The mTBI group is impaired across maintenance delays at set size 3. Error bars indicate 95% confidence intervals

Results

All K values were subjected to a mixed-model analysis of variance (ANOVA) with the factors set size (1–3) and group (control, mTBI). Violations of sphericity were Greenhouse–Geisser-corrected throughout. We found significant main effects of set size [F(2, 64) = 582.61, p < .001, ηp2 = .95] and group [F(1, 32) = 4.47, p = .042, ηp2 = .12], as well as a significant Set Size × Group interaction [F(2, 64) = 3.74, p = .03, ηp2 = .1]. Pairwise tests revealed that mTBI impairment emerged at set size 3 (p < .04), but not at set size 1 (p = .3) or 2 (p = .07); see Fig. 1b. Reaction times slowed as set size increased [F(1.5, 47.94) = 55.2, p < .00001, ηp2 = .64], and did so comparably across groups [F(1, 31) = 3.2, p = .08, ηp2 = .1].

Experiment 2

Having observed a significant deficit in the mTBI group at set size 3, we considered the possibility of motivation differences across groups (Ransom et al., 2016), by replicating Experiment 1 with the addition of feedback. Our rationale was that feedback might improve motivation (Miranda & Palmer, 2014) and decrease distractibility (Adam & Vogel, 2017) in the mTBI group.

Method

Participants

Again, our mTBI participants provided written self-reports indicating whether they had a history of mTBI, how many mTBIs, and when the injury or injuries took place. All new undergraduates participated (see Table 1).

Protocol

Trials were presented in two counterbalanced blocks. One half (N = 288) were identical to the trials in Experiment 1, and the other half included visually presented feedback after each trial, indicating “correct” or “incorrect”; see Fig. 1a.

Results

Behavioral results

Participants’ K measurements were subjected to a mixed-model ANOVA with the factors set size (1–3), feedback (feedback, no feedback), and group (control, mTBI). K values increased with set size [F(2, 71.17) = 1,901.82, p = .00001, ηp2 = .975], and the mTBI group was significantly impaired [F(1, 48) = 6.4, p = .015, ηp2 = .12]. Again, a significant Set Size × Group interaction [F(1.23, 71.17) = 6.28, p = .01, ηp2 = .116] emerged at set size 3, regardless of feedback (with feedback, p = .01; without feedback, p = .02); see Fig. 1c. The feedback thus had no effect (F < 1, n.s.), and no other interactions reached significance (all ps > .5). Reaction time data showed no group differences [F(1, 48) = 0.3, p = .6, ηp2 = .5] but revealed slowing with increased set size [F(1.2, 60.5) = 50.62, p < .0001, ηp2 = .5].

Experiment 3

Feedback failed to rescue mTBI performance, suggesting that overall motivation level could not fully account for the set size 3 deficit. Given that performance was not significantly different at set sizes of 1 or 2, we suspected that encoding was relatively well preserved. We next varied maintenance delay durations, to test whether maintenance was disturbed after a history of mTBI, which would lead to greater effects at longer delays.

Materials and method

Participants

We tested 42 participants (see Table 1). Five of the participants with a history of mTBI who had participated in Experiment 2 also participated in Experiment 3. Here we included data collection regarding mTBI etiology and whether participants had experienced any loss of consciousness (LOC). In all, 11 of the mTBI participants reported no LOC, and ten reported LOC (M = 5.47 min, SD = 9.04, range = 1–30 min). All mTBIs were closed-head injuries: 12 were sports-related, six falls/accidents, one blast, one hit by a car, and one participants did not answer.

Procedure

There were three changes: Only set size 3 was included in this experiment, delay durations of 500, 900, and 1,500 ms were pseudorandomized, and participants completed three sessions.

Results

All K values were subjected to a mixed-model ANOVA including the factors delay (500, 900, 1,500 ms) and group (control, mTBI). There was a main effect of delay [F(1.48, 59.43) = 10.31, p = .001, ηp2 = .205] with lower capacity at 500 ms than at either 900 ms (p = .003) or 1,500 ms (p = .002). Again, the mTBI group was impaired [F(1, 40) = 8.78, p = .005, ηp2 = .18]; see Fig. 1d. However, no interactions approached significance (all ps > .77). Reaction times showed no significant main effect of group [F(1, 40) = 3.5, p = .07, ηp2 = .08] and slowed with increased delays [F(1.6, 64.2) = 20.51, p < .000001, ηp2 = .34].

Combined Experiment 1–3 analyses

To begin to parse the main effect of group, we ran a multiple regression across Experiments 13, inputting the K values for the single condition included across all experiments (set size 3, 900-ms delay condition), to test whether number of mTBIs and time since last mTBI could predict WM performance (K value). However, no significant relationship was identified [F(2, 63) = 1.04, p = .36, R2 = .03], and neither the number of TBIs nor the time since injury significantly predicted WM performance (ps > .77).

Experiment 4

Increasing maintenance demands revealed no disproportionate deficit in the mTBI group, suggesting that WM maintenance cannot fully account for the group differences. We turned to retrieval demands, to test whether retrieval processing accounted for the mTBI deficit. We included recognition and recall probes in order to evaluate retrieval processing with new stimuli.

Materials and method

Participants

New participants were recruited (see Table 1). Of these, 12 mTBI participants reported no LOC, whereas ten reported LOC (M = 6.41 min, SD = 10.52, range = 1 s–30 min). The mTBIs were all closed-head injuries: 12 sports-related, six falls/accidents, one fight, and three participants elected not to answer.

Procedure

The stimuli were displayed on a 15-in. MacBookPro. This task was previously described in detail (Gozenman, Tanoue, Metoyer, & Berryhill, 2014). Several differences in the stimuli, set size, and task demands were implemented. Four oriented line segments (7.5° × 1.5°) appeared 6.5° from fixation. Trials began with fixation (1,500 ms), which was followed by encoding (1,000 ms) and maintenance (900 ms). Recognition probes contained one item, and participants reported whether it matched the original orientation (“o” match, “n” nonmatch, 50% probabilities). Recall probes required rotation of the probe item (with the arrow keys, 1°/keypress). Participants completed 200 trials per task in counterbalanced blocks; see Fig. 2a. Performance on recall trials was reflected by degrees error. Participants performed an auditory suppression task throughout, to minimize verbal strategies.

Fig. 2
figure 2

Task paradigm, stimulus configurations, and behavioral results for Experiment 4. (a) Trial sequence for recognition and recall trials. (b) Recognition results. (c) Recall results. Both sets of results show impairment in the mTBI group. All error bars reflect 95% confidence intervals

Results

Significant group differences for each retrieval task revealed mTBI impairments [recognition K, t(21) = 2.21, p = .03; recognition reaction time, t(21) = – 1.96, p = .05; recall error rates, t(21) = – 2.84, p = .006]; see Fig. 2b, c. Recall reaction times were not measured, because multiple button presses were required.

To evaluate the interaction across probes using different performance metrics, we converted the recognition and recall performance of the mTBI group into z scores and subtracted them (zRecallzRecog). A one-sample t test using a test value of 0 revealed no significant difference [t(21) = 0.30, p = .78], indicating the absence of an interaction across retrieval tasks.

Discussion

Few studies have investigated the chronic mTBI population, despite its large size and despite evidence that the consequences of mTBI can be long-lasting (Hou et al., 2012; Kenzie et al., 2017; Moser & Schatz, 2002; Stockbridge, 2018). The present participants reported no residual symptoms of mTBI, despite having a history of mTBI. In this population, our data revealed that four different samples of undergraduates with a history of mTBI had WM impairments. Notably, the mean time since injury was measured in years, not weeks or months. Thus, even in active, otherwise healthy undergraduates, traces of mTBI remain.

A history of mTBI appears to be detrimental to WM. Specifically, the WM deficit reliably emerged at a set size of 3, suggesting that encoding for one or two items was adequate. Feedback did not rescue mTBI performance, although a financial incentive might have been more compelling (Jones, Gozenman, & Berryhill, 2015). WM maintenance failures cannot account for the mTBI deficit, because lengthening maintenance did not increase WM deficits. Manipulating the retrieval demands did not identify a disproportionate WM deficit in the mTBI group. We also identified a deficit in the mTBI group using an encoding duration ten times longer than that used in Experiments 13. The nature of this deficit remains unclear, likely due to the heterogeneity of mTBI. We are left without a clear attribution to a particular stage of WM (e.g., encoding, maintenance, or retrieval). Thus, despite a reliable performance difference in such a heterogeneous population, the nature of the WM deficits emerges across multiple stages of WM, and potentially differently across participants. Having identified a reliable difference in our undergraduate population, we may now probe these questions more carefully going forward.

Limitations

We wanted to understand whether some of the low WM performance in our laboratory studies could be due to a history of mTBI, since a high number of our participants report having had one or more of these. It seems unlikely that a participant would deliberately mislead us, given the option of participating as a control. We made no attempt to restrict participation, and we relied on self-reports of mTBI. We also recruited multiple cohorts of mTBI participants rather than more comprehensively probing a single cohort. We argue that recruiting multiple cohorts should increase the between-subjects noise and work against finding a consistent mTBI-specific deficit at the group level. Another limitation is that we only tested WM and did not test cognitive performance more broadly, or measure personality traits to see whether high risk-taking or impulsivity accounted for some of the difference. It seems quite unlikely that we stumbled on the only task that would reveal impairment. The present data indicate that unaddressed long-term consequences of mTBI persist in an otherwise successful population: enrolled undergraduates. Future efforts must carefully measure and weigh multivariate factors, including mTBI etiology, quality and compliance with medical advice, social support surrounding the injury, and so forth (Kenzie et al., 2017). We would benefit by more comprehensively testing one cohort in order to characterize the boundary between intact and impaired function in a within-participants fashion. These findings reveal the need for asking participants regarding their history of mTBI and suggests that it will be useful to develop restorative protocols across the mTBI survivor population, regardless of how highly functioning they are or how long since their injury.