Highlighting in Early Childhood: Learning Biases ...

Viewer
Transcript

Cognitive Science (2016) 1–24 Copyright © 2016 Cognitive Science Society, Inc. All rights reserved. ISSN: 0364-0213 print / 1551-6709 online DOI: 10.1111/cogs.12408

Highlighting in Early Childhood: Learning Biases Through Attentional Shifting Joseph M. Burling,1 Hanako Yoshida2 1

Department of Psychology, University of California, Los Angeles 2 Department of Psychology, University of Houston

Received 22 January 2015; received in revised form 2 March 2016; accepted 7 March 2016

Abstract The literature on human and animal learning suggests that individuals attend to and act on cues differently based on the order in which they were learned. Recent studies have proposed that one specific type of learning outcome, the highlighting effect, can serve as a framework for understanding a number of early cognitive milestones. However, little is known how this learning effect itself emerges among children, whose memory and attention are much more limited compared to adults. Two experiments were conducted using different versions of the general highlighting paradigm: Experiment 1 tested 3 to 6 year olds with a newly developed image-based version of the paradigm, which was designed specifically to test young children. Experiment 2 tested the validity of an image-based implementation of the highlighting paradigm with adult participants. The results from Experiment 1 provide evidence for the highlighting effect among children 3–6 years old, and they suggest age-related differences in dividing attention among multiple cues during learning. Experiment 2 replicated results from previous studies by showing robust biases for both image-based and text-based versions of the highlighting task. This study suggests that sensitivity to learning order emerges early through the process of cued attention, and the role of the highlighting effect in early language learning is discussed. Keywords: Cognitive development; Learning; Attention

1. Introduction The ability to adapt to unfamiliar situations based on knowledge from past experience is fundamental to human learning. For example, a child might be extra cautious at first when told to try an unknown food placed on his/her plate, particularly if this unfamiliar item is green and shaped like a vegetable. Making accurate predictions based on Correspondence should be sent to Hanako Yoshida, Department of Psychology, 126 Heyne Building, University of Houston, Houston, TX 77204-5022 USA. E-mail: [email protected]

2

J. M. Burling, H. Yoshida / Cognitive Science (2016)

outcomes from previous encounters is a continuous learning process and relies on basic cognitive functions such as memory and attention for selecting important information from the environment and storing it for later use. However, recent work has shown a protracted course in the development of these most basic functions, particularly with regards to memory capacity and the distribution of attention among multiple cues, with rapid changes taking place during the preschool years (Bertrand & Camos, 2015; Fisher, Thiessen, Godwin, Kloos, & Dickerson, 2013; Ruff & Rothbart, 2001; Schneider, 2015). Understanding the nature of these changes in early childhood can provide insight into the basic learning mechanisms that are responsible for the gradual accumulation of knowledge over time at all ages of development (for similar developmental approaches in the spatial learning domain, see Darby, Burling, & Yoshida, 2014; Dixon, Zelazo, & Rosa, 2010). In this study, we take what has previously been considered an adult-specific learning phenomenon, the highlighting effect, and compare differences in learning outcomes between preschool and primary school children on a customized version of the task. 1.1. Highlighting: A unique ordered learning bias The highlighting effect is a special case of cognitive bias from a class of biases formed as a result of disproportionately learning multiple relationships over time. A typical task involves presenting a pair of cues alongside potential referents of these cues, with instructions for learning associations between cues and referents. Some of the cues may be repeated over time and shown with different referents, establishing redundancies when learning the relationships. Biases emerging from learning may sometimes be classified as irrational (Shanks, 1995) in the sense that learners show behaviors that link together some of the cues with one of the referents, even though any of the possible cues were equally likely to fit with any of the provided referents—likelihood is based on the frequency of occurrence from past exposure (Kruschke, 1996). These purportedly irrational behaviors might manifest themselves as biased preferences, such as looking preferences (Kruschke, Kappenman, & Hetrick, 2005; Pelley, Beesley, & Griffiths, 2011) or selection preferences (Kamin, 1968; Mackintosh, 1975), and they are counterintuitive given expectations about the predicted learning outcomes. Previous literature has typically used symbolic notation in the place of cues and outcomes (Kruschke, 2003). For example, the paired set of cues A.B where B is one item and is always present with the other item A, will both correspond to some outcome. In this case the outcome is the referent X, which may be any event, label, object, etc. Theoretically, cues A and B should have equal weight for predicting the referent X, provided that they are always together when establishing their relationship. Biases start to form with the later introduction of a new set of paired cues A.C indicating referent Y. One specific element of this pair, cue A, has already been previously established and is now repeated across both instances of learning. This leads to cue A being classified as highly ambiguous—as it unreliably refers both to X and Y at different points in time. Its repetitive nature does not provide much in terms of additional information during later learning

J. M. Burling, H. Yoshida / Cognitive Science (2016)

3

instances. However, cues B and C have a direct, one-to-one relationship with their referent and are treated as unambiguous indicators of that referent (cue B is always learned in relation to X and never for Y, and vice versa for cue C). Testing novel combinations of cues that were seen during training reveals selection biases, which have emerged due to the order in which each of the relationships were shown. For instance, if given the repeated cue A in isolation, individuals will often choose referent X over the competing referent Y. This selection preference will take place despite A being seen the same number of times for A.B ? X as A.C ? Y, in other words the p (X|A) = p (Y|A) = 50%. Also, when pairing both the unambiguous cues together, such as B.C, participants will likely choose outcome Y, despite both referents being equally probable for selection, that is, p (X|B) = p (Y|C) = 100%. 1.2. Competing accounts for highlighting In the extant literature, there are multiple competing explanations for the biased selection preferences. Among the list of possible explanations are inferences based on rule learning (Juslin, Wennerholm, & Winman, 2001; Wood & Blair, 2010) or causal relationships (Griffiths, Sobel, Tenenbaum, & Gopnik, 2011; Sobel & Munro, 2009) that each rely on assumptions about top-down processing and higher-level cognitive abilities being responsible for the biases, as well as explanations based on general cognitive mechanisms such as the involvement of memory and attention during associative learning tasks with adults (Hogarth, Dickinson, Austin, Brown, & Duka, 2008; Lamberts & Kent, 2007; Shanks, 1992). The domain-general mechanisms responsible for the highlighting effect are based on rapid shifts of attention during critical moments in time (Kruschke, 2011; Wills, Lavric, Hemmings, & Surrey, 2014) and are not based on explanations regarding abstract, rule-like representations. For example, given the structure of learning these relationships sequentially, that is, at Time 1: A.B ? X and at Time 2: A.C ? Y, attention is redirected toward novel cues at Time 2, and correlations are established between new information and any corresponding referent being learned during this time. Attentional resources are thus actively and rapidly being focused toward potentially meaningful input, consequently strengthening or highlighting the link between C and Y. As a byproduct of redirected attention, and due to its redundancy, the association between cue A and outcome Y is weaker, preserving any previously established relationship (in this case, A.B ? X). The domain-general framework is of theoretical importance because it implies that the mechanisms responsible for this type of learning are applicable throughout all of development, including early childhood (Smith, 2001). However, previous work investigating the phenomenon in 8- and 9-year-old children have failed to observe the expected response patterns (Winman, Wennerholm, Juslin, & Shanks, 2005). These findings might be rooted in assumptions about the underlying mechanisms responsible for generating the biases, in that the behaviors are guided by top-down, inductive inferences or evaluations of one’s hypothesis space during decision making. These assumptions could lead to experimental design choices not particularly suitable for use with young children, such as written labels serving as cues, which are then used to make judgments about potential outcomes. Research on the

4

J. M. Burling, H. Yoshida / Cognitive Science (2016)

development of inductive inferences from perceptual features has shown that children under 7 years old make inferences based on perceptual similarity among cues, and it is not until 11 years and older do they perform induction based on labels and knowledge-based processes (Sloutsky, Lo, & Fisher, 2001). Furthermore, associative learning models have shown that it is the order of learning visual features with labels that has an effect on the kind of predictions the model can generate about new events (Ramscar, Yarlett, Dye, Denny, & Thorpe, 2010), and work done primarily in the visual domain has demonstrated that active maintenance of items in working memory has a direct impact on rapidly shifting selective attention (Downing, 2000). This line of work suggests that lower-level mechanisms interacting with temporal information can account for the types of biases observed in the highlighting task without the need for explanations regarding higher-level cognitive capacities. In this study, we observe the patterns of biases in both young children and adults using a child-friendly version of the highlighting task based on perceptual features to establish cueto-outcome relationships and the resulting biases. 1.3. Why study development? The highlighting effect is similar to other important developmental phenomena focusing on sequential learning contexts and domain-general processes. For example, work done in the areas of habituation (Perone & Spencer, 2013; Sch€oner & Thelen, 2006), mutual exclusivity (Merriman, Bowman, & MacWhinney, 1989), online word recognition (Fernald, Thorpe, & Marchman, 2010; Swingley & Aslin, 2000), and contextually cued attention (Chun & Jiang, 1998; Smith, Colunga, & Yoshida, 2010), among others, can be considered to contain elements of biased preferences due to learning multiple cue-to-outcome relationships across time. Each of these examples involves the reallocation of attention toward one or more cues when provided with an abundance of information, and the dynamic nature of this process over time is driven by one’s previous learning history (Johnson, Spencer, & Sch€ oner, 2009; Thelen & Smith, 1996). Memory for previously learned relationships interacts with new information through mechanisms of cue competition (Desimone & Duncan, 1995; Yurovsky, Yu, & Smith, 2013) and is also subject to interference effects, such as when new information partially or fully overwrites memory for old information (Bower, Thompson-Schill, & Tulving, 1994; Darby & Sloutsky, 2015; Howe, 1995). Gradations in attentional control and memory capacity at specific stages of development can influence how these mechanisms interact with one another to create a series of unique learning trajectories. Assessing developmental differences in highlighting task performance between preschool and primary school children provides an opportunity to present a case for gradations among the related learning literature (Gibson & Rader, 1979). While the highlighting effect provides an ideal context for studying the mechanistic origins of biased learning in different developmental phenomena, its relevancy within the developmental literature is rarely addressed. One of the few considerations is from within the domain of early word learning (Ellis, 2006; Regier, 2005; Yoshida & Burling, 2012). For example, assume that a child initially learns that a wug creature’s defining features are both a tail and wings, then later learns that a dax creature also has a tail but has

J. M. Burling, H. Yoshida / Cognitive Science (2016)

5

claws instead of wings. It is during the time in which the child learns about the dax that the relationship between the most defining feature (claws) and the dax might be strengthened, or highlighted. Variations in associations between individual features and creatures can lead to unexpected biases when presented with ambiguous feature combinations, such as how the child might predict a dax when asked which creature consists of both wings and claws or predict a wug when asked only about a creature with a tail. Under this scenario, highlighting-like phenomena could be a likely candidate for explaining some of the disambiguation difficulties in early word learning. For example, adjective learning takes place at a much later age than noun learning (Gasser & Smith, 1998), and it is also the case that in the English language, adjectives maintain a temporal role within a sentence structure, as well as being ambiguous in isolation—that is, when not modifying a noun in a sentence (Mintz & Gleitman, 2002). A more complete understanding of the mechanisms driving the mastery of adjectives and word learning is one justification for the need to employ learning paradigms such as the highlighting task, tasks that are amenable and effective for understanding the processes taking place during temporally ordered learning (Yoshida & Burling, 2012). 1.4. Overview of approach In this study, we conducted two separate experiments of the highlighting paradigm with both children and adults to further investigate the relationship between domain-general cognitive processes and biased learning outcomes, and to assess how ordered learning preserves critical pieces of information across multiple learning instances and across multiple developmental periods. We implemented a standard temporally ordered learning structure (learning of some relationships before moving on to others) common among many iterations of the highlighting task (Kruschke, 1996) and measured selection biases (or asymmetrical preferences) after training participants with this structure. One important manipulation of the training structure is that training be constrained to be equal in frequency for both initial and later learning instances, so that when biases are observed after shown ambiguous combinations of previously learned cues, these biases cannot be attributed solely to the frequency of learning previous relationships. In Experiment 1, we tested children from 3 to 6 years old on their ability to establish the learning biases typically seen under the highlighting task, with an additional emphasis on the selection preferences between preschool and primary school children. In Experiment 2, we tested adult participants on both our child-friendly implementation of the task and a standard version typically administered in the adult learning literature. This within-subjects design was used to establish that the two perceptually different types of tasks were comparable in generating the expected response biases.

2. Experiment 1 Experiment 1 tests for biased object preferences in young children. We propose that the learned biases are established by forming associations between a single referent and a

6

J. M. Burling, H. Yoshida / Cognitive Science (2016)

subset of one or more cues from a pair of cues, and that the pattern of biases can be explained by an interaction between the order in which relationships are learned and the nature of cue overlap throughout learning. The current experimental hypothesis is that children will show evidence of the highlighting effect, and that associations from sets of paired cues might be more fragile due to age-related differences in cognitive capacities, such as differences in memory capacity and the distribution of attention among cues. Hence, we expect to observe younger and older children responding differently on the critical testing items as opposed to treating them equally, reflecting their cognitive capacity state at some point in development. 2.1. Participants We recruited children from the surrounding University of Houston community and within the Greater Houston Area. Thirty-three children ages 38.4–71.4 months (M = 55.5) participated in the task in exchange for a small gift at the end of the session. An additional eight children also participated, but we removed them from the analysis for failure to adequately learn both A.B and A.C sets by the end of the training phase (either they did not reach the testing phase or the final training accuracy for either set was < 67%; there were no differences due to age or stimulus type). 2.2. Stimuli and materials Our implementation of the highlighting task consisted of a sample of illustrated images presented on a 19″ touch screen monitor with a resolution of 1,280 91,024. We designed a series of nine images (common objects) to serve as paired cues (cues that, when presented together, refer to another distinct type of image). For each child, we selected three of the nine images and assigned them to the role of predictive cue A, B, or C. We reused these object illustrations throughout the task. Two of the objects were always presented as pairs, side-by-side at the top of the screen. In addition, we created a separate sample of six images (animals) and used these as referents for the paired objects. We selected two of the six animals and assigned them the role of referent X or Y, and displayed them side-by-side along the bottom of the screen (see Fig. 1 for an example of the task screen). We preselected groups of images to avoid the situation of strong preexisting associations between specific groups of objects and animals. Table S4 and corresponding Fig. S4 show a complete breakdown of the groups used in the study and how they were assigned to specific cues and referents. 2.3. Procedure Two main phases of the task were administered to the children, training then testing. We provided verbal instructions along with a familiarization phase before starting the training phase (see Table S5 for the list of instructions used for this task). Participants began the task with six familiarization trials designed to instruct them on using the touch

J. M. Burling, H. Yoshida / Cognitive Science (2016)

7

Fig. 1. Example of a selection of cues and referents for the image-based version of the highlighting task. The symbol for each type of cue is marked with a letter next to the image (not shown in the actual task). The figure depicts image group 2 of 3 (see Figure S4 for others). Objects such as the spoon and apple serve as paired cues, whereas the elephant and monkey are potential referents.

screen monitor. Children were required to drag a pair of triangles at the top-center of the screen toward one of two boxes presented along the bottom of the screen. They then pressed a button on the top right corner to accept their response and were then provided with corrective feedback before moving on to the next trial. 2.4. Training procedure After familiarization with the setup, participants proceeded to the training phase with the goal of learning the correct referents (animals) for a set of paired objects. It is during the training phase that any biases are expected to be established. A single-object pair (predictive cues) and its referent was learned first before moving on to another set of paired objects and its corresponding referent. The first relationship A.B ?X we will refer to as Set 1, and the second relationship A.C? Y as Set 2. We instructed children to drag the object pair placed at the top-center of the screen down to one of the referents placed at opposite ends along the bottom of the screen (see Table S5 for these instructions). That is, children placed the predictive cues in the appropriate box, with each box displaying

8

J. M. Burling, H. Yoshida / Cognitive Science (2016)

the image of one of the referents (Fig. 1). Both possible referents were always displayed along the bottom of the screen, and the horizontal orientation of objects and referents was randomized across all trials. Dragging either one of the objects downward toward a box led to both items moving across the screen synchronously to illustrate a coupled relationship between pairs of objects and to avoid separating objects into different boxes during a trial. We provided auditory feedback using either cheers (correct) or a buzzing sound (incorrect) after the children made their final referent selection. A gradual progression (between early learning of Set 1 and late learning of Set 2) took place by designing three distinct phases of training: Early, Mixed, and Late (see Table 1 for a summary on how this was structured). We tracked their progress during each phase to guarantee that participants learned the items throughout the training session. The total number of training trials per participant depended upon his/her performance in each phase. The starting number of trials in each phase and the final total were taken from Kruschke’s (2009) canonical design. The canonical design allowed for children to become equally exposed to the different sets (A.B ? X and A.C ? Y) while keeping intact the progression from early to late learning (Table 1). All participants progressed through each training phase without interruption and were not informed when one type of phase progressed to the next. A participant began Early training with four consecutive Set 1 trials (2 blocks) before assessing accuracy. If a participant reached at least 75% accuracy after these initial four trials, he/she moved on to the Mixed phase; otherwise we added an additional block of trials until both of those trials were correctly answered. The Mixed phase served as a gradual introduction to the Set 2 association, and during this phase we displayed three trials from Set 1 and one trial from Set 2 (1 block). Accuracy was again assessed with a criterion of 75% for the Mixed phase, and additional blocks were added as necessary. We recorded the total number of blocks for determining the length of the Late training phase. We based the final number of blocks in the Late training phase on the sum of the total number of Early and Mixed blocks (Table 1). The Late training phase started with a minimum of 12 trials (3 blocks), 9 of them being from Set 2, and 3 from Set 1. If a participant required no additional blocks throughout all of training, the exact number of Set 1 training items learned during training was 10, with the number of Set 2 training items Table 1 Highlighting training structure Phase

Total Blocks

Item Type & Trials per Block

Total Trials

Early Mixed Late

E=2 M=1 L=E+M=3

A.B ? X (92) A.B ? X (93), A.C ? Y (91) A.B ? X (91), A.C ? Y (93)

4 4 12

Note. Participants moved from learning A.B ? X (Set 1) toward learning A.C ? Y (Set 2). Accuracy was assessed at the end of each training phase and additional blocks were added as necessary. The total number of blocks in the Early and Mixed phases was recorded to determine the final number of blocks in the Late training phase. The number of blocks and trials shown represents the minimum that participants were exposed to if they met the accuracy criteria throughout the entire training session.

J. M. Burling, H. Yoshida / Cognitive Science (2016)

9

also equaling 10. The average number of Set 1 training for all participants was 11.97, which was also the same for training set Set 2 (see Table S7 for a breakdown within each phase). 2.5. Testing procedure Before the start of the testing phase, we informed participants that they were about to see pictures of what they previously saw, and to try their best at choosing one box for their final selection (Table S5). We presented testing items similar to that of training, except that we now showed the participants novel cues (either novel combinations of objects or objects as single cues—no longer paired together). The same previously used training pairs were also intermixed with the new testing trials for comparison purposes. Each type of the six test cues presented in Table 2 was randomly assigned and repeated five times, resulting in a total of 30 testing trials per participant. Unlike the training session, participants were not provided with feedback after each test trial, and the completion of each trial immediately led to the next one. 2.6. Results We derived accuracy measurements for each test item by calculating the proportion of correct responses from the total number of test trials completed for each child. For the novel, untrained items, we recorded a response as correct or incorrect based on the expected answer choice obtained from previous literature (Table 2). We then estimated the probability of a particular frequency of responses by fitting a Bayesian hierarchical logistic regression model with a logit link function and binomial likelihood using the following set of predictors: type of testing cue (e.g., A.B, . . . B.C), the child’s age, the image group used (Fig. S4), the interaction of test cue 9 age (age-related effects for each test cue), and the interaction of image group 9 test cue (as a control for image-related effects on response). The choice of the use of a hierarchical regression model allowed us to vary the testing cue type coefficients by child, meaning that we fitted different intercepts and test cue parameters for each individual, which were estimated from a common covariance matrix. This was to account for dependencies such as the correlation of responses from the same individual due to repeated measures for each test cue type. We fit the fully Bayesian implementation to the generalized linear mixed model, which allows for estimating a probability distribution for each of the parameters given the data we Table 2 Expected selection biases Test cue(s)

A.B

A.C

B

C

A

B.C

Corresponding referent

X

Y

X*

Y*

X*

Y*

Note. The first two items are the trained items, introduced again during testing. Items with an * indicate the expected referent to be associated with the novel cue(s) that were provided during testing. Accuracy scores were based on these expectations.

10

J. M. Burling, H. Yoshida / Cognitive Science (2016)

collected instead of a single estimate as in frequentist approaches. Due to shrinkage and deriving a distribution of parameter values, we can apply the appropriate set of contrasts required for the multiple comparisons without the need for type I error correction methods (Gelman, Hill, & Yajima, 2012). Highest density intervals (HDI, or credible intervals) are used when reporting results and are based on the most dense regions of the posterior distributions of parameters, which are estimated from the model (Kruschke & Liddell, 2015). See the Analysis section for Experiment 1 in Data S1 for additional model details. 2.7. Evidence of the highlighting effect in children When considering all children used in the analysis as a whole, the results show strong evidence for biased object preferences in children, which are established from the temporally ordered training session. Figure 2A displays the accuracy (proportion of correct responses) collapsed across children for each type of testing trial. Performance on the trained cues (A.B ? X = 0.78) and (A.C ? Y = 0.69) was well above chance performance of 0.50 (equal preference for either referent), demonstrating that the child’s accuracy on the training items persisted well into testing, and despite the testing phase mixing trained cues with novel occurrences. The single cues were also above chance (B? X = 0.63, C ? Y = 0.78). These trials have an unambiguous, one-to-one correspondence given that they were never paired with more than one referent (unlike cue A). However, there was evidence of accuracy differences between the two single, unambiguous cues. That is, we observed a significant difference between the C and B test trials (the difference in accuracy between the association of C ? Y and B ? X was .15, 95% HDI = [0.06, 0.22]), indicating a robust highlighting effect for the later learned unambiguous cue, and how strongly highlighted items could lead to better learning of implicit relationships (cues individually correspond to referents just as they do when shown in pairs). The increase in association for C ? Y also directly influences how children respond to the ambiguous set of testing trials, A and B.C (Kruschke, 2003). We observed a strong bias for the early learned referent X when tested on the ambiguous, single cue (A ? X = 0.73), and a weaker bias toward the later learned referent Y when given the ambiguous pair of cues (B.C ? Y = 0.58). Together, these set of results reflect the expected trends as seen in the adult highlighting literature (Kruschke, 2009; Kruschke et al., 2005). Our results showing that the strength of the bias is not equal among the two ambiguous test items gives some insight into the nature of what makes each test item considered uniquely ambiguous. Given children’s changing cognitive capacities along different points in development, they may treat these two items differently than what is typically expected in adults (adults often respond to B.C ? Y and A ? X with similar frequency). 2.8. Age-related differences in selection biases To account for developmental differences among the set of cues tested, we analyzed performances between preschoolers and primary school children. We collapsed all

J. M. Burling, H. Yoshida / Cognitive Science (2016)

11 (A)

Trained: p(X | A.B)

Single: p(X | B)

Ambiguous: p(X | A)

Posterior samples

3000 2000 1000 0 0.5

0.6

0.7

0.8

0.5

0.6

Trained: p(Y | A.C)

0.7

0.8

0.5

Single: p(Y | C)

0.6

0.7

0.8

Ambiguous: p(Y | B.C)

3000 2000 1000 0 0.5

0.6

0.7

0.8

0.5

0.6

0.7

0.8

0.5

0.6

0.7

0.8

Accuracy, Pr(correct | cue/s) (B)

(C) p(Y | C) − p(X | B)

0.102

B

p(Y | B.C) − 0.5

0.04

0.17

0.30

A=B

0.019

A

p(X | A) − 0.5

0.018

A.B

−0.05

0.014

C

−0.04

−0.04

B>A

0.007

A.C

0.34

−0.060

B.C −.2

−.1

0.12

0.38

A>B 0

.1

.2

Accuracy diff. (old − young)

−.2

0

.2

.4

−.2

0

.2

.4

−.2

0

.2

.4

Accuracy diff.

Fig. 2. For all plots, the gray dotted vertical line marks chance performance (selecting X and Y equally) or no difference (zero value). Thin horizontal lines display the 95% highest density interval (HDI), whereas the thick horizontal lines display 1 SD of the posterior distribution. The point in the center is the posterior mean. For consistency between plots, accuracy scores are always presented along the horizontal axis. (A) Histograms correspond to the distribution of posterior accuracy, the probability of choosing the expected referent given a specific test item. Solid black vertical lines indicate the observed mean accuracy computed from the data (unmodeled). (B) Difference in mean accuracy proportions between older children and younger children for each test item. Negative values indicate younger children performed better, thus positive values indicate better performance for older children. The difference score is displayed above the HDI. (C) The extent of the highlighting effect (panel 1) and other selection biases (panels 2,3) given how children performed on cues A and B. Panel 1 is the difference between two test cues, panels 2 and 3 are differences from chance.

preschool children 4.5 years and younger (mean age = 47.4 mos.; range = [38.4, 54]; n = 14) into a “young” group, and collapsed primary school children older than 4.5 years (Mage = 61.5 mos.; range = [55.5, 71.4]; n = 19) into an “old” group. Based on mean accuracy for each test item for each group, we calculated differences scores (old – young) to analyze group contrasts (shown as points in Fig. 2B). Accuracy proportions for both the young and old groups are shown in Table 3. The largest difference in accuracy we observed was for the single cue B, where we found better performance in older children for this cue. The older group of children were more likely to remember that the single

12

J. M. Burling, H. Yoshida / Cognitive Science (2016)

cue B unambiguously corresponds to the early referent X, with a difference in proportion of 0.10 (95% HDI = [0.001, 0.21]). We found that younger children performed better on the ambiguous pair of cues, B.C. Younger children were more likely to choose the later learned referent Y when tested on B.C, a difference of .06 (95% HDI = [ 0.04, 0.17]). These results suggest that older children were more likely than younger children to maintain the relationship of cue B corresponding to the early referent X, and thus more likely to learn the correct, implied relationships between cues and referents despite never learning them separately. 2.9. Attentional distribution among early training items Developmental differences between each of the testing cues may be attributed to differences in how children remember the early learned relationships, specifically how they might have treated paired cues as separate entities. To investigate the role of memory versus distributed attention among the early trained relationships, we looked at the accuracy of the single test items A and B to observe how performance on these items impacts other critical test items. Three groups of children were formed based on (1) a child’s mean accuracy on the A test trials was equal to their accuracy on the B test trials, (2) A > B, or (3) A < B. The A = B group indicates that the cues A and B—learned during the start of training—were each equally likely to correspond to X. Despite never being taught in isolation, children were able to learn implied relationships between these cues and the early referent, in addition to maintaining these associations well into testing. Groups A > B and A < B indicate that at least one cue from the pair was not equally associated with the corresponding referent, thus performance on each during testing differed considerably from one another. Table S6 in the supplementary materials displays the sample size, mean age, and accuracy proportions on all test items for each of these three groups. The results from Fig. 2C indicate that the extent of the selection biases seen in testing is dependent upon how the child attended to the single cues A and B during their training of the association A.B ? X. For the group consisting of participants where B > A, we

Table 3 Accuracy for each testing cue separated by age group Test item A.B A.C B C A B.C

Expected response

Younger children

Older children

X Y X Y X Y

0.79 0.69 0.55 0.76 0.71 0.64

0.78 0.69 0.68 0.79 0.75 0.54

Note. Accuracy scores are based on the proportion of selecting the expected response out of the total number of repeated trials of the same kind. Difference scores are shown in Fig. 2.

J. M. Burling, H. Yoshida / Cognitive Science (2016)

13

observed no highlighting effect and found no evidence of biases whatsoever for any of the ambiguous test items. These results would be expected if individuals learned all implicit relationships and selected referents with equal probability when given ambiguous information. For the A > B group, where we observed an unusually strong bias for X when given the ambiguous cue A, we also observed a 34% increase in accuracy for cue C ? Y over B ? X, indicating a strong highlighting effect for C. We also observed a bias for referent Y when given the ambiguous pair of cues B.C, 12% higher than expected by chance. Lastly, for the A = B group we did not observe the same discrepancy between B and C performances as we did in the A > B group, and instead observed a small difference between C ? Y and B ? X, as typically seen in the adult highlighting literature. However, this still resulted in strong biases for both B.C ? Y (17% higher than chance) and A ? X (30% higher than chance). These results suggest that children’s differential learning of early pairs of cues has a direct influence on the likelihood of them forming selection biases during later learning.

3. Discussion of Experiment 1 3.1. Referential biases observed in children The findings from Experiment 1 demonstrate that children as a whole show the expected selection biases when provided with ambiguous cues and when provided with a childfriendly version of highlighting task. That is, children strongly preferred the earlier learned referent when given the repeated cue A (despite this cue being shown with both referents), and preferred the later learned referent when given the combination B.C (despite each individual cue already having a direct, one-to-one mapping with its corresponding referent). The case of A biased toward X is particularly interesting because the magnitude of this effect was approximately 10% larger than what is typically seen in adults, and demonstrates how attention was frequently redirected away from A when learning the relationship A.C ? Y. Primacy or recency effects (memory retention for early information only or late information only) cannot account for the results obtained in this study because if either scenario was the case, children would have preferred only one of the referents at all times—either the early or the late referent—when given ambiguous items. Instead, children formed different biases for both the early and late referents, which suggests different mechanisms resulting in the asymmetric associations across the two ordered learning instances. 3.2. The role of memory and attention in forming biases We also observed that the extent of the biases depended on how well the child attended to individual cues during the early training set. If cues A and B were not treated equally when learning the initial relationship of A.B ? X, then this significantly affected their accuracy during testing. We found that during testing, older children were more likely to infer the correct relationship of B ? X than younger children. This may be because

14

J. M. Burling, H. Yoshida / Cognitive Science (2016)

memory capacity differences are expected in children of different ages, particularly for older children being able to remember associations that were established much earlier in training. Alternatively, developmental differences may also be attributed to how children of different ages distribute attention across multiple items during early learning. When children strongly preferred B ? X much more than A ? X, we observed no evidence of selection biases for any of the ambiguous testing items. This situation may imply that memory for A as a predictor of X was poor during training, or that more attentional resources were allocated toward cue B during this time. However, if memory for A was in fact degrading over time and throughout training, then children should instead show biases for A ? Y, which is more in line with a recency effect explanation. Children were much more likely to associate A ? X, or prefer both referents equally when given the options X and Y, and it was never the case that A strongly refers to the late item Y. Thus, it is likely that the early learned relationships at the start of training were reliably maintained in memory over time and that any type of biases that may be formed are due to differences in attentional allocation, particularly during early learning. Furthermore, the accuracy of the ambiguous pair B.C ? Y provides weight to the argument for distributed attention among items in children (or lack thereof). If limited memory capacity was solely responsible for the results obtained in children, we should have observed a much stronger preference for Y when given the pair B.C than what is observed, given that B would have been largely forgotten, and C was only seen late in training. However, the B.C bias was the weaker of the two ambiguous items for both preschool and primary school children. Lastly, the two different types of ambiguous items might affect attentional distribution in different ways. If children are able to distribute attention equally among multiple items, then they can also evaluate these items separately and individually when making selection preferences. Attention can be distributed between objects in the ambiguous B.C case, whereas for the single ambiguous item A, this is not even possible. Equal distribution of attention is particularly likely if B is a strong competitor, and thus, the highlighting effect may be attenuated under ambiguous situations with additional, potentially informative cues. This may explain the lower performance for the B.C test item in general, and in particular for older children with greater attentional distribution ability. These results suggest that memory, but attentional development in particular, plays an important role in the formation of these biases. Specifically, competition between pairs of cues, the redirection of attention toward novel, more informative sources of information and away from previously established sources, can result in some items having a strong relationship with current events, in contrast to those cues that do not receive as much attention.

4. Experiment 2 Our results are inconsistent with the results from a study showing that children as old as 8–9 years failed to show the highlighting effect under more adult-like, diagnostic conditions (Winman et al., 2005). One may argue that our child-friendly version of the highlighting task may not be compatible with a more standard version, and the nature of the

J. M. Burling, H. Yoshida / Cognitive Science (2016)

15

highlighting effect is specific to a task context in which participants make inferences about causal outcomes given a set of predictive cues. If this is the case, the selection biases observed in Experiment 1 are not the same as the highlighting effect, and we should observe measurable differences in the biases formed when compared to the standard task using diagnostic conditions (Kruschke, 2009; Medin & Edelson, 1988). The goal of Experiment 2 was to address the possibility that the image-based implementation of the highlighting task we administered in Experiment 1 resulted in biases that were fundamentally different compared to adult-oriented designs of the highlighting paradigm, questioning the generalizability of the developmental findings documented in Experiment 1. We tested adults on the same image-based task used with children in Experiment 1 and on a standard, text-based version of the highlighting task (Kruschke, 2009; Medin & Edelson, 1988) to rule out this possibility. The primary difference between designs was the content matter for learning cue–referent associations, with one task based on associations between sets of images, and the other based on textual information. We compared testing cues common to both the image-based and text-based designs to evaluate the selection preferences from the two tasks as equally as possible, with the expectation that the adults will form similar biases when given both tasks. 4.1. Participants Forty-seven adults (mean age = 23.8; range = [18, 38]) with normal to corrected vision from the University of Houston and the Greater Houston Area participated in both the image-based and text-based task designs. All participants received some form of compensation for their time by either providing them with partial course credit or a $5 gift card as a form of payment. 4.2. Stimuli and materials The stimuli for the image-based version of the highlighting task were identical to stimuli used with children from Experiment 1. For the standard text-based task, we displayed cues and referents as typed text surrounded by outlined boxes, shown on the same touch screen monitor used for the image task. Following the task structure used by Kruschke (2009), participants partaking in the text task viewed two different sets of early learned items and two different sets of late learned items—as opposed to just one of each as in the previous experiment. We use subscripts to identify different sets. For example: A1.B1 ? X1 denotes early Set 1, A2.B2 ? X2 early Set 2, A1.C1 ? Y1 late Set 1, and A2.C2 ? Y2 late Set 2. The cues in this version of the task are defined by known symptoms instead of images whereas referents correspond to novel diseases in place of animals. We took symptom and disease terms from Medin & Edelson (1988), the complete list can be viewed in Table S9. We randomly sampled six of the symptoms and assigned them to the role of paired cues and sampled four diseases to take the place of referents. We displayed two symptoms per trial with the vertical orientation of the text randomized. We showed all four possible diseases at all times at the bottom of the screen with the

16

J. M. Burling, H. Yoshida / Cognitive Science (2016)

order also randomized across trials. Figure S6 in the Supporting Information illustrates the layout of the stimuli for a single trial as seen on the touchscreen monitor. 4.3. Procedure Each adult participant completed both the image-based and text-based version of the task. The order of each task was randomly assigned between participants. The procedure used for the image-based version of the task in Experiment 2 was identical to the one used in Experiment 1, except that the adults performed the task three times, each time viewing a distinct group of images (and each time learning new sets of associations), thus exhausting all possible groups from the set shown in Fig. S4. We counterbalanced the order of the image groups for each participant. Responses were collapsed across image groups for each test cue type and for each participant. 4.4. Text-based procedure Training for the text-based task was similar to the image-based design. We presented written instructions before initiating training (see Table S10). The temporal training structure for the text version was the same as in the image-based version (progression through Early, Mixed, and Late phases). We chose a larger, fixed number of training trials to adhere to previous studies as much as possible, instead of assessing accuracy after blocks of trials and adding more as necessary. As previously mentioned, the early learning phase presented sets of items A1.B1 ? X1 and A2.B2 ? X2 instead of a single set. The mixed and late learning phases contained the early learned items as well as A1.C1 ? Y1 and A2.C2 ? Y2. Each phase had a predetermined frequency of training trials. The early learning phase contained a total of 16 learning trials with both early sets presented 8 times each. The mixed learning phase contained a total of 32 trials with 24 being early items (12 each) and 8 late items (4 each). The late phase contained a total of 64 trials with 16 early items (8 each) and 48 late items (24 each). In total, there were 112 training trials split between four different relationships that were learned across temporally ordered phases. Participants simply touched the correct disease within the rectangular boundary to make a response when items were presented during each trial. We provided participants with corrective feedback after their response. After training in the text task, we displayed written instructions before starting the testing phase (Table S10). The text-based testing procedure was identical to the training procedure with the exception of the different combinations of cue(s) and no corrective feedback for each trial. Learning multiple sets of early and late items allowed for testing novel cue combinations between distinct sets. For example, participants could be shown a list of symptoms such as A1.B1.C2 or A2.B2.C1 and asked to choose the appropriate disease. In this particular situation, combinations of symptoms were shown that overlapped between different sets. We presented these types of cues during testing to maintain consistency with previous literature, but they were not of particular interest during analysis as the goal was to compare analogous test cues between the different types of

J. M. Burling, H. Yoshida / Cognitive Science (2016)

17

highlighting designs. We administered a total of 60 testing trials per participant, 8 trials of test item A.B (A1.B1 = 4, |A2.B2 = 4), 8 trials of A.C, and 4 trials each of every other test cue(s) shown in Table S11. More trials of A.B and A.C were added during testing in order to better evaluate adequate learning of the training items. 4.5. Results 4.5.1. Learning criteria for adults Performance on novel test items can only be accurately assessed if participants learned the correct cue-to-referent associations during training. For the text-based version of the task, we calculated accuracy for the test items A.B and A.C and we required at least 6 out of 8 correct responses for each item and for each individual (Kruschke, 2009). Unlike the fixed number of trials in the text-based design, the length of the image task training varied according to participant performance, with a mean of 60.4 trials completed for each participant, collapsed across image groups. We assessed whether or not participants learned the training items for the image-based design based on the last phase of training, similar to Experiment 1. We observed a noticeable difference in performance between the text task and the image task, with 13 adults removed from the analysis due to failure to adequately learn the training items A1.B1, A1.C1, A2.B2, and A2.C2 in the text-based version. After these removals, accuracy for the image-based design was assessed, and only one additional participant beyond the already removed participants from the text design failed to learn the training events during the late training phase during this task. Removals due to these learning criteria resulted in a total of 33 adult participants used in the subsequent analysis. 4.5.2. Testing performance in adults We obtained frequencies of correct responses for each type of test item, which were totaled across all image groups in the image task and across all sets in the text task. For example, a single participant’s score on test item A.B for the image-based task is the sum of all A.B trials from the first, second, and third image groups, whereas their score for A.B on the text-based task is the sum of test trials A1.B1 and A2.B2. The results, averaged across all participants, are presented in Tables S11 and S12. Only test trials common to both tasks were compared in the analysis. The same approach in terms of statistical analysis and class of model fitted was identical to that of Experiment 1. The frequency of correct responses out of the total number of trials was used as the outcome variable in the model, and was obtained from each of the participants’ performance on each of the six types of testing cues (A.B, A.C, . . ., B.C), for each type of task (text, image), along with the interaction between the cue type and task type factors. The coefficients for these effects were free to vary by person and standard deviations and correlations between coefficients were accounted for in the model. See the Analysis section for Experiment 2 in the Data S2 for more details on model formulation. The results shown in Fig. 3 display the 95% highest density intervals (HDI) from the posterior distributions of the test item parameters for both the image and text tasks. The

18

J. M. Burling, H. Yoshida / Cognitive Science (2016)

HDI (thin horizontal line) corresponds to the probability of choosing the correct response given one of the test items, with the mean of the posterior displayed to the right of the figure. All responses were above chance performance of 0.5 (based on the expected choice selections given a particular cue(s), see Table 2). The results show that there are no meaningful differences in performance between the image-based and text-based tasks when comparisons are made between each type of test item—the 95% HDIs between cues of the same type are all overlapping. When comparing overall mean accuracy between the two tasks (collapsed across all test items) accuracy was consistently higher in the image-based version by an average of 3% but not reliably different from zero (HDI ¼ ½0:6%; 8:7%]). In addition, the effect sizes of the selection biases observed in adults for the ambiguous items were similar to the proportions seen in previous literature (see the following for comparison: Kruschke et al., 2005; Lamberts & Kent, 2007). These results indicate that responses are consistent between tasks and the biases observed in the image-based version cannot be attributable to differences in forming associations for objects ? animals and between inferences made about symptoms ? diseases. The consistency in responses between the image-based and text-based designs indicate that the two tasks are comparable, and that the image-based study was effective in eliciting the kind of selection biases typically seen in past highlighting literature.

5. General discussion 5.1. The fragile nature of ambiguous associations in children In this study, we documented young children’s asymmetric learning of relationships, which consisted of multiple objects referring to some outcome, and observed biased selection preferences due to the order in which these relationships were learned. We provided evidence suggesting that a simple image-based task used for learning the multiple relationships was similar in function to the standard highlighting task typically used in inducing these biases. Age-related effects were observed between preschool and primary school children in that the older children were more likely to learn the implied relationships seen during training. Their accuracy was higher when tested on the association between the early learned cue B and its referent X, despite only learning this relationship alongside the paired cue A. These results reflect older children’s ability to adequately shift attention between cues when learning multiple cue ? referent relationships over time. When the early learned pair of cues was equally associated with their referent, we observed biases similar to that of adult performance. In addition, differences in distributed attention between paired cues in the early training set likely resulted in differences in biases for test items with multiple ambiguous cues, such as the pair B.C, which was never shown during training. Cue competition continues to play a role during the presentation of multiple ambiguous cues during testing and influences immediate selection preferences. If the highlighting effect can be described as the focus of attention primarily on the cue C when learning A.C ? Y, and this leads to biases when given ambiguous information

J. M. Burling, H. Yoshida / Cognitive Science (2016)

A.B

p(X | A.B) = 0.94

Image : A.B

p(X | A.B) = 0.98

Text :

A.C

p(Y | A.C) = 0.93

Image : A.C

p(Y | A.C) = 0.98

Text :

B

p(X | B) = 0.97

Image : B

p(X | B) = 1.00

Text :

C

p(Y | C) = 0.94

Image : C

p(Y | C) = 1.00

Text :

A

p(X | A) = 0.64

Image : A

p(X | A) = 0.67

Text :

B.C

p(Y | B.C) = 0.68

Image : B.C

p(Y | B.C) = 0.73

Text :

19

0.50 0.55 0.60 0.65 0.70 0.75 0.80 0.85 0.90 0.95 1.00

Accuracy, Pr(correct)

Fig. 3. Adult performance for the image- and text-based versions and for each testing cue from the highlighting task. Chance performance is marked at 0.5. The thin horizontal lines are the 95% HDI of the posterior distribution constructed from the estimated model parameters. The thicker horizontal line is 1 SD of the same posterior. The center point is the posterior mean. The value of the posterior mean is displayed on the right, along with the expected choice given each type of cue(s).

such as B.C, then competition between B.C might take place but largely in favor of the highlighted cue C. If attention is not equally distributed during early training, with attention being directed mainly toward the cue B during this time, the opposite effect may occur, resulting in stronger competition between the ambiguous pair of cues, and thus unbiased selection preferences. In general, the degree to which attentional resources are placed upon the repeated cue A when learning the early association A.B ? X directly influences the balance of competition between paired cues during late learning of A.C ? Y, and thus determines the strength of biases when shown ambiguous testing items. We also concluded that the role of memory seems to have less of an immediate influence when assessing the differences between preschoolers and primary schoolers, given that biases were observed asymmetrically for both early and late referents. However, its contribution is still important for retaining the associations across time and during the processing of overlapping cues. Further experimentation is necessary to fully understand the limits of memory capacity in preserving associations over much longer periods of time and the effect this has on selection biases. 5.2. Similar mechanisms for the two versions of the highlighting task Experiment 2 showed evidence that adults form the same biases in outcomes when tested on the text-based version of the highlighting task, the implementation often used in previous literature, or the image-based version we developed specifically for children. The consistency in accuracy proportions between the two tasks for the adults

20

J. M. Burling, H. Yoshida / Cognitive Science (2016)

demonstrates that learned biases can occur outside of a symptom-diagnosis context and under conditions of predominately visual associations. As adults performed similarly, particularly for the ambiguous testing items, we can conclude that the differences we see between younger and older children are likely due to changes in development as opposed to design choices we used in our implementation of the highlighting task. Lastly, we can conclude from the adult data that if the same processes are at work when diagnosing diseases and when predicting animals from random objects, it is unlikely that children as young as 3 years old are also employing the same type of higher level, rule-like strategies during selection. 5.3. Cued attention and word learning Highlighting, and temporally cued attention in general, has previously been suggested to have a prominent role in early word learning (Gogate & Hollich, 2010; Regier, 2005; Smith et al., 2010; Yoshida & Burling, 2012). For example, in children as young as 6 months old, auditory labels are indicated to cue attention to objects (Bergelson & Swingley, 2012). Cue competition driving selective attention is a powerful mechanism for generating looking preferences toward novel objects and establishing robust correlations that persist over time and under ambiguous contexts (Desimone, 1998). Ambiguity resolution can also take place at much smaller timescales, where attention is redirected moment to moment as new information competes with past information. In adults, attention is dynamically reallocated as the auditory information continues to unfold over time, eventually stabilizing on a final selection (Spivey, Grosjean, & Knoblich, 2005). Words and labels commonly refer to objects in the environment, and these objects can have multiple properties. An object can have both a common label, such as a noun, and a descriptive property, such as an adjective. Given that older children in our task were more likely to learn the implicit relationships by distributing attention across paired cues, these results may provide insight into the delay in children fully understanding the nature of adjectives until about 4 years old (Waxman & Klibanoff, 2000). During the early stages of adjective learning, looking-while-listening studies demonstrate how young children use multiple, ordered labels to efficiently direct attention to the correct object when referenced (Fernald et al., 2010). When two different objects share a common label, but differ in their properties (e.g., blue car and red car), the order in which these two objects are learned can bootstrap the learning of new, less frequently observed properties during later learning. In another recent study demonstrating the effect of early learned associations in the context of novel adjective learning, the authors suggest that early learned object–word pairings later help novel pairings to compete for attention (Yoshida & Hanania, 2013). These studies indicate that competitive processes may be mediated through one’s learning history, and that looking preferences within these contexts depend on the degree of mastery of nouns and adjectives at different periods of development. They also suggest that the biases derived through ordered learning can help to resolve conflict under highly ambiguous contexts. Therefore, the biases observed in highlighting paradigms are not merely obstacles that need to be overcome to obtain some greater understanding about

J. M. Burling, H. Yoshida / Cognitive Science (2016)

21

the true underlying relationships, but they serve as a means to preserve past information while new information is also attended to and learned just as well.

Acknowledgments We would like to acknowledge Lauren M. Baker for her work in designing and animating a set of stimuli suitable for use as a child-friendly version of the highlighting task. The relevant research and the preparation of the manuscript were supported by University of Houston, a National Institutes of Health grant (R01 HD058620), the Foundation for Child Development, and the University of Houston Grants to Enhance and Advance Research (GEAR) program. We especially would like to thank the students, parents, and children who participated in the studies. Lastly, we would like to express our sincere gratitude and appreciation for Dr. Linda B. Smith, for her dedication and mentorship provided throughout this study, during our academic journey in general, and beyond. Not only did her training shape and guide our scientific way of thinking, but it will also continue to influence how we live our lives as thinking scientists.

References Bergelson, E., & Swingley, D. (2012). At 6–9 months, human infants know the meanings of many common nouns. Proceedings of the National Academy of Sciences, 109(9), 3253–3258. doi:10.1073/ pnas.1113380109. Bertrand, R., & Camos, V. (2015). The role of attention in preschoolers’ working memory. Cognitive Development, 33, 14–27. doi:10.1016/j.cogdev.2014.10.002. Bower, G. H., Thompson-Schill, S., & Tulving, E. (1994). Reducing retroactive interference: An interference analysis. Journal of Experimental Psychology: Learning, Memory, and Cognition, 20(1), 51–66. doi:10.1037/0278-7393.20.1.51. Chun, M. M., & Jiang, Y. (1998). Contextual cueing: Implicit learning and memory of visual context guides spatial attention. Cognitive Psychology, 36(1), 28–71. doi:10.1006/cogp.1998.0681. Darby, K. P., Burling, J. M., & Yoshida, H. (2014). The role of search speed in the contextual cueing of children’s attention. Cognitive Development, 29, 17–29. doi:10.1016/j.cogdev.2013.10.001. Darby, K. P., & Sloutsky, V. M. (2015). The cost of learning: Interference effects in memory development. Journal of Experimental Psychology: General, 144(2), 410–431. doi:10.1037/xge0000051. Desimone, R. (1998). Visual attention mediated by biased competition in extrastriate visual cortex. Philosophical Transactions of the Royal Society B: Biological Sciences, 353(1373), 1245–1255. doi:10.1098/rstb.1998.0280. Desimone, R., & Duncan, J. (1995). Neural mechanisms of selective visual attention. Annual Review of Neuroscience, 18(1), 193–222. doi:10.1146/annurev.ne.18.030195.001205. Dixon, M. L., Zelazo, P. D., & Rosa, E. D. (2010). Evidence for intact memory-guided attention in schoolaged children. Developmental Science, 13(1), 161–169. doi:10.1111/j.1467-7687.2009.00875.x. Downing, P. E. (2000). Interactions between visual working memory and selective attention. Psychological Science, 11(6), 467–473. doi:10.1111/1467-9280.00290. Ellis, N. C. (2006). Selective attention and transfer phenomena in l2 acquisition: Contingency, cue competition, salience, interference, overshadowing, blocking, and perceptual learning. Applied Linguistics, 27(2), 164–194. doi:10.1093/applin/aml015.

22

J. M. Burling, H. Yoshida / Cognitive Science (2016)

Fernald, A., Thorpe, K., & Marchman, V. A. (2010). Blue car, red car: Developing efficiency in online interpretation of adjective-noun phrases. Cognitive Psychology, 60(3), 190–217. doi:10.1016/ j.cogpsych.2009.12.002. Fisher, A. V., Thiessen, E., Godwin, K., Kloos, H., & Dickerson, J. (2013). Assessing selective sustained attention in 3-to 5-year-old children: Evidence from a new paradigm. Journal of Experimental Child Psychology, 114(2), 275–294. doi:10.1016/j.jecp.2012.07.006. Gasser, M., & Smith, L. B. (1998). Learning nouns and adjectives: A connectionist account. Language and Cognitive Processes, 13(2–3), 269–306. doi:10.1080/016909698386537. Gelman, A., Hill, J., & Yajima, M. (2012). Why we (usually) don’t have to worry about multiple comparisons. Journal of Research on Educational Effectiveness, 5(2), 189–211. doi:10.1080/ 19345747.2011.618213. Gibson, E. J., & Rader, N. (1979). Attention: The perceiver as performer. In G. A. Hale & M. Lewis (Eds.), Attention and cognitive development (pp. 1–21). New York: Plenum. doi: 10.1007/978-1-4613-2985-5 Gogate, L. J., & Hollich, G. (2010). Invariance detection within an interactive system: A perceptual gateway to language development. Psychological Review, 117(2), 496–516. doi:10.1037/a0019049. Griffiths, T., Sobel, D. M., Tenenbaum, J., & Gopnik, A. (2011). Bayes and blickets: Effects of knowledge on causal induction in children and adults. Cognitive Science, 35, 1407–1455. doi:10.1111/j.15516709.2011.01203.x. Hogarth, L., Dickinson, A., Austin, A., Brown, C., & Duka, T. (2008). Attention and expectation in human predictive learning: The role of uncertainty. The Quarterly Journal of Experimental Psychology, 61(11), 1658–1668. doi:10.1080/17470210701643439. Howe, M. L. (1995). Interference effects in young children’s long-term retention. Developmental Psychology, 31(4), 579. doi:10.1037//0012-1649.31.4.579. Johnson, J. S., Spencer, J. P., & Sch€oner, G. (2009). A layered neural architecture for the consolidation, maintenance, and updating of representations in visual working memory. Brain Research, 1299, 17–32. doi:10.1016/j.brainres.2009.07.008. Juslin, P., Wennerholm, P., & Winman, A. (2001). High level reasoning and base-rate use: Do we need cue competition to explain the inverse base-rate effect? Journal of Experimental Psychology: Learning, Memory and Cognition, 27, 849–871. doi:10.1037//0278-7393.27.3.849. Kamin, L. J. (1968). “Attention-like” processes in classical conditioning. In M. R. Jones (Ed.), Miami symposium on the prediction of behavior, 1967: Aversive stimulation (pp. 9–31). Coral Gables, FL: University of Miami Press. Kruschke, J. K. (1996). Base rates in category learning. Journal of Experimental Psychology: Learning, Memory and Cognition, 22, 3–26. doi:10.1037//0278-7393.22.1.3. Kruschke, J. K. (2003). Attention in learning. Current Directions in Psychological Science, 12, 171–175. doi: 10.1111/1467-8721.01254 Kruschke, J. K. (2009). Highlighting: A canonical experiment. In B. H. Ross (Ed.), The psychology of learning and motivation: Advances in research and theory (Vol. 51, pp. 153–185). San Diego, CA: Academic Press. doi: 10.1016/S0079-7421(09)51005-5 Kruschke, J. K. (2011). Models of attentional learning. In E. M. Pothos & A. J. Wills (Eds.), Formal approaches in categorization (pp. 120–152). Cambridge, UK: Cambridge University Press. doi: 10.1017/ CBO9780511921322.006 Kruschke, J. K., Kappenman, E. S., & Hetrick, W. P. (2005). Eye gaze and individual differences consistent with learned attention in associative blocking and highlighting. Journal of Experimental Psychology: Learning, Memory and Cognition, 31(5), 830–845. doi:10.1037/0278-7393.31.5.830. Kruschke, J. K., & Liddell, T. M. (2015). The Bayesian new statistics: Two historical trends converge. SSRN Journal. doi:10.2139/ssrn.2606016. Lamberts, K., & Kent, C. (2007). No evidence for rule-based processing in the inverse base-rate effect. Memory & Cognition, 35(8), 2097–2105. doi:10.3758/bf03192941.

J. M. Burling, H. Yoshida / Cognitive Science (2016)

23

Mackintosh, N. (1975). A theory of attention: Variations in the associability of stimuli with reinforcement. Psychological Review, 82(4), 276–298. Medin, D. L., & Edelson, S. M. (1988). Problem structure and the use of base-rate information from experience. Journal of Experimental Psychology: General, 117(1), 68–85. doi:10.1037/00963445.117.1.68. Merriman, W. E., Bowman, L. L., & MacWhinney, B. (1989). The mutual exclusivity bias in children’s word learning. Monographs of the Society for Research in Child Development, 54(3–4), 1–132. doi: 10.2307/ 1166130 Mintz, T. H., & Gleitman, L. R. (2002). Adjectives really do modify nouns: The incremental and restricted nature of early adjective acquisition. Cognition, 84(3), 267–293. doi:10.1016/s0010-0277(02)00047-1. Pelley, M. E. L., Beesley, T., & Griffiths, O. (2011). Overt attention and predictiveness in human contingency learning. Journal of Experimental Psychology: Animal Behavior Processes, 37(2), 220–229. doi:10.1037/a0021384. Perone, S., & Spencer, J. P. (2013). Autonomous visual exploration creates developmental change in familiarity and novelty seeking behaviors. Frontiers in Psychology, 4(648). doi:10.3389/fpsyg.2013.00648. Ramscar, M., Yarlett, D., Dye, M., Denny, K., & Thorpe, K. (2010). The effects of feature-label-order and their implications for symbolic learning. Cognitive Science, 34, 909–957. doi:10.1111/j.15516709.2009.01092.x. Regier, T. (2005). The emergence of words: Attentional learning in form and meaning. Cognitive Science, 29 (6), 819–865. doi:10.1207/s15516709cog0000_31. Ruff, H. A., & Rothbart, M. K. (2001). Attention in early development: Themes and variations. New York: Oxford University Press. doi:10.1093/acprof:oso/9780195136326.001.0001. Schneider, W. (2015). Memory development from early childhood through emerging adulthood. Cham, Switzerland: Springer International Publishing. doi:10.1007/978-3-319-09611-7. Sch€ oner, G., & Thelen, E. (2006). Using dynamic field theory to rethink infant habituation. Psychological Review, 113(2), 273–299. doi:10.1037/0033-295X.113.2.273. Shanks, D. R. (1992). Connectionist accounts of the inverse base-rate effect in categorization. Connection Science, 4(1), 3–18. doi:10.1080/09540099208946600. Shanks, D. R. (1995). Is human learning rational? The Quarterly Journal of Experimental Psychology Section A, 48(2), 257–279. doi:10.1080/14640749508401390. Sloutsky, V. M., Lo, Y.-F., & Fisher, A. V. (2001). How much does a shared name make things similar? Linguistic labels, similarity, and the development of inductive inference. Child Development, 72(6), 1695– 1709. doi:10.1111/1467-8624.00373. Smith, L. B. (2001). How domain-general processes may create domain-specific biases. In M. Bowerman & S. Levinson (Eds.), Language acquisition and conceptual development (pp. 101–131). Cambridge, UK: Cambridge University Press (CUP). doi: 10.1017/cbo9780511620669.006 Smith, L. B., Colunga, E., & Yoshida, H. (2010). Knowledge as process: Contextually cued attention and early word learning. Cognitive Science, 34(7), 1287–1314. doi:10.1111/j.1551-6709.2010.01130.x. Sobel, D. M., & Munro, S. (2009). Domain generality and specificity in children’s causal inference about ambiguous data. Developmental Psychology, 45(2), 511–524. Spivey, M. J., Grosjean, M., & Knoblich, G. (2005). Continuous attraction toward phonological competitors. Proceedings of the National Academy of Sciences of the United States of America, 102(29), 10393–10398. doi:10.1073/pnas.0503903102. Swingley, D., & Aslin, R. N. (2000). Spoken word recognition and lexical representation in very young children. Cognition, 76(2), 147–166. doi:10.1016/s0010-0277(00)00081-0. Thelen, E., & Smith, L. B. (1996). A dynamic systems approach to the development of cognition and action. Cambridge, MA: MIT press. Waxman, S. R., & Klibanoff, R. S. (2000). The role of comparison in the extension of novel adjectives. Developmental Psychology, 36(5), 571–581. doi:10.1037/0012-1649.36.5.571.

24

J. M. Burling, H. Yoshida / Cognitive Science (2016)

Wills, A. J., Lavric, A., Hemmings, Y., & Surrey, E. (2014). Attention, predictive learning, and the inverse base-rate effect: Evidence from event-related potentials. NeuroImage, 87, 61–71. doi:10.1016/ j.neuroimage.2013.10.060. Winman, A., Wennerholm, P., Juslin, P., & Shanks, D. R. (2005). Evidence for rule-based processes in the inverse base-rate effect. The Quarterly Journal of Experimental Psychology Section A, 58(5), 789–815. doi:10.1080/02724980443000331. Wood, M. J., & Blair, M. R. (2010). Informed inferences of unknown feature values in categorization. Memory & Cognition, 39(4), 666–674. doi:10.3758/s13421-010-0044-1. Yoshida, H., & Burling, J. M. (2012). Highlighting: A mechanism relevant for word learning. Frontiers in Psychology, 3(262), 1–12. doi:10.3389/fpsyg.2012.00262. Yoshida, H., & Hanania, R. (2013). If it’s red, it’s not Vap: How competition among words may benefit early word learning. First Language, 33, 3–19. doi:10.1177/0142723711422632. Yurovsky, D., Yu, C., & Smith, L. B. (2013). Competitive processes in cross-situational word learning. Cognitive Science, 37(5), 891–921. doi:10.1111/cogs.12035.

Supporting Information Additional Supporting Information may be found online in the supporting information tab for this article: Data S1: Experiment 1. Data S2: Experiment 2. Data S3: Model code.

Cognitive Science Special Issue: 2013 Rumelhart Prize Honoring Linda B. Smith

Highlighting in Early Childhood: Learning Biases through Attentional Shifting: Supplementary Materials Joseph M. Burling University of California, Los Angeles

Hanako Yoshida University of Houston

Contents Experiment 1 Supplement Stimuli & Materials . . . . Analysis . . . . . . . . . . . Model formulation . . Model priors . . . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

2 2 3 3 6

Experiment 2 Supplement Stimuli & Materials . . . . Analysis . . . . . . . . . . . Model formulation . . Model priors . . . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

8 8 9 9 10

Model code Experiment 1 STAN code . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Experiment 2 STAN code . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

12 12 13

Corresponding authors: [email protected], [email protected]

HIGHLIGHTING IN EARLY CHILDHOOD SUPPLEMENT

2

Experiment 1 Supplement The following is a set of supplementary materials specific to Experiment 1 from the main text. See the accompanying text in the methods section from the main article that references each Table and Figure, as well as the Results section from the main article which describes results taken from the analysis and presented here in the supplement. Cue-referent assignments are displayed in Table S4, task instructions in Table S5, conditional results in Table S6, average training length in Table S7, model fitted correlation/standard deviation matrix in Table S8 and all image sets shown in Figure S4.

Stimuli & Materials This section contains a set of additional tables and figures that show the full extent of the stimuli used in the image-based task, as outlined in Experiment 1. All stimuli were designed in Adobe Flash CS5 Software. Instructions used for the highlighting task in Experiment 1 are also displayed, which were administered verbally. Symbol

Group 1

Group 2

Group 3

Cue: A Cue: B Cue: C Referent: X Referent: Y

cup glasses strawberry duck cow

apple spoon hat elephant monkey

chair shoe cake cat dog

Table S4 A complete list of illustrations used for the image-based task design. Each child was randomly assigned one of three groups, while adults were given all three groups in a random order.

Phase

Image Version Instruction

Familiarization “First let’s try dragging the triangles down to the same box. After you put them in there, press the red button at the top. Make sure you put them inside the box.” Training “Now it’s time to learn where some other pictures go. Drag the top pictures down to one of the boxes. Press the red button when you are done.” Testing “Now you will see some new pictures and also some old ones. Try your best to put them into the right box. You can only choose one box to put them in.” Table S5 List of instructions used for the image-based version of the highlighting task.

HIGHLIGHTING IN EARLY CHILDHOOD SUPPLEMENT

3

I magegr oup1:

I magegr oup2:

I magegr oup3:

Figure S4 . Full sample of illustrated images used in the image-based version of the highlighting task. Refer to Table S4 for the corresponding cue symbols. Analysis The fully Bayesian hierarchical logistic regression model was fitted using Stan version 2.5 (Stan Development Team, 2014b) from within the R programming language (R Core Team, 2015; Stan Development Team, 2014a). Twelve chains were run in parallel. Each chain allowed for a warm-up period of 10,000 samples in order to tune and adapt the Hamiltonian sampling algorithm. Each chain collected 210,000 samples, and were thinned every 100 ˆ < 1.002. samples to reduce autocorrelation. All chains had a high effective sample size and R Even though convergence criteria were met for all chains, the two chains with the lowest average effective sample size were automatically dropped and the rest of the samples from each chain were merged. This ultimately led to a total of 20,000 posterior samples used for the analysis (2,000 from 10 chains). Model formulation. The outcome variable yi indicates the probability of success and is an estimate of the proportion of correct trials answered out of the total number of trials administered. This proportion can be estimated given indicators for the type of test cue and any interactions with the test cue, such as how the set of stimuli and the person’s age can interact with the probability of success for a particular type of testing item. The coefficients estimated for each testing cue are allowed to vary by subject. Fixed effects such as the main effect of stimulus set and the main effect of age are also estimated but do not vary by subject. Let j be the index for a specific person out of 33 participants, k be the indicator of a

HIGHLIGHTING IN EARLY CHILDHOOD SUPPLEMENT

4

−650

−550

−450

Trace of lp__

50000

100000 150000 Iterations (without warmup)

200000

Figure S5 . Traceplot of the samples over time for each chain. The figure displays the degree of mixing for the log probability of the model in Experiment 1. particular testing item out of the five possible test items (the test cue A.B is used as the baseline and folded into the intercept), and l be the indicator for one of the stimulus sets out of two possible sets of images (stimulus set 2 is used as the baseline and also folded into the intercept). The regression equation for the probability of success is as follows for i = 1, . . . , n, where n is the total number of observations in the data: Pr(yi = 1) = logit−1 (ˆ yi )

(1)

yˆ = α0,j + αk,j ∗ cuek + βl ∗ stiml,j + β3 ∗ agej The function logit−1 is the inverse-logit/sigmoid function which constrains values in log-odds scale to be in the range of 0 to 1. The varying intercept α0,j is defined by the following equation, which includes any interactions with the intercept: α0,j = γ0,0 + γ0,l ∗ stiml,j + γ0,3 ∗ agej

(2)

The additive effect for one of the other testing cues k is defined by the following (which equals zero if the effect of interest is only for cue A.B and stimulus set 2, the intercept): αk,j = γk,0 + γk,l ∗ stiml,j + γk,3 ∗ agej

(3)

The equation presented above may be reformulated using matrix notation for the parameters and data, which directly corresponds to the model code listed below and better illustrates the full extent of the model and estimated parameters. Let Y stand for the vector of successes out of N , the vector of the number of trials for each combination of stimulus set, testing cue type, and participant. The notation Yˆ will denote the predicted values in the unconstrained log-odds scale. Thus, the likelihood for the model is . . . Y ∼ Binomial(logit−1 (Yˆ ), N ) where . . . Yˆ = XB + ZA

HIGHLIGHTING IN EARLY CHILDHOOD SUPPLEMENT

5

The data matrix X is a 198 × 4 design/indicator matrix for the fixed effects which do not vary by subject. Let i be the index for one of the 198 rows of data (6 cue frequencies for each of the 33 participants). The stimulus set indicator variables made up of ones and zeros and are CW.DK for the cow/duck set, and CT.DG for the cat/dog set. 1 stimCW.DK 1  1 stimCW.DK 2 X= ..  .. . . CW.DK 1 stimi 

stimCT.DG age1 1 stimCT.DG age2   2 .. ..   . .  stimCT.DG agei i 

and the 4 × 1 vector of regression coefficients corresponding to these predictors is . . . 

βInt



 β   B =  CW.DK   βCT.DG  βage

The indicator matrix for the varying subject level coefficients is denoted as Z, a 198 × 198 or i × (j ∗ 6) matrix of ones and zeros (1 intercept indicator and 5 testing cue indicators for each subject), and the 1 × 198 vector A corresponds to the testing cue parameters that vary by subject accounting for the repeated measures for each type of cue. 

Zi,j,k

Int1,1 cue1,1,1 . . . cue1,1,k . . . Int1,j  Int2,1 cue2,1,1 . . . cue2,1,k . . . Int2,j = .. .. .. .. ..  .. . . . . .  . Inti,1 cuei,1,1 . . . cuei,1,k . . . Inti,j 



cue1,j,1 . . . cue1,j,k cue2,j,1 . . . cue2,j,k   .. ..  ..  . . .  cuei,j,1 . . . cuei,j,k 

α1Int = γ0,0 + γ0,1 ∗ stimCW.DK + γ0,2 ∗ stimCT.DG + γ0,3 ∗ age1   Cue  α1,1 = γ1,0 + γ1,1 ∗ stimCW.DK + γ1,2 ∗ stimCT.DG + γ1,3 ∗ age1    ..     .

 Cue  CW.DK + γ CT.DG + γ α  k,2 ∗ stim k,3 ∗ age1   1,k = γk,0 + γk,1 ∗ stim   ..   .   A =  Int   αj = γ0,0 + γ0,1 ∗ stimCW.DK + γ0,2 ∗ stimCT.DG + γ0,3 ∗ agej   Cue   αj,1 = γ1,0 + γ1,1 ∗ stimCW.DK + γ1,2 ∗ stimCT.DG + γ1,3 ∗ agej      ..   .    Cue  CW.DK + γ CT.DG + γ αj,k = γk,0 + γk,1 ∗ stim k,2 ∗ stim k,3 ∗ agej 

The vector of varying subject-level coefficients A was constructed by converting the 6 × 33 matrix form of A into a single column vector. Where Ak,j is determined by the matrix of interaction parameters Gk,l , which consists of regression coefficients that influence each cue type. G is multiplied by the group level matrix Ul,j , which denotes the indicators for stimulus type and values for age for each subject. A = GU

HIGHLIGHTING IN EARLY CHILDHOOD SUPPLEMENT

6

Where the subject-level intercepts and slopes that interact with each type of test cue are contained in the 6 × 4 matrix G . . . 



γ0,0 γ0,1 γ0,2 γ0,3   γ1,0 γ1,1 γ1,2 γ1,3   G= . .. .. ..   . . .   .. γk,0 γk,1 γk,2 γk,3 . . . and multiplied by the 4 × 33 subject-level design matrix U . 

1

1

stimCW.DK  1 U =  stimCT.DG 1 age1

stimCW.DK 2 stimCT.DG 2 age2



... 1  . . . stimCW.DK  j   . . . stimCT.DG j ... agej

Model priors. Normal distribution priors were used for all regression coefficients with a mean of zero. All standard deviations were obtained by assuming a weakly informative folded-t prior with ν = 5, µ = 0, and σ = 3, which are based on values in the unconstrained log-odds scale. A non-informative prior was used to model the participant-level correlation matrix, with the single parameter set to 1 and is uniform across all possible positive definite correlation matrices. The priors for all estimated parameters start with modeling the standard deviations for the regression coefficients. Each standard deviation is distributed as a folded-t distribution with the β (4) coefficients having standard deviations σ, the α (6) coefficients having standard deviations τ , and the γ (24) coefficients having standard deviations ζ. σ, τ, β ∼ Folded-t(ν = 5, µ = 0, σ = 3) The regression coefficients can then be estimated using the standard deviations given the following: β ∼ N (0, σ) α ∼ N (0, τ ) γ ∼ N (0, ζ) The 6 × 6 correlation matrix which models the dependencies between types of testing cues due to the same person generating responses from each cue was modeled using a uniform prior over all possible positive definite correlation matrices. ρ ∼ LKJcorr(ν = 1) Along with the standard deviation priors τ , the covariance matrix used to model the varying subject-level coefficients can be computed given the following: Σ = Diag(τ )ρDiag(τ ) And thus the varying coefficients for each subject can be modeled under the assumption of a multivariate normal distribution with mean vector GUj for subject j (corresponding to each of the α’s for subject j) and covariance matrix Σ. Aj ∼ N (GUj , Σ)

HIGHLIGHTING IN EARLY CHILDHOOD SUPPLEMENT

7

Condition

n

Mean Age

A

B

C

A.B

A.C

B.C

B>A A>B A=B

11 14 9

57.8 54.5 54.6

0.46 0.88 0.80

0.76 0.46 0.76

0.70 0.80 0.81

0.76 0.74 0.83

0.65 0.68 0.75

0.46 0.62 0.67

Table S6 Posterior means for each test item separated by the three conditions used in Figure 1c from Experiment 1 in the main text.

Phase

Avg. n A.B → X

Avg. n A.C → Y

Early

5.15

0

Mixed

3.18

1.06

Late

3.64

10.91

Table S7 Average number of training trials completed across all children for each phase and for each outcome from Experiment 1.

Int A.C B C A B.C

Int

A.C

B

C

A

B.C

1.21 -0.05 -0.04 0.03 -0.17 0.04

1.11 -0.06 0.07 0.12 0.44

1.49 -0.30 -0.47 -0.36

2.04 0.28 0.48

2.12 0.26

1.65

Table S8 Correlation matrix for the coefficients that vary by subject. The standard deviations for each parameter are shown in bold on the diagonal (logit scale).

HIGHLIGHTING IN EARLY CHILDHOOD SUPPLEMENT

8

Experiment 2 Supplement The following contains additional materials from the main text regarding Experiment 2, which was conducted with adults. See main text for further discussion of methods and analysis. The text-based stimuli layout is shown in Figure S6, list of words used for the text task in Table S9, instructions used for the text task in Table S10, text-based results in Table S11, and image-based results from in Table S12. Stimuli & Materials This section displays additional tables and figures regarding the names of stimuli used for the text-based version of the highlighting task, the layout of the experimental screen, and the instructions that were shown to the adult participants for the text-based task.

A1 B1

X1

X2

Y1

Y2

Figure S6 . Example of a selection of cues and referents for the text-based version of the task. The symbol for each type of cue is marked with a letter next to the text (not shown in the actual task). Paired cues are symptoms (red box) while referents are novel diseases (blue boxes). Only adults were administered the text-based task.

List of symptoms

List of diseases

earaches, yellow eyes, rash, dizziness, sore muscles, nausea, hair loss, coughing, fever

Burlosis, Namitis, Terrigitis, Coralgia, Gouphosis, Midosis, Althrax

Table S9 A complete list of words used in the text-based task design. Symptoms and diseases were assigned at random during task onset.

HIGHLIGHTING IN EARLY CHILDHOOD SUPPLEMENT

Phase

9

Text Version Instruction

Training “In this experiment you will see some common symptoms on the top of the computer screen and fictional diseases on the bottom of the screen. Your job is to learn which symptoms indicate which disease. When the symptoms are presented, you can make a guess by touching one of the diseases. You can press any of the diseases.” Testing “Now you will diagnose diseases based on previous symptoms, some combinations may be new. You will choose the appropriate disease based on the given symptom/s. Please make an informed choice. You will touch the disease on the screen to make your choice.” Table S10 List of instructions used for the text-based version of the highlighting task. Analysis The model implementation in terms of program used, type of model fitted, the number of samples collected, and convergence criteria were the same for Experiment 2 as described in Experiment 1. The major differences are the structure of the hierarchical regression equation and priors used for the adult results. Since adults performed both the image-based and text-based version of the highlighting task, each participant has repeated measurements for each combination of task type and cue type. We analyzed the accuracy of cues that were common to both types of highlighting tasks for comparison purposes, with task type (text vs. image), cue type (A.B, A.C, . . . , B.C), and the their interactions set as random factors that are allowed to vary by subject.

−1300

−1100

−900

Trace of lp__

0

50000

100000 150000 Iterations (without warmup)

200000

Figure S7 . Traceplot of the samples over time for each chain. The figure displays the degree of mixing for the log probability of the model in Experiment 2. Model formulation. As in the analysis for Experiment 1, let Y be a vector of the number of successes out of N trials. The probability of success is a binomial distribution with the likelihood expressed as the following: Y ∼ Binomial(logit−1 (Yˆ ), N )

HIGHLIGHTING IN EARLY CHILDHOOD SUPPLEMENT

10

Where the predicted values in the logit scale, Yˆ , are estimated by the equation:

Yˆi = Xi Aj

And where X is the n × k design matrix of indicators based on the main and interaction effects, task + cue + task ∗ cue, and A is the k × j matrix of coefficients which vary by person, with n = 396, k = 12, j = 33.





α1,1 α2,1 . . . αk,1   α1,2 α2,2 . . . αk,2  0  A = . .. ..  ..  . . .   .. α1,j α2,j . . . αk,j

Model priors. The subject-level regression coefficients from A are determined from the following set of k hyper-priors:

β ∼ N (µ, σ) σ ∼ Half-cauchy(0, 1) µ ∼ Uniform(−∞, ∞) + N (0, 1) ∗ σ

And the subject-level coefficients from A are distributed as multivariate-normal with a mean vector of β hyper-parameters and covariance matrix Σ.

Aj ∼ N (β1,...,k , Σ)

As in Experiment 1, the covariance matrix can be constructed given the standard deviations for each β and the correlation matrix ρ.

ρ ∼ LKJcorr(ν = 2) Σ = Diag(σ)ρDiag(σ)

HIGHLIGHTING IN EARLY CHILDHOOD SUPPLEMENT

Test item

X

Xo

Y

Yo

AB AC B C A BC

0.94 0.07 0.92 0.06 0.61 0.35

0.01 0.00 0.03 0.00 0.08 0.02

0.03 0.91 0.02 0.93 0.21 0.64

0.02 0.01 0.02 0.01 0.09 0.00

ABo ACo BBo CCo AAo BCo ABC ABo C ABCo ABo Co

0.17 0.09 0.33 0.03 0.35 0.29 0.37 0.05 0.46 0.26

0.69 0.05 0.65 0.03 0.29 0.03 0.03 0.21 0.02 0.07

0.10 0.11 0.00 0.56 0.17 0.02 0.58 0.72 0.04 0.60

0.04 0.76 0.02 0.38 0.18 0.66 0.02 0.02 0.48 0.07

11

Table S11 Adult accuracy given the text-based version of the highlighting task in Experiment 2. Proportions are displayed for each possible outcome given different cue combinations. The subscript o denotes that the cue belongs to the other set than what is being tested.

Test item AB AC B C A BC

Accuracy p(correct | cue) 0.99 0.96 0.97 0.99 0.64 0.68

Table S12 Adult accuracy for the image-based version of the highlighting task. Scores are in proportion correct (based on expected response) given each type of test cue administered.

HIGHLIGHTING IN EARLY CHILDHOOD SUPPLEMENT

Model code Experiment 1 STAN code The script used for Experiment 1 in Stan formatted code: data { int < lower =1 > nObs ; // n observations (198) int < lower =1 > nSubs ; // n participants (33) int < lower =1 > nAlpha ; // n cue type coef . (6) int < lower =1 > nBeta ; // n unmodeled coef . (4) int < lower =1 > nGamma ; // n interaction coef . (4) matrix [ nObs , nBeta ] X; // obs lvl design matrix matrix [ nGamma , nSubs ] U; // group lvl design matrix matrix [ nObs , nSubs * nAlpha ] Z ; // random design matrix int Yi [ nObs ]; // vector of correct freq . int Yn [ nObs ]; // vector of n trials } t r a n s f o r m e d data { // none } parameters { vector < lower =0 >[ nBeta ] sigma_fixed ; // priors for unmodeled std . dev . vector < lower =0 >[ nAlpha ] tau_cue ; // priors for cue type std . dev . vector < lower =0 >[ nGamma ] zeta_intxn ; // priors for interaction std . dev . vector [ nBeta ] Beta ; // priors for fixed coef . matrix [ nAlpha , nGamma ] Gamma ; // priors for grp lvl intxn . matrix [ nAlpha , nSubs ] Alpha ; // priors for varying grp lvl coef . c h o l e s k y _ f a c t o r _ c o r r [ nAlpha ] Rho_chol ; // prior for cue type corr . matrix } transformed parameters { c h o l e s k y _ f a c t o r _ c o v [ nAlpha ] Sigma_chol ; // cholesky factored covariance matrix corr_matrix [ nAlpha ] Rho ; // correlation matrix // transform cholesky corr . matrix Rho <- tcrossprod ( Rho_chol ); Sigma_chol <- c h o l e s k y _ d e c o m p o s e ( diag_matrix ( tau_cue ) * Rho * ); } model { vector [ nObs ] y_hat ; matrix [ nAlpha , nSubs ] alpha_hat ;

to cholesky cov . matrix

diag_matrix ( tau_cue )

// predicted values on logit scale // temp . matrix

// folded - t priors with df =5 , mean =0 , sd =3 sigma_fixed ~ student_t (5.0 , 0.0 , 3.0); zeta_intxn ~ student_t (5.0 , 0.0 , 3.0); tau_cue ~ student_t (5.0 , 0.0 , 3.0); // normal priors for regression coefficients , mean =0 Beta ~ normal (0 , sigma_fixed ); for ( k in 1: nAlpha ) { for ( l in 1: nGamma ) { Gamma [k , l ] ~ normal (0 , zeta_intxn [ l ]); } } // uniform prior across pos . def . corr . matrices Rho_chol ~ l k j _ c o r r _ c h o l e s k y (1.0); // restructure : alpha matrix , varying subj . coef . alpha_hat <- Gamma * U ; // multivariate normal prior for subject level coefficients

12

HIGHLIGHTING IN EARLY CHILDHOOD SUPPLEMENT

13

for ( j in 1: nSubs ) col ( Alpha , j ) ~ m u l t i _ n o r m a l _ c h o l e s k y ( col ( alpha_hat , j ) , Sigma_chol ); // compute predicted values , logit scale y_hat <- X * Beta + Z * to_vector ( Alpha ); // binomial likelihood for ( i in 1: nObs ) Yi [ i ] ~ binomial _logit ( Yn [ i ] , y_hat [ i ]); } generated quantities { vector [ nAlpha ] Alpha_mu ; // means for cue type coef . for ( k in 1: nAlpha ) Alpha_mu [ k ] <- mean ( Alpha [ k ]); }

Experiment 2 STAN code The script used for Experiment 2 in Stan formatted code: data { int < lower =1 > nObs ; // n observations (396) int < lower =1 > nSubs ; // n participants (33) int < lower =1 > nBeta ; // n coef . (12) matrix [ nObs , nBeta ] X ; // design matrix int Yi [ nObs ]; // vector of sucesses int Yn [ nObs ]; // vector of n trials int sid [ nObs ]; // subject index } parameters { vector < lower =0 >[ nBeta ] beta_sigma ; // hyper - prior for std . dev . of beta vector [ nBeta ] beta_mu_raw ; // hyper - prior for unscaled beta means real beta_adj ; // hyper - prior for beta adjust , uniform vector [ nBeta ] beta_sub [ nSubs ]; // prior for sub . lvl beta coef . c h o l e s k y _ f a c t o r _ c o r r [ nBeta ] rho_chol ; // prior for beta coef . correlations } transformed parameters { c h o l e s k y _ f a c t o r _ c o v [ nBeta ] sigma_chol ; // cholesky factored covariance matrix corr_matrix [ nBeta ] rho ; // regular correlation matrix vector [ nBeta ] beta_mu ; // hyper - prior for means of beta coef . // to avoid stuck chain segments beta_mu <- beta_adj + beta_mu_raw .* beta_sigma ; // transform cholesky corr . matrix to cholesky cov . matrix rho <- tcrossprod ( rho_chol ); sigma_chol <- c h o l e s k y _ d e c o m p o s e ( diag_matrix ( beta_sigma ) * rho * diag_matrix ( beta_sigma )); } model { beta_mu_raw ~ normal (0 , 1); // standard normal prior beta_sigma ~ cauchy (0 , 1); // half - cauchy prior rho_chol ~ l k j _ c o r r _ c h o l e s k y (2); // diag . prior corr . matrix // multivariate normal prior for subject level coefficients for ( j in 1: nSubs ) beta_sub [ j ] ~ m u l t i _ n o r m a l _ c h o l e s k y ( beta_mu , sigma_chol ); // binomial likelihood for ( i in 1: nObs ) Yi [ i ] ~ binomial _logit ( Yn [ i ] , X [ i ] * beta_sub [ sid [ i ]]); }

HIGHLIGHTING IN EARLY CHILDHOOD SUPPLEMENT

14

generated quantities { vector [ nObs ] log_lik ; // log - likelihood for each observation for WAIC for ( i in 1: nObs ) { log_lik [ i ] <- b i n o m i a l _ l o g i t _ l o g ( Yi [ i ] , Yn [ i ] , X [ i ] * beta_sub [ sid [ i ]] ); } }

References R Core Team. (2015). R: a language and environment for statistical computing. R Foundation for Statistical Computing. Vienna, Austria. Stan Development Team. (2014a). Rstan: The R interface to Stan, version 2.5.0. Stan Development Team. (2014b). Stan: A C++ library for probability and sampling, version 2.5.0.

Early Childhood Teacher Credential (ECTC) - CT Early Childhood ...

Theoretical perspectives in early childhood education.pdf ...

Developmentally Appropriate Practice in Early Childhood Programs ...

Fostering Emotional Development in Early Childhood Education

Early childhood education.pdf