Phonological categories in infant-directed speech: Supplementary Materials

Alejandrina Cristia∗

August 18, 2011

Abstract

I report on supplementary analyses to the manuscript entitled “Phonological categories in infant-directed speech”, which documented that tenseness contrasts are not enhanced in infant-directed speech (IDS) as compared to adult-directed speech (ADS). That tenseness is not enhanced (and is, on the contrary, sometimes deteriorated) also holds for a non-parametric measure of separation and for Mahalanobis distances (Section 1). In addition, similar results ensue when each acoustic dimension is analyzed separately (Section 2). The lack of enhancement in our sample appears to be partially due to an increase in within-category variance, which is documented in Section 3.

1 Additional distance measures

The measures reported in the main paper are basic: the raw distance connects the current findings with those documented in previous research on IDS, while D(a) is more elaborate and possibly a better predictor of adults' performance [2]. However, it is possible that means and variances are not the best way to capture the present acoustic cue distributions, which need not be normal. Therefore, I recalculated distances using a non-parametric version of D(a), namely the difference between medians divided by the average of the interquartile ranges of the two categories. In addition, most previous work was based on 1 or 2 dimensions, whereas here we used all of the dimensions that have been shown to be perceptually relevant, although they are all on different scales and they are not all independent. In the main paper, we got around the scaling problem by z-scoring, but covariance across dimensions remains a potential issue; Mahalanobis distances, which take this covariance into account, were therefore also calculated. Results for these two distances are shown in Figure 1. In no case is the red box (IDS distances) higher than the blue box (ADS distances). In contrast, significantly higher distances for ADS than IDS are found in /i-I/ in both age groups, with a trend in the same direction for /e-E/ and /æ-æ̃/ for the non-parametric distance.

∗ This work was supported by funds from NSF 0843959 to Amanda Seidl. Amanda Seidl and Kristine Onishi designed the elicitation material, collected the recordings, and supervised the coders. I thank Titia Benders for stimulating comments on the main manuscript, which inspired me to expand the analyses here, and Amanda for comments on this document. Please email comments to [email protected].

[Figure 1: two boxplot panels, "Non-parametric distance" (top) and "Mahalanobis distance" (bottom); y-axis from 0 to 6; x-axis groups e, i, an, en, each split by age group (4mo, 11mo).]

Figure 1: Non-parametric (top) and Mahalanobis (bottom) distance by contrast, age group, and register (blue=ADS, red=IDS).
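The two distances described in this section can be sketched as follows. This is a minimal illustration under stated assumptions, not the code used for the analyses: the function names are hypothetical, the quartile method is an arbitrary choice, the non-parametric distance is shown for a single acoustic dimension, and the Mahalanobis distance is restricted to two dimensions with the covariance taken as the plain average of the two within-category covariance matrices (the text does not specify how the covariance was estimated).

```python
from math import sqrt
from statistics import mean, median, quantiles


def iqr(values):
    """Interquartile range (Q3 - Q1); the quartile method is an assumption."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    return q3 - q1


def nonparametric_distance(cat_a, cat_b):
    """Absolute difference between medians over the average IQR (one dimension)."""
    return abs(median(cat_a) - median(cat_b)) / mean([iqr(cat_a), iqr(cat_b)])


def covariance_2d(points):
    """Sample variances and covariance (sxx, sxy, syy) of a list of 2-D points."""
    n = len(points)
    mx = mean(p[0] for p in points)
    my = mean(p[1] for p in points)
    sxx = sum((p[0] - mx) ** 2 for p in points) / (n - 1)
    syy = sum((p[1] - my) ** 2 for p in points) / (n - 1)
    sxy = sum((p[0] - mx) * (p[1] - my) for p in points) / (n - 1)
    return sxx, sxy, syy


def mahalanobis_2d(cat_a, cat_b):
    """Mahalanobis distance between two category means in two dimensions.

    Assumption: the covariance matrix S is the plain average of the two
    within-category covariance matrices.
    """
    (axx, axy, ayy), (bxx, bxy, byy) = covariance_2d(cat_a), covariance_2d(cat_b)
    sxx, sxy, syy = (axx + bxx) / 2, (axy + bxy) / 2, (ayy + byy) / 2
    dx = mean(p[0] for p in cat_a) - mean(p[0] for p in cat_b)
    dy = mean(p[1] for p in cat_a) - mean(p[1] for p in cat_b)
    det = sxx * syy - sxy ** 2
    # d^2 = diff^T * inverse(S) * diff, written out for a 2x2 matrix S
    d2 = (dx * dx * syy - 2 * dx * dy * sxy + dy * dy * sxx) / det
    return sqrt(d2)
```

Because the Mahalanobis distance divides by the covariance, it does not require prior z-scoring and it discounts separation along directions in which the cues covary within a category.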

2 Distances in each acoustic dimension separately

It is usually the case that listeners do not treat all cues equally; rather, one subset of the acoustic dimensions is perceptually more important than another. For example, /sa/ and /Sa/ differ both in the frequency distribution of the frication noise and in the F2 found early in the following vowel, but the former may be more important for adult listeners [3]. The cues used to represent tenseness in the present study were chosen because previous research shows that English listeners use them (see the main paper for citations). Nonetheless, if talkers do weight some cues more heavily than others, it is likely that they will enhance primarily these highly weighted cues [1]. In this case, the distances measured will still reflect the expansion, since expansion is additive (not averaged). However, one can postulate a variant of the hyperspeech hypothesis by which talkers enhance the heavily weighted cues and deteriorate cues that have low weighting, in order to bias infants' attention away from the latter. In this case, the combined distances would not show enhancement, since distances along some dimensions would decrease while others increase. To deal with this possibility, I recalculated distances over each dimension separately. An additional advantage of this analysis is that there is no need for z-scoring, since the dimensions are not combined. As reported below, results are basically the same. All of the boxplots for these analyses are provided online in a separate pdf.

To make this more specific, if this interpretation is correct, one should find compensation, such that the average distance for IDS is higher than that for ADS in one dimension, but the opposite occurs for one or more of the other dimensions. For the raw measure in /e-E/, F1 and F2 at both points in the vowel show higher distances in IDS than ADS in the 11mo group; no change is clear in the 4mo group, or in duration for the 11mo group. Also in terms of the raw measure, /i-I/ are further apart in IDS along F1 at both points and F2 at 40% for the 11mo group; the other age group and the other dimensions do not exhibit large differences between the registers. Therefore, there is no compensation in terms of the raw distances.

As for the normalized distance D(a), the average distance between /e-E/ is significantly lower in IDS than in ADS for F2 at the 40% measurement point in both age groups, and for duration in the 4mo group; other dimensions show no significant changes. For /i-I/, F1 at both points and F2 at 40% show significantly lower distances for IDS than ADS; the other cases do not show any significant changes. Here as well, there is no evidence of compensation: changes along different dimensions always pattern in the same direction, which is the same direction evidenced by the composite measures in the main paper.
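The compensation check described above can be made concrete with a short sketch. It is only an illustration: the function names, the dimension labels, and the numbers are hypothetical, and the comparison uses a simple tolerance on the difference between registers, whereas the text relies on significance tests.

```python
def direction(ids_distance, ads_distance, tolerance=0.0):
    """Return +1 if IDS > ADS, -1 if IDS < ADS, 0 if within the tolerance."""
    diff = ids_distance - ads_distance
    if diff > tolerance:
        return 1
    if diff < -tolerance:
        return -1
    return 0


def shows_compensation(per_dimension, tolerance=0.0):
    """True if at least one dimension expands in IDS while another shrinks."""
    signs = {direction(ids, ads, tolerance) for ids, ads in per_dimension.values()}
    return 1 in signs and -1 in signs


# Hypothetical per-dimension distances as (IDS, ADS); not values from the study.
same_direction = {"F1": (1.2, 1.6), "F2": (0.9, 1.4), "duration": (1.0, 1.0)}
mixed = {"F1": (1.8, 1.2), "duration": (0.6, 1.1)}

print(shows_compensation(same_direction))  # no dimension is larger in IDS here
print(shows_compensation(mixed))           # one dimension larger, one smaller in IDS
```

Under the variant of the hyperspeech hypothesis considered here, one would expect the second pattern; the results reported in this section correspond to the first.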

3 Possible increase in variability

To directly assess whether vowels are acoustically more variable in speech addressed to infants, I calculated the average standard deviation along the z-scored identifying acoustic dimensions within each talker, for each of the 8 vowel types elicited (2 tense, 2 lax, 2 nasal, 2 oral). The number of talkers who could be included is listed in Table 1; they all had at least 8 tokens of the vowel in IDS and another 8 or more in ADS. As shown in Figure 2, for most of the vowels, more than half of the talkers had larger within-category variability in IDS than ADS, with the overall average being 70%. The proportion is somewhat higher in Tenseness (78%) than Nasality (62%) categories, but this difference was driven by /E-Ẽ/. Thus, in these analyses, /æ-æ̃/ seems to pattern with the tenseness contrasts.¹

Finally, to ensure that these effects were not due to more tokens, or more diverse types, having been uttered during the IDS than the ADS portion, caregivers who produced the exact same number of tokens in IDS and ADS for a given type were selected, and the variability was calculated for each caregiver, type, and register. One hundred and four caregiver-words could thus be matched, and greater variability was found in infant- than in adult-directed speech both in binary classifications [66 out of 104 cases had greater variability in IDS, p = .004] and continuous measures [average variability was greater in IDS, t(103) = 2.85, p = .005]. Figure 3 illustrates this with the instantiations of the word “beetle” by 5 different caregivers who spoke this word 3 or more times.

Table 1: Number of caregivers that could be included in the calculation of the variability of each vowel.

Contrast    Vowel   4mo   11mo
Tenseness   /e/     23    15
Tenseness   /E/     19    13
Tenseness   /i/     21    15
Tenseness   /I/     23    12
Nasality    /E/     17    13
Nasality    /Ẽ/     19    14
Nasality    /æ/     22    15
Nasality    /æ̃/     24    15
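The variability measure and the two comparisons reported in this section can be sketched as follows. This is an illustration under assumptions rather than the analysis code: the function names are hypothetical, the binary comparison is written as an exact one-sided sign test, and the continuous comparison as a paired t statistic; the text does not state which exact tests were used or whether they were one- or two-sided.

```python
from math import comb, sqrt
from statistics import mean, stdev


def within_category_variability(tokens):
    """Average, over dimensions, of the SD of one talker's tokens of one vowel.

    `tokens` is a list of equal-length tuples of z-scored acoustic values.
    """
    dimensions = list(zip(*tokens))
    return mean(stdev(dim) for dim in dimensions)


def proportion_more_variable(pairs):
    """Share of (IDS, ADS) variability pairs in which IDS is larger."""
    return sum(ids > ads for ids, ads in pairs) / len(pairs)


def sign_test_one_sided(successes, n):
    """Exact binomial probability of at least `successes` out of n at p = 0.5."""
    return sum(comb(n, k) for k in range(successes, n + 1)) / 2 ** n


def paired_t(pairs):
    """Paired t statistic and degrees of freedom for IDS minus ADS differences."""
    diffs = [ids - ads for ids, ads in pairs]
    t = mean(diffs) / (stdev(diffs) / sqrt(len(diffs)))
    return t, len(diffs) - 1


# Counts taken from the text: 66 of 104 matched caregiver-words
# were more variable in IDS than in ADS.
p_binary = sign_test_one_sided(66, 104)
```

Each (IDS, ADS) pair would hold the output of `within_category_variability` for one caregiver and word type in the two registers, so that the same pairs feed both the binary and the continuous comparison.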

References

[1] Kyoung-Ho Kang and Susan G. Guion. Clear speech production of Korean stops: Changing phonetic targets and enhancement strategies. The Journal of the Acoustical Society of America, 124:3909–3917, 2008.
[2] Rochelle S. Newman, S. A. Clouse, and J. Burnham. The perceptual consequences of acoustic variability in fricative production within and across talkers. The Journal of the Acoustical Society of America, 109:3697–3709, 2001.
[3] Susan Nittrouer. Learning to perceive speech: How fricative perception changes, and how it stays the same. The Journal of the Acoustical Society of America, 112:711–719, 2002.

¹ I find this particularly interesting because, perceptually and acoustically, /æ/ undergoes a much larger quality change when nasalized: many non-native listeners (e.g., French) report hearing a nasalized /E/, and this quality change was also evidenced in this corpus; see the supplementary analyses to the English and French comparison on the project website.


Figure 2: Proportion of caregivers for whom a given category was more variable in IDS than ADS.

[Figure 3: two scatterplot panels, "ADS beetle" and "IDS beetle"; x-axis: F2 (Hz), with ticks at 3200, 2800, and 2400; y-axis: F1 (Hz), with ticks from 250 to 450.]

Figure 3: Illustration of the increase in acoustic instantiation found in IDS as compared to ADS. Each shape and color represents an individual caregiver, and each point a token spoken by that caregiver in the relevant register.
