University of Stockholm
Institute of Linguistics
PERILUS XIV
PERILUS mainly contains reports on current experimental work carried out in the Phonetics Laboratory at the University of Stockholm. Copies are available from the Institute of Linguis
tics, University of Stockholm, S-106 91 Stockholm, Sweden.
This issue of PERILUS was edited by aile Engstrand and Catharina Kylander.
Institute of Linguistics University of Stockholm S-10691 Stockholm
Telephone: 08-162 347
(+468 1623 47, international)
Telefax: 08-155389
(+468 1553 89, international) Telex/Teletex: 8105199 Univers
(c) 1991 The authors
ISSN
0282-6690This volume of PER ILUS presents brief summaries of the papers presented at Current Phonetic Research Paradigms: Implications for Speech Motor Control, a symposium organized at Stockholm Uni
v
ers
it
y during August 1 3-16, 1991.In taking this initiative we believe that we responded to a current, rather widely felt need for bringing together proponents of a number of different theoretical viewpoints and for facilitating cross-paradigm interaction. The
International Congress of Phonetic Sciences which took place immediately after the Stockholm event, appeared to provide ample evidence for that need. It brought it very much to the fore, perhaps more than ever before. It showed clearly that our discipline is right now
characterized by a great deal of diversity.
Do phoneticians share a sense of common direction despite the current breadth of their subject matter? Will a common research agenda
naturally
grow outo
fth
ev
ari
eg
at
edspectrum
oftopics and theoretical perspectives now presented
tous? Are the practical applications of phonetics and its theoretical development best served by leaving it alone as it goes through its unsupervised process of "mosaic evolution"?
Should the driving force be whatever research happens to promote its short-term academic survival and prosperity? Or does progress in phonetics presuppose interfering with that process, raising to one's awareness and acknowledging that the unification of the field is an absolute prerequisite for reaching both theoretical and applied goals?
Debating such questions is essential to the future role of phonetics, not only in Sweden but also internationally. They are big, hard, but important questions that must be addressed by all concerned whenever there is a chance to meet.
We are grateful to our sponosr, The Swedish Council for Research in the Social Sciences, SFR, for providing us with the opportunity to do so during the Stockholm symposium.
We also thank all participants for all their contributions to the meeting - and to the long-term cross-paradigm agenda of phonetics.
Olle Engstrand Bjorn Lindblom
i
i
iv
Contents
The phonetics laboratory group ... ix
Current projects and grants ... xi
Previous issues of PER ILUS ... .xiii
Participants in alphabetic order ... xxiii
Does increasing representational complexity lead to more speech variability?
... . . . .. . . .. . . ... . . 1Christian Abry & Tahar Lallouache
Some cross language aspects of co-articulation
...
. . . ..
. . . .. . .. . . ..
. . . ..
.7
Robert McAllister & Olle Engstrand
Coarticulation and reduction in consonants
:comparing isolated words and continuous speech
......... ........
. . . . ... . . ....... 11Edda F arnetani
Trading relations between tongue-body raising and lip rounding in production of the vowel luI
........ . ... ..... ... . ..... . ....... . . . ... ..
...... ..... . ... . . ... 1 7 Joseph S. Perkell, Mario A. Svirsky, Melanie L. Matthies & Michael I. JordanTongue
-jaw interactions in linugal consonants .
. . ..
. . ..
. . . ..
. . ..
....
. . . 21B Kuhnert, C Ledl, P Hoole & H G Tillmann
Discrete and continous modes in speech motor control
. . . 27Anders Lofqvist & Vincent L. Gracco
Paths and trajectories in orofacial motion
. . . .. ..
. . . ....... ..
. . ..
............
. . . 35DJ. Os try, K.G. Munhall, JR. Flanagan & A.S Bregman
Articulatory control in stop consonant clusters
. . . .... . .... . . .... . . ....... .... . . . 39Daniel Recasens, Jordi Fontdevila & Maria Dolors Pallares
Dynamics of intergestural timing ... .47
E. Saltzman, B. Kay, P. Rubin & J. Kinsella-Shaw
Modelling the speaker-listnener interaction in a quantitative model for speech motor control: a framework and some preliminary results
. . . ... 57Rafael Laboissiere, Jean-Luc Schwartz & Gerard Bailly
CONTENTS
Neural network modelling of speech motor control using physiological data
... ... . . . 63Eric Vatikiotis-Bateson, Makoto Hirayama & Mitsuo Kawato
Movement paths: different phonetic contexts and different speaking
styles
... . . . . .. . . ... 69 Celia Scully, Esther Grabe-Georges & Pierre BadinSpeech production. From acoustic tubes to the central representation
. . . 75 Rene Carre & Mohamed MrayatiOn articulatory and acoustic varibilities: implications for speech motor control
............. .... . ........... . . ......... ............ 79Shinji Maeda
Speech perception based on acoustic landmarks: implications for speech
production ................................... ....................... . . . ..... . . B3 Kenneth N. Stevens
An investigation of locus equ
at
io
ns as a source of relational invariance for stop place categorization
. . .... . .... . .... . ........ ........ . . ....... 89Harvey M. Sussman
A
first reporton conso
nant
underarticulation in spontaneous speech in French
........... .................................... . ............ . ....... .93Danielle Duez
Temporal variability and the speed of time's flow
..... .......... 99 Gerald D. LameProsodic segmentation of recorded speech
. . ............. . . . .... ..... . .... 1 01 W.N. CampbellRhythmical -
in
what sense?Some preliminary considerations
........ . . 1 07 Lennart NordFocus and
phonological re
duc
tio
n ... 1 1 3 Linda ShockeyRecovery of "deleted" schwa
...... . .................................. . 1 1 5 Sharon Y. ManuelInvariant auditory patterns in speech processing: an explanation for
normalization
... 1 1 9 Natalie WatersonCONTENTS
vii
Function and limits of the F1 :FO covaration in speech
. . . ...
. ....
. . . ...
. 1 25 H artmut TraunmiUlerPsychoacoustic complementarity and the dynamics of speech perception and production
. ...
.....
....
. ..
..
......
. . . . ..
. . . . ...
. . ...
. ..
. . . ..
. . .....
.. ...
. . ..
. . . . ...
. ..
.....
...
. . 1 31Keith R. Kluender
How the listener can deduce the speaker's intended pronunciation
..
..
. . 137 JohnJ.OhaiaPhonetic covariation as auditory enhancement: the case of the
[+voice]/[-vocie] distinction
.. . ... . ... ..
...
....
... . ....
....
....
......
..
.. . .... . . . ...
1 39 Randy L. Diehl & John KingstonCognitive-auditory constraints on articulatory reduction
. . ..
..
. ...
. . . ...
... 1 45 Klaus J. KohlerWords are produced in order to be perceived: the listener in the
speaker's mind
... . . .. . . ..
... .. . ... . . .....
..
.. ..
....
...
. . ..
. . ...
... ...
1 49 Sieb G. NooteboomAn acoustic and percetpual study of undershoot in clear and citation-form speech
. . . .. . . .. . . 1 53Seung-Jae Moon
Phonetics of baby tal speech: implications for infant speech
perception
. . . .. . . .. . . ....
. . . . ... . . ... 157 Barbara DavisUse of the sound space in early speech
...
....
....
..
. . . ..
....
....
. 1 59 Peter F. MacNei/ageThe emergence of phonological organization ... .... ... 161
M.M. Vihman & L. Roug-Hellichius
In defense of the Motor Theory
. . . ... . .. . . .. . . . ..
. . . .. . ..
. . . .. . . ..
... 167 Ignatius G. MattinglyLearning to talk
... 173 Michael Studdert-KennedyCONTENTS
The phonetics laboratory group
Ann-Marie Alme Robert Bannert Peter Branderud
Una Cunningham-Andersson Hassan Djamshidpey
Mats Duiberg Ahmed Elgendi
Olle Engstrand Garda Ericsson
1Anders Eriksson2 Ake Floren
Eva Holmberg3 Bo KasslingAmalia Khatchaturian 4 Diana Krull
Catharina Kylander Francisco Lacerda
Ingrid Landberg
B jam Lindblom5
Rolf Lindgren James Lubker6 Berti! Lyberg7 Robert McAllister Lennart Nord8Lennart Nordstrand9 Lise-Lotte Roug-Hellichius Richard Schulman
Johan Stark
Ulla Sundberg Gunilla Thunberg
Hartmut Traunmiiller Karen WilliamsEva Oberg
Ann-Marie OsterlO
1 Also Department of Phoniatrics, University Hospital, Linkoping 2 Also Department of Linguistics, University of Gothenburg
3 Also Research Laboratory of Electronics, MIT, Cambridge, MA, USA
ix
4 Visiting from Institute of Language, USSR Academy of Sciences, Yerevan, Armenia 5 Also Department of Linguistics, University of Texas at Austin, Austin, Texas, USA 6 Also Department of Communication Science and Disorders, University of Vermont,
Burlington, Vermont, USA
7 Also Telia Research AB, Stockholm
8 Also Department of Speech Communication and Music Acoustics, Royal Institute of Technology (KTH), Stockholm
9 Also AB Consonant, Uppsala
10 Also Department of Speech Communication and Music Acoustics, Royal Institute of Technology (KTH), Stockholm
Current projects and grants
Speech transforms - an acoustic data base and computational rules for Swedish phonetics and phonology
xi
Supported by: The Swedish Board for Technical Development (NUTEK) and The Swedish Council for Research in the Humanities and Social Sciences (HSFR), grant 89-00274P to One Engstrand
Project group: OUe Engstrand, Bjorn Lindblom, and Rolf LindgrenSpeech after glossectomy
Supported by: The Swedish Cancer Society, grant RMC901556 One Engstrand;
The Swedish Council for Planning and Coordination of Research (FRN), grant 900116:2 A 15-5/47 to One Engstrand
Project group: Ann- Marie Alme, Olle Engstrand, and Eva Oberg
The measurement of speech comprehension
Supported by: The Swedish Council for Research in the Humanities and Social Sciences (HSFR), grant F423/90 to Robert McAllister
Project group: Mats Dufberg , Robert McAllister, and Gunilla Thunberg
Articulatory-acoustic correlations in coarticulatory processes: a crOSS-language investigation
Supported by: The Swedish Board for Technical Development (NUTEK), grant 89-00275P to Olle Engstrand; ESPRIT: Basic Research Action, AI and Cognitive Science: Speech
Project group: OUe Engstrand and Robert McAllister
An ontogentic study of infants' perception of speechSupported by: The Tercentenary Foundation of the Bank of Sweden (RJ), grant 90/150: 1 to Francisco Lacerda
Project group: Francisco Lacerda, Bjorn Lindblom, and Ulla Sundberg; Goran Aurelius (S:t Gorans Children's Hospital).
Typological studies of phonetic systems
Supported by: The Swedish Council for Research in the Humanities and Social Sciences (HSFR), grant F421/90 to Bjorn Lindblom
Project group: Olle Engstrand, Diana Krull, and Bjorn Lindblom
Sociodialectal perception from an immigrant perspective
Supported by: The Swedish Council for Research in the Humanities and Social Sciences (HSFR), grant F420/90 to OBe Engstrand
Project group: Una Cunningham-Andersson and OBe Engstrand
Early language-specific sound development
Supported by: The Swedish Council for Research in the Humanities and Social Sciences (HSFR), grant F615/91 to OBe Engstrand
Project group: OBe Engstrand, Catharina Kylander, Francisco Lacerda, Lise
Lotte Roug-Hellichius and Karen Williams
Paralinguistic variation in speech and speech technology
Supported by: The Swedish Council for Research in the Humanities and Social Sciences (HSFR), grant F682/91 to Hartmut Traunmiiller
Project group: Hartmut TraunmiiBer
PROJECTS AND GRANTS
Previous issues of PERILUS
PERILUS I, 1978-1979 1. Introduction
Bjorn Lindblom and James Lubker
xiii
2. Some Issues In research on the perception of steady-state
vowels Vowel identification and spectral slope
Eva Age/fors and Mary Graslund
Why does [a] change to [0] when Fa is increased? Interplay between
harmonic structure and formant frequency
in the perception
of vowelquality Ake Floren
Analysis and
prediction of difference limen data for formant frequ
en
ci
es Lennart Nord and Eva Sventelius
Vowel
identification as a function of increasing fundamental frequency Elisabeth Tenenholtz
Essentials of a psychoacoustic model
of spectral matching Hartmut Traunmilller
3.
On the perceptual role of dynamic features in the speech signal
Interactionbetween spectral and durational cues in Swedish
vowel contrastsAnette Bishop and Gunilla Edlund
On the distribution of [h]
in
thelanguages of the world: is the rarity
of syllable final [h] dueto an asymmetry of backward and forward masking?
Eva Holmberg and Alan Gibson
On the function of formant transitions:
I. Formant frequency
target
vs.rate of change in vowel identification I I.
Perception of steady vs. dynamic vowel sounds in noiseKarin Holmgren
Artificiall y clipped syllables and the role of formant transitions in consonant perception
Hartmut Traunmilller
4. Prosody and top down processing
The importance of timing and fundamental frequency contour information in the perception of prosodic categories
Berti! Lyberg
Speech perception in noise and the evaluation of l
ang
uage
proficiencyAlan
C.Sheats
5. BLOD -A block diagram simulator
Peter Branderud
PERILUS
II,
1979-1980 IntroductionJames Lubker
A study of anticipatory labial coarticulation
in the
speech of childrenAsa Berlin, Ingrid Landberg and Lilian Persson
Rapid reproduction of vowel-vowel sequences by children
Ake Floren
Production of bite-block vowels
by children Alan Gibson and Lorrane McPhearson
Laryngeal airway resistance as
a function of phonation type Eva Holmberg
The declination effect in Swedish
Diana Krull and Siv Wandeback
Compensatory articulation by deaf speakers
Richard Schulman
Neural and mechanical response time in the speech of cerebral palsied subjects
Elisabeth Tenenholtz
An acoustic investigation of production of plosives
by cleft
palate
speakers
Garda Ericsson
PERILUS III, 1982-1983 Introduction
Bjorn Lindblom
Elicitation and perceptual judgement of disfluency and stuttering
Anne-Marie Alme
PREVIOUS ISSUES
Intelligibility vs. redundancy - conditions of dependency
Sheri Hunnicut
The role of vowel context on the perception of place of articulation for stops
Diana Krull
Vowel categorization by the bilingual listener
Richard Schulman
Comprehension of foreign accents.
(A
Cryptic investigation.)Richard Schulman and Maria Wingstedt
Syntetiskt tal som hjalpmedel vid korrektion av dovas tal
Anne-Marie Oster
PERILUS IV, 1984-1985 Introduction
Bjorn Lindblom
Labial coarticulation in stutterers and normal speakers
Ann-Marie Alme
Movetrack
Peter Branderud
Some evidence on rhythmic patterns of spoken French
Danielle Duez and Yukihoro Nishinuma
xv
On the relation between the acoustic properties of Swedish voiced stops and their perceptual processing
Diana Krull
Descriptive acoustic studies for the synthesis of spoken Swedish
Francisco Lacerda
Frequency discrimination as a function of stimulus onset characteristics
Francisco Lacerda
Speaker-listener interaction and phonetic variation
Bjorn Lindblom and Rolf Lindgren
Articulatory targeting and perceptual consistency of loud speech
Richard Schulman
The role of the fundamental and the higher formants in the perception of speaker size, vocal effort, and vowel openness
Hartmut TraunmtJller
PREVIOUS ISSUES
PERILUS
V, 1986-1987About the computer-lab
Peter Branderud
Adaptive variability and absolute constancy in speech signals: two themes in the quest for phonetic invariance
Bjorn
LindblomArticulatory dynamics of loud and normal
speech Richard SchulmanAn experiment on the cues to the identification of fricatives
Hartmut Traunmilller and Diana Krull
Second formant
locus patterns as a measure of consonant-vowel
coarticulation
Diana Krull
Exploring discourse intonation in Swedish
Madeleine Wulffson
Why two labialization strategies in Setswana?
Mats Oufberg
Phonetic development in early infancy - a study of four Swedish children during the first 1 8 months of life
Liselotte Roug, Ingrid Landberg and Lars
Johan Lundberg A simple computerized response collection systemJohan Stark and Mats
DufbergExperiments with technical aids in pronunciation teaching
Robert McAllister, Mats Oufberg and Maria Wallius
PERILUS VI, FALL 1987
Effects of peripheral auditory adaptation on the discrimination of speech sounds (Ph.D. thesis)
Francisco Lacerda
PERILUS VII, MAY 1988
Acoustic properties as predictors of perceptual responses:
a
study of Swedish voiced stops(Ph.D. thesis) Diana Krull
PERILUS VIII, 1988
Some remarks on the origin of the "phonetic code"
Bjorn Lindblom
PREVIOUS ISSUES
Formant undershoot in clear and citation form speech
Bjorn Lindblom and Seung-Jae Moon
On the systematicity of phonetic variation in spontaneous
speech aile Engstrand and Diana KrullDiscontinuous variation in spontaneous speech
aile Engstrand and Diana Krull
xvii
Paralinguistic variation and invariance in the characteristic freque
nc
ies of vowels
Hartmut TraunmOller
Analytical expressions for the
tonotopic sensory scaleHartmut TraunmOller
Attitudes to immigrant Swedish - A literature review and preparatory experiments
Una Cunningham-Andersson and
aileEngstrand
Representing pitch accent in Swedish
Leslie M. Bailey
PERILUS IX, February 1989
Speech after cleft palate treatment -an
a
lysis of a
1O-year
materialGarda Ericsson and Birgitta Ystrom
Some attempts to measure speech comprehension
Robert McAllister and Mats Dufberg
Speech after glossectomy: phonetic considerations and som preliminary results
Ann-Marie A/me and aile Engstrand
PERILUS X, December 1989
FO
correlates of tonal wordaccents in s
pontane
ous s
peec
h: range and systematicity of variationaile
Engstrand
Phonetic features of the acute and grave word accents: data from spontaneous speech.
aile Engstrand
A note on hidden factors in vowel perception experiments
Hartmut TraunmOller
Paralinguistic speech signal transformations
Hartmut TraunmOller, Peter Branderud and Aina Bigestans
PREVIOUS ISSUES
Perceived strenght and identity of foreign accent in Swedish
Una Cunningham-Andersson and aile
Engstrand
Second formant locus patterns and consonant-vowel
coarticulationi n spontaneous speech
Diana Krull
Second formant locus - nucleus patterns in spontaneous
spee
ch: some preliminary results on French
Danielle Duez
Towards an ele
ct
ropa
lat
ogr
aphi
c specificationof
consonant articulation inSwedish.
aile
Engstrand
An
acoustic-perceptual study of Swedish vowels produced by a subtotally glossectomized speakerAnn-Marie Alme, Eva Oberg and
aileEngstrand
PERILUS XI, MAY 1990
In what sense is speech quantal?
Bjorn LIndblom
& aileEngstrand
The status of phonetic gestures
Bjorn LIndblom
On the notion of "Possible Speech Sound"
Bjorn Lindblom
Models of phonetic variation and selection
Bjorn Lindblom
Phonetic content in phonology
Bjorn Lindblom
PERILUS XII, MAY 199 1
On the communicative process: Speaker-listener interaction and the development of speech
Bjorn Lindblom
Conversational maxims and principles of language planning
Hartmut TraunmOller
Quantity perception in Swedish [VC]-sequences:
word length and speech rate.
Hartmut TraunmOller and Aina Bigestans
Perceptual foreign accent: L2 user's co
m
pre
hen
si
on
abilityRobert McAllister
PREVIOUS ISSUES
Sociolectal sensitivity in native, non-native and non speakers of Swedish - a pilot study
Una Cunningham-Andersson
Perceptual evaluation of speech following subtotal and partial glossectomy
Ann-Marie Alme
VOT
in spontaneous speech and in citation form words Diana KrullSome evidence on second formant locus-nucleus patterns in spontaneous speech in French
Daniell Duez
Vowel production in isolated words and in connected speech:
an investigation of the linguo-mandibular subsystem
Edda Farnetani and Alice Faber
Jaw position in English and Swedish VCVs
Patricia A. Keating, Bjorn Lindblom, James Lubker, and Jody Kreiman
Perception of CV-utterances by young infants:
pilot study using the High-Amplitude-Sucking technique
Francisco Lacerda
Child adjusted speech
Ulla Sundberg
Acquisition of the Swedish tonal word accent contrast
Olle Engstrand, Karen Williams, and Sven Stromquist
PERILUS XIII, May 199 1
xix
(Papers from the Fifth National Phonetics Conference, Stockholm; May 199 1)
Initial consonants and phonation types in Shanghai
Jan-Olof Svantesson
Acoustic features of creaky and breathy voice in Udehe
Galina Radchenko
Voice quality variations for female speech synthesis
Inger Karlsson
Effects of inventory size on the distribution of vowels in the formant space: preliminary data from seven languages
Olle Engstrand and Diana Krull
PREVIOUS ISSUES
The phonetics of pronouns
Raquel Willerman and Bjorn Lindblom Perceptual aspects of an intonation model Eva G�rding
Tempo and stress
Gunnar Fant, Anita Kruckenberg, and Lennart Nord On prosodic phrasing in Swedish
Gosta Bruce, Bjorn Granstrom, Kjell Gustafson and David House Phonetic characteristics of professional news reading
Eva Strangert
Studies of some phonetic characteristics of speech on stage Gunilla
Th
unber
gThe prosody of Norwegian news broadcasts
Kjell
GustafsonAccentual prominence in French: read and spontaneous speech
Paul Touati
Stability of some Estonian duration relations
Diana Krull
Variation of speaker and speaking style in text-to-speech systems
Bjorn Granstrom and Lennart Nord
Child adjusted speech: remarks on the Swedish tonal word accent
Ulla
Su
ndber
gMotivated deictic forms in early language acquisition
Sarah Williams
Cluster production at grammatical boundaries by Swedish children:
some preliminary observations
Peter Czigler
Infant speech perception studies
Francisco Lacerda
Reading and writing processes in children with Down syndrome - a research project
Irene Johansson
Velum and epiglottis behaviour during production of Arabic pharyngeals:
fibroscopic study
Ahmed Elgendi
PREVIOUS ISSUES
Analysing gestures from X-ray m
oti
on
films of speech Sidney WoodSome cross language aspects of co-articulation
Robert McAllister and aile Engstrand
Articulation inter-timing variation in speech: modelling in a
recognitionsystem
Mats Blomberg
The context sensitivity of the perceptual interaction
between FO and F1 Hartmut TraunmOllerOn the relative accessibility of units and representations in speech perception
Kari Suomi
The QAR comprehension test:
a
progress report on test comparisonsMats Dufberg and Robert McAllister
Phoneme recognition using multi-level perceptrons
Kjell Elenius och
G.Takacs
Statistical inferencing of text-phonemics correspondences
Bob Damper
Phonetic and phonological levels in the speech of the deaf
Anne-Marie Oster
Signal analysis
and
speech perception in normal and hearing-impaired listenersAnnica Hovmark
Speech perception abilities of patients using cochlear implants, vibrotactile aids and hearing aids
Eva Age/fors and Arne Risberg
On hearing impairments, cochlear implants and the perception of mood in speech
David House
Touching voices - a comparison between the hand, the tactil ator and the vibrator as tactile aids
Gunilla Ohngren
Acoustic analysis of dysarthria associated with multiple sclerosis - a preliminary note
Lena Hartelius and Lennart Nord
Compensatory strategies in speech following glossectomy
Eva Oberg
PREVIOUS ISSUES
xxi
Flow and pressure registrations of alaryngeal speech
Lennart Nord, Britta Hammarberg, and Elisabet Lundstrom
PREVIOUS ISSUES
Participants in alphabetic order
Christian Abry
Institut de la
Communication ParleeUniversite Stendhal
Domaine Universitaire, B.P 25X F-38040 Grenoble
Cedex, FranceGerard Bailly
Institut de la
Comminication Parlee46, avenue Felix-Viallet
F-38031 Gr
enob
le Cedex, FranceNick Campbell
ATR Auditory
and
Visual Perception Research Laboratories Inuidani, Seika-choSoraku-gun
Kyoto 619-02 Japan Rene
Carre
Ecole Nationale Superieure des Telecommunications 46, rue Barrault
F-75634 Paris Cede x 13, France
Ba
rbara Da
vis
Department of Linguistics University of Texas at Austin Austin, TX 78712, USA Randy Diehl
Department of Psychology University of Texas at Austin 330 Mezes
Austin, TX 78712, USA Danielle Duez
Institut de phonetique
L.A. 261 - CNRS, Universite de Provence 29, avenue Robert-Schuman
F-13621 Aix-en-Provence, France Olle Engstrand
Institutionen for lingvistik Stockholms universitet
S-10691 Stockholm, Sweden
xxiii
Gunnar Fant
Institutionen for taloverforing och musikakustik Kung!. Tekniska Hogskolan
Box 7001 4
S-1 0044 Stockholm, Sweden Edda Farnetani
Centro per Ie ricerche di foneticalCNR Piazzale G. Salvemini 13
1-35131 Padova, Italy Vincent Gracco
Haskins Labora
to
ri
es
270, Crown Street
New Haven,
CT
06511-6695,USA Philip HooleInstitut
fOrP
hon
et
ik und s
prachli
che
Kommunikation der Uni
versi
tat M
On
chen
Schellingstrasse
3
0-8000 MOnchen, Germany Keith Kluender
Department of Psychology University of Wisconsin 1202 West Johnson
Street
Madison, WI 53706, USA Klaus Kohler
Institut fOr Phonetik und digitale Sprachverarbeitung Universitat Kiel
Olshausenstrasse 40 0-2300 Kiel, Germany Barbara KOhnert
Institut fOr Phonetik und sprachliche Kommunikation der Universitat MOnchen
Schellingstrasse 3
0-8000 MOnchen, Germany Gerald o. Lame
Department of Linguistics University of Texas at Austin Austin, TX 78712, USA
PAR TICIP ANTS
Bjorn Lindblom
Institutionen fOr lingvistik Stockholms universitet
S-10691 Stockholm, Sweden and
Department of Linguistics University of Texas at
AustinAustin, TX 78712, USA Anders LOfqvist
Institutionen for logopedi och foniatri Lasarettet
S-22185 Lund, Sweden and
Haskins Laboratories 270 Crown Street
New Haven, CT 06511, USA Peter MacNeilage
Department of Linguistics Unive
rsi
tyof Texas
at Austin Austin, TX 78712, USA Shinji MaedaDepartement SIGNAL CNRS URA-820 ENST 46, rue Barrault
F-75634 Paris Cedex 13,
France
Sharon Manuel
Communication Disorders and Sciences Wayne State University
563, Manoogian Hall Detroit, MI 48202, USA Ignatius Mattingly Haskins Laboratories 270, Crown Street
New Haven, CT 06511, USA Robert McAllister
Institutionen for lingvistik Stockholms Universitet S-10691 Stockholm, Sweden
PARTICIPANTS
xxv
Zeung-Jae Moon
De
p
artment of Linguistics University of Texas at Austin Austin, TX7871 2,
USA Kevin MunhallDepartment of Psychology Queen's University
Kingston, Ontario
K7L
3N6, Canada Sieb NooteboomResearch Institute for Language and Speech Utrecht University
Trans 10
NL-3512JK Utrecht, Netherlands Lennart Nord
Institutionen fOr taloverforing och musikakustik Kung!. Tekn. Hogskolan
S-1 0044 Stockholm, Sweden John Ohala
Department of Linguistics University of Alberta
Edmonton, Alberta T6G 2E7, Canada David Ostry
Department of Psychology McGill University
Montreal, P.Q. H3A 1 B1, Canada Joseph S. Perkell
Research Laboratory of Electronics 36-591 Massachusetts Institute of Technology Cambridge, MA 02139, USA
Daniel Recasens
Institut d'Estudis Catalans Carrer del Carme,
47
E-08001 Barcelona, Spain Lise-Lotte Roug-Hellichius Institutionen for lingvistik Stockholms universitet
S-10691 Stockholm, Sweden
PAR TICIPANTS
Elliot Saltzman Haskins Laboratories 270 Crown Street
New Haven, CT 0651 1 ,
USAJean-Luc Schwartz I. C. P.
INPG
46 Av. Felix-Viallet
F-38031 Grenoble Cedex, France Celia Scully
Department of Linguistics and Phonetics The University of Leeds
Leeds LS2 9JT, England Linda Shockey
Department of Linguistic Science University of Re
a
di
ngWhite
Knights
Reading RG6 2AA, England
Kenneth N. Stevens
Research L
abora
to ry of
Electronics 36-517 Massachusetts Institute of Techology Cambride, MA 02139, USAMichael Studdert-Kennedy Haskins Laboratories 270 Crown Street
New Haven, CT 06511-6695, USA Harvey Sussman
Department of linguistics University of Texas at Austin Calhoun Hall 510
Austin, TX 78712, USA Hartmut Traunmuller Institutionen for lingvistik Stockholms universitet
S-10691 Stockholm, Sweden
PAR neIP ANTS
xxvii
Eric Vatikiotis-Bateson
ATR Auditory and Visual Perception
Research LaboratoriesInuidani, Seika-cho Soraku-gun
Kyoto 61 9-02 Japan Marilyn Vihman 539 Old Street Road Trevose, PA 19053, USA Natalie Waterson
' Innisfree', Fairby Lane
Hartley, Longfield
Kent DA3
BOA,
EnglandPARTICIPANTS
Does increasing representational complexity lead to more speech variability?
On explaining some French data by current models of anticipatory rounding
Christian ABRYand Tahar LALLOUACHE
/nstitut de /a Communication Parlee, Grenoble-Stendhal (France)
1 Introduction
On testing competing models of anticipatory rounding - the so-called
look-ahead(LA),
time-locked(T L), and
two-stageor
hybrid(H) models, following the procedure set up by Perkell (1990)
-with French data (Abry and Lallouache, 199 1), we had the opportunity to observe a lot of variational behavior, for the
samespeaker, between and
withintwo sessions (dec. 1989 and june 1990). This discards all three models as candidates to account for anticipatory rounding in general and the possibility to invoke linguistic and biological variables (Lubker and Gay, 1982) to account for variability. Experimental variables are thus to be suspected.
At a first guess, within speaker variability between sessions, and within session variability for the same speaker, could point out that:
(i) Variability would depend on the
complexityof the encoded string, i.e. the number of consonants and junctures, currently used in this kind of experiment.
(ii)
Poorly audibleprotrusion movements (Benguerel and Adelman, 1976) through up to five consonants [kstsk] complexes would be "free" to vary, provided they reach their audible goal, in our case the vowel [y].
(iii)
Slowmovements, occuring in the realization of such heavy syllables, would be more variable than fast ones; more specifically, they would allow underlying vocalic (and supposedly
consonantal)protrusion "gestures" to emerge more clearly (Boyce
et al.,1990), leading to bumpy movement profiles.
Note that all three factors could perhaps explain an even stronger variability than the one observed. This makes the variability phenomenon overdetermined and its hypothesized origins difficultly testable when it is observed in such complex strings, because poor audibility (of the possible consonants) and sluggishness are usually concomitant with this increase in complexity.
However, a trend to proportional stability observed in one session, where the eliciting technique allowed to improve prosodic pausing control, indicated that, in the pronunciation of such complex clusters, prosodic mastery can counteract effectively the variability induced by representational (syllabic and metric) complexity, ultimately preventing poorly audible movements - in spite of their increased sluggishness - to become bumpy. The first two factors (i) & (ii) being present, and the longest realizations of consonantal strings (iii) belonging precisely to this session, we will challenge the claim that the coproduction framework (overlap of "gestures") could explain the divergences in anticipatory behavior with
1
the basic assumption that "the addition of 'neutral' segments ['protruded' is meant]
and an decrease in speaking rate both favor the occurence of multi-stage patterns and allow individual gestures to emerge as distinct entities"
(ibid.,p. 186).
2 Paradigm and results
In French (non Southern), the number of consonants admitted within [-round] to [+round] transitions, like [i. . .y], can be up to 6 (e.g.:
C' etait pourtant un directeur d'societe miUmupuleux),depending on the possibility of schwa deletion (thus, the famous French
sinistre structureof Benguerel and Cowan, 1974, appeared to be unpronounceable without schwa, even by Northern speakers). We chose to manipulate the number of consonants and the position of the juncture, starting from a «mirror» sequence [ . . .ikstsky . . . ] in
Ces deux Sixte sculpterent("These two Sixte [popes] sculptured"), thus obtaining [kstsk], [kssk], [ksk], [kk], [k] and none [-], with [kst;i:sk], [ks;i:sk], [k;i:sk], [ks;i:k], [k;i:k], [;i:k] , [k;i:] , [k], [;i:], [-], as French regular sequences.
Our attention being focussed, in this discussion, on the possible falsification of the hypothesis that any lengthening effect on the obstruence interval - such as number of consonants, stress and slow rate - would lead to more complex patterns of protrusion movements (Boyce
et al.,1990), we will examine only the most
"heavy" case [ikstsky]. Following Perkell's procedure, kinematic events were detected manually on instantaneous velocity and acceleration functions of the upper lip protrusion, derived from cubic spline functions fitted to raw measurements on each 20 ms video field of our recordings (for more details see Lallouache, 199 1).
An
obstruence intervalwas determined on the synchronized audio signal (sampled at 16KHz) by detecting [i] offset and [y] onset, corresponding respectively to the disappearance and appearance of a clear vocalic formant structure. Among parameters other than upper lip protrusion, image processing enabled us to measure between-lips area from front views. Additional cepstral formant tracking and measurements were checked when needed.
We examined in more details patterns of events referred to [y] acoustic onset and [i] offset in Abry and Lallouache (199 1). During the 2nd session, all three movement profiles - characterized by Perkell (1990) - were observed: (i) a unique ramp, i.e. a one-phase gesture; (ii) no (or a weak) movement phase, followed by a rather steep start of protrusion, i.e. a two-phase protrusion ; (iii) an initial ramp-like phase, followed by a steeper phase, i.e. two phases again. In contrast to this large variety of profiles, the 1 st session displayed almost exclusively movements of the type (ii).
So we must agree with Perkell ( 1990, p. 280) in rejecting all three "strong versions" of LA, TL and
Hmodels. The protrusion «beginning» (conventionally:
minimum value) was not locked at the offset of the unrounded first vowel (LA), nor fixed relative to the onset of the rounded second one (TL; its peak acceleration neither, thus rejecting H). The only consistent fact
through both sessionswas that peak protrusion was locked
aboutthe onset of [y]: this means simply that no plateau-like and/or spatially overshooted anticipations were observed.
So our data exemplify all the three main types of profiles, but they violate all three models with respect to their predicted dates.
� '-'
0
-Eo-
<
CI:
� <
..l
;.. Eo-
-u 0 ..l
� >
:::e:
<
� i:l.
100 �---�
o
80
cP
£ £
60 .��
�� 1st session £
flJr.; £
�
�
��:
40
0 0
20
o� ____________________________________ �
200 300
OBSTRUENCE INTERVAL (ms) Fig.1
(See text, below)
400
3 A tentative explanation
Such negative results have puzzled students in coarticulation for years. And for our part, we were about to give up and to come to a conclusion about variability
per se,when we suddenly remembered
(post hoc!)that we had used innocently two different eliciting techniques to make produce such complex consonant chains as
[
. . .kstsk .
. .]. In the 2nd session, instruction was to repeat the sentence, prompted
with a long pause: «Ces deux Sixte . . . sculpterent»,
as a whole;whereas in the 1st one the subject had to repeat, when prompted, the noun phrase: «Ces deux Sixte», linking up with: «Ces deux Sixte sculpterent». This possibility to «prime» the action could be compared with a trial approach before jumping the hurdle, allowing to size it up. In our case the effect was a better movement «chunking»
(corresponding to prosodic parsing) in this 1st performance, which is visible looking at converging cues, such as overall longer obstruence intervals and less elisions of the closure phase for [t] (see Abry and Lallouache, 199 1, for an illustration).
Testing the
proportionality(for this procedure, cf. Gentner, 1987) for the different kinematic events, it was found that, within the obstruence interval, peak
3
velocity (see Fig. 1) was relatively stable in the 1st session (53%
�protrusion lag ratio
�67%), whereas it drifted towards [y] onset in the 2nd one, as obstruence duration increased (from 30% to 96% lag ratio).
An interpretation of the proportional behavior in the 1st session could be that
more pausing lengthens rounding anticipation(see Cathiard
et al.,for evidence in silent pauses, for French and for the same speaker), hence increasing
bothphases:
the first one that corresponds to a rather clear realization of [kstt:] (without a full silent pause, of course), with no, slow or just starting protrusion; and the second to the deceleration phase towards peak protrusion.
In the 2nd session, due to the eliciting technique, pausing was not as easily controlled, which led to unsteadily junctured products. But we have no suggestion to explain the fact that this instability caused the maximum velocity event to draw nearer to [y] onset as obstruence interval increased (Fig. 1).
4 Provisional conclusions
Since it has been recently reemphasized by supporters of the coproduction model - and this specifically in order to explain conflicting findings about anticipatory rounding-, that any lengthening effect on the obstruence interval (such as number of consonants, stress and slow rate) would lead to more complex patterns of protrusion movement, allowing «individual gestures [vocalic and consonantal protrusions] to emerge as distinct entities» (Boyce et al., 1990), one noteworthy finding was that the greatest stability in movement profiles was obtained in the 1 st session, which brought about the longest and more carefully pronounced tokens.
This shows that rate is not the only factor capable of smoothing articulatory trajectories, as it increases. In the case of a so complex clustering task - hence bumpy in nature in its execution, and this not necessarily because it would prevent an aggregation of idiosyncratic gestures -, an improvement in prosodic pausing control can counteract effectively the variability induced by representational complexity, ultimately preventing movements to become bumpy, in spite of their poor audibility and/or increased sluggishness.
Since we proposed that the whole family of
upper lipprotrusion profiles, we observed in both sessions, could be accounted for by the binding - under prosodic control - of two
active vocalicgestures (active retraction for [i] and active protrusion for [yD, with no influence of an
active upper lip protrusionfor [s], [t]
consonants, at least in French (contrary to the suggestion of Boyce
et aI.,1990, for English; their arguments rest mainly on the
lowerlip behavior - in spite of its clear coupling with the jaw for such coronal consonants -, and seems to us obviously immaterial as regards the
upperlip, see their Fig. 6),
E.Saltzman asked us about the similarity of this assumption with the coproduction one. Our answer is clearly that the prosodic control in the binding of the two underlying vocalic gestures - provided their acoustic constraints are
sufficientlysatisfied - determines the extent of the anticipation and not a look-ahead programming process, nor a time-locked
<?,ne. So we basically agree with the very general assumption (made popular by
Ohman) that vocalic gestures are carrier gestures; but disagree on the specific
interpretation of consonantal "perturbations" on anticipatory rounding; and on the
fact that lengthening factors (number of segments, stress, rate) could explain
perse
the presence/absence of perturbations, hence one-
v s.two-stage data.
The actual problem which is left unsolved by our present view is how gestures are bound through junctures, i. e. essentially between words? Some answers are certainly to be taken from an approach like the one adopted by Fujimura et al.
( 199 1), which - among other concepts - explicitly uses boundary pulses in order to take account of prosodic effects on articulatory gestures and copes with the chunks of motor programming in utterance phrasing.
Thanks to Bj6m Lindblom and Olle Engstrand, who invited us to this seminar, and to Bob McAllister, David Ostry, Joe Perkell and Elliot Saltzman for their comments. On some controversial points regarding anticipation, this text is a stiffened version of Abry and Lallouache (1991), which benefited from subsequent discussions in order to be improved (except our English, every new mistake being our own). This work was done at CNRS URA 368.
References
Abry, C. & Lallouache, M.T. (1991), "Audibility and stability of articulatory movements.
Deciphering two experiments on anticipatory rounding in French", Proceedings of the XIIth Congress of Phonetic Sciences, vol. 1, 220-225.
Benguerel, A.-P. & Adelman, S. (1976), "Perception of coarticulated lip rounding", Phonetica, 33, 113-126.
Benguerel, A.-P. & Cowan, H.A (1974), "Coarticulation of upper lip protrusion in French", Phonetica, 30, 41-55.
Boyce, S.E., Krakow, RA., Bell-Berti, F. & Gelfer, C. E. (1990), "Converging sources of evidence for dissecting articulatory movements into core gestures", Journal of Phonetics, 18, 173-188.
Cathiard, M.-A., Tiberghien, G., Cirot-Tseva, A, Lallouache, M.T. & Escudier, P. (1991),
"Visual perception of anticipatory rounding during acoustic pauses : a cross-language study", Proceedings of the XIIth Congress of Phonetic Sciences, vol. 4, 50-53.
Fujimura, 0., Erikson, D. & Wilhelms, R (1991), "Prosodic effects on articulatory gestures. A model of temporal organization", Proceedings of the XlIth Congress of Phonetic Sciences, vol. 2, 26-29.
Gentner, D.R. (1987), "Timing of skilled motor performance: tests of the proportional duration model", Psychological review, 94, 255-276.
Lallouache, M.T. (1991), Un poste "Visage-Parole" couleur. Acquisition et traitement des contours des levres, Doctoral thesis, Institut National Poly technique de Grenoble.
Lubker, J. & Gay, T. (1982), "Anticipatory labial coarticulation: Experimental, biological, and linguistic variables", Journal of the Acoustical Society of America, 71 (2),437-448.
Perkell, J.S. (1990), "Testing theories of speech production: implications of some detailed analyses of variable articulatory data", in W.J. Hardcastle & A. Marchal (Eds), Speech production and speech modelling, Kluwer Academic Publishers, Dordrecht, Boston, London, 263-288.
5
SOME CROSS LANGUAGE ASPECTS OF CO-ARTICULATION
ABSTRACT
Robert MeA/lister and Olle Engstrand Institute of Linguistics Stockholm, Sweden
The work reported in this paper concerns som temporal aspects of vowel dynamics in English, French and Swedish. The language specific auditory effects of dynamic complexity and direction of tongue movement are starting points for a study of VCV sequences in these three languages using dynamic electropalatography. Tongue movement is compared between the three lan
guages. Results support the assumtion that differences in auditory impressions of vowels in Swedish and English are dependent on differences in the timing of similar articulatory events whereas French seems to employ quite different articulatory strategies.
1. Introduction
This paper is a brief progress report on research activities in connection with the ACCOR project (Articulatory-Acoustic Correlations in Coarticulatory Processes: A Cross-Language Investigation) which is part of ESPRIT's Basic Research Action program. The work being reported on here is focused on articulatory dynamics in VCV utterances and, in particular, vowel dynamics in these sequences. In many dialects of English, high vowels such as
Ii!and
luiare heard to glide from a somewhat centralized towards a more cardinal vowel quality. The corresponding Central Swedish vowels tend to display a more complex dynamic behavior with a final offglide from cardinal to centralized. In French, on the other hand, these vowel colors sound essentially constant. These language specific, auditory effects are quite characteristic. From a cross-lin
guistic point of view, these dynamic patterns tend to typify a phonetic typology based on two continuous dimensions: 1) dynamic complexity (monophthongal, diphthongal, triphtongal, ... ), and 2) direction of movement (offgliding, onglid
ing). Among the languages mentioned above, French would approximate the dynamically less complex type, whereas English and Swedish would approxi
mate the dynamically more complex type; and English would approximate the
ongliding type, whereas Swedish would approximate the offgliding type.
articulatory means employed to bring about these effects. It might be assumed, in particular, that differences in perceived vowel dynamics between some languages (perhaps English and Swedish) are brought about essentially by means of different relative timing of onsets and offsets of parallel activity in the articulatory and phonatory subsystems, whereas the activity pattern in each particular subsystem varies less between the languages; other languages (per
haps French) might employ a different articulatory scheme altogether. In this paper, we present some preliminary electropalatographic (EPG) data relevant to this question.
2. METHODS
We used the EPG system available at Reading to record a set of vowel-con
sonant-vowel (VCV) utterances, forming all possible combinations of V
=/i,a/
and C
=/p,b/, spoken by an Australian English, a French, and a Swedish speaker. The English and Swedish vowels belonged to the set of tense vowels;
the French vowel inventory has no tense vs. lax distinction. Randomly ordered lists of these combinations were read several times by each speaker.
3. RESULTS
We will limit this report to some results on the sequence /ipi/ as produced by the three speakers. Figure 1 shows number of activated electrodes (out of a total of 62) at various points in time for English and Swedish; from left to right:
a) acoustic onset of Vl, b) maximum number of activated electrodes during Vl, c) acoustic offset of Vl, d) minimum number of activated electrodes (for English
=acoustic /p/ release, e) (only Swedish) acoustic /p/ release, f) maximum number of activated electrodes during V2, g) acoustic offset of V2.
For French, where no clear maxima or minima could be discerned, the triangles correspond to a) acoustic onset of Vl, b) acoustic offset of Vl, c) acoustic /p/
release, d) acoustic offset of V2. Acoustic segments corresponding to /i/l, /p/
and /i/2 are indicated at the bottom of the figure for each subject. The data
represent averages of
5repetitions of the test utterance. The Swedish data are
shown by filled squares, the English data by filled circles, and the French data
by triangles. These symbols are connected by straight lines. The data are aligned
to the point in time where there is a minimum number of active electrodes for
all three subjects. This point also corresponds to the /p/ release for the
Australian English and the French subject. When the data are synchronized in
this way, the similarity between the overall English and Swedish contours, and
the difference between these and the French contour, are evident. In particular, the English and Swedish data both display a deep "trough" in the electrode activation pattern, corresponding to a relaxation of the tongue position roughly coinciding with the consonant; the tendency to such a trough in the French pattern is to weak to be statistically significant.
There is, however, a clear difference between the English and the Swedish contours. In the Swedish contour, most of the vowel offglides fall within the vocalic segments, whereas they mostly fall outside the vocalic segments in the English contour. In other words, the troughs in the respective EPG pattern are differently timed relative to the acoustic segment boundaries; the minimum number of activated electrodes occurs at the middle of the consonant segment in the Swedish subject, and at the CN2 boundary in the Australian-English subject. These differences are thus due to a different relative timing between the tongue articulation underlying the EPG activation patterns and the parallel labial and glottal activities .
...
-
.-
�---4---4---'r---�ISwedish Eng�ish French
Figure 1. Number of activated EPG electrodes at different points in time during the production of the utterence lipV by an Australian English (circles). a Swedish (squares) and a French speaker (triangles). Below: segment boundaries between the vocalic portion of 1V1. and /V2.
In summary, this limited data set supports the assumption that the difference in perceived vowel dynamics between English and Swedish can be primarily brought about by means of different relative timing of onsets and offsets of activity in the articulatory and phonatory subsystems, whereas French seems to employ a quite different articulatory scheme. In French, the auditory im
pression of a constant, non-dynamic vowel quality seems to correspond to a constant articulatory position throughout the /ipi/ sequence. This also shows that the presence of a trough in a VCV sequence is language specific rather than universal (Perkell, 1986), and that its timing relative to acoustic boundar
ies is related to characteristic dynamic properties of vowels in the respective languages. A further factor possibly contributing to the presence of troughs in vowel-symmetrical utterances in English and Swedish is related to conditions on aspiration as discussed in McAllister (1978) and Engstrand (1988, 1989). In particular, the aerodynamic requirements on the production of the stressed, aspirated /p/ release would include a relatively wide vocal tract (cf. Stevens, 1971), a condition met when the high vowel position is temporarily relaxed. In French, where voiceless stops are not aspirated, or considerably less aspirated, this adjustment would not be necessary.
REFERENCES
[1] Engstrand,
O.1988. Articulatory correlates of stress and speaking rate in Swedish VCV utterances. JASA 83, 5 1863-1875
[2] Engstrand,
O.1989. "Towards an electropalatographic specification of consonant articulation in Swedish." Phonetic Experimental Research, Institute of Linguistics, University of Stockholm (PER/LUS) X, 115-156.
[3] McAllister, R. 1978. "Temporal assymmetry in labial coarticulation." Papers from the Institute of Linguistics, University of Stockholm (PILUS) 35.
[4] Perkell,
J.1986. "Co articulation strategies: preliminary implications of a detailed analysis of lower lip protrusion movements." Speech Communication 5, 47-68.
[5] Stevens, K.N. 1971. "Airflow and turbulence noise for fricative and stop consonants: static considerations." Journal of the Acoustical Society of America, 50, 1180-1192.
ACKNOWLEDGMENTS