coarticulation is not a design flaw

I’m just back from a talk* where it was argued that language is far from a perfect or optimal system, but something that happens to work, most of the time, in spite of being bodged together in a clumsy and inelegant way (it could never have been designed to be this way, but with a bit of tinkering it comes to have the properties which make it at least functional).

The argument itself is coming from a background in American generative syntax, and so most of the argumentation was directed towards showing that human minds don’t and can’t represent entire trees for complicated syntactic structures. Which is actually, and thankfully, not even slightly controversial in many linguistics departments today, although apparently not all.

The phon-link, however, came in the discussion session following the talk, when one example of a clumsy solution to the language problem was drawn from speech production. According to the speaker, it’s not ideal that speech is produced via a single-tube system (ie the vocal tract) – because it gives rise to problems such as coarticulation.

For a single-sentence tutorial on coarticulation, consider the way that you say the word ‘ten’ on its own, and the way that you say the word sequence ‘ten past’ – the end of the word ‘ten’ becomes more similar to the start of the word ‘past’ when you say them together, particularly in fast speech. It might sound a bit more like ‘tem past’, in other words.

But coarticulation isn’t a problem. It’s not a problem for speakers, it’s not a problem for hearers – if it’s a problem for anyone, it’s only for people who adopt the troublesome assumption that the components of words have their own form in some sense independently of the words they belong to, and that this form somehow changes to take on the shape of adjacent or nearby segments when the segments are all assembled in order to be articulated. The problem in speech analysis is not how coarticulation can happen, but how segmentation can be motivated, for what is an inherently continuous (non-segmented) stream produced by the overlapping movements of the tongue, lips, jaw, and so on – and it’s in precisely the “transitions” between what could be thought of as “segments” that so much of the information that is most valuable for hearers is located.

To paraphrase someone else’s slogan – coarticulation in speech is not noise, but information! and the perception of inelegance and clumsiness is very much just in the eye of the beholder.

* Actually, I’ve just discovered this wee rant languishing as a draft in a folder somewhere – the talk was so long ago I can barely remember what the speaker looked like. But I need to post it, if only for my own phon-related health. On account of unavoidable weekly commitments I haven’t been to the departmental phonetics/phonology seminar for weeks – months even – and the p-side of my brain (p-centre?) is getting worryingly undernourished.

6 thoughts on “coarticulation is not a design flaw

  1. I was also unconvinced by the argument that language is all that clumsy in general, but, on account of having rarely come out of arguments with syntacticians particularly well, feel I’m on safer ground saying that at least phon systems need not be seen as especially inelegant. (Do you have a view on that?)


  2. I was frankly surprised to find someone at this level making the kind of argument you rightly critique here.

    I did wonder when I first saw the title whether we were in for a quick exposition of co-articulated allophones or somesuch. Somewhere in the back of my mind being my observations since moving to Tyneside of the way that voiceless stops are coarticulated with a glottal stop; makes your point though because the information regarding the ‘bucal value’ of the stop is present in the qualities of the surrounding sounds.


  3. Yes I can vaguely remember reading something about glottal stops in Tyneside, although I’m not sure if it’s a case of voiceless stops plus glottal stop (glottal reinforcement?), or glottal stop instead of voiceless stop. However i think this was in a phonology textbook (Gussenhoven & Jakobs?) rather than any detailed (socio)phonetic work (although such must surely exist), so i wouldn’t like to say for sure either way.

    I wish I could remember more of the argument of the talk, but it was so long ago. It runs into difficulty at both ends though – either because you view things as problems on entirely flawed grounds, as in the speech production example, or because you adopt an a priori view of what would, in your view, constitute a truly elegant or efficient solution to the language problem. Surely (- discussion welcome though!) it would be better to simply accept the data we’ve got, establish what patterns there are, and not despise the communication systems that exist on the basis of them not being like, well, computers or whatever comparison you’re drawing.


  4. I’m pretty certain that the tyneside voiceless stops are not glottal substitutions for stops but genuine co-articulation: glottal reinforcement might be a good description. The vowel quality to my ear clearly gives information about the lingual point of articulation. And when I try to reproduce what I think I’m hearing, it only sounds right when I co-articulate glottal and velar, bilabial or alveolar stops.

    I’m all for accepting the data we have and working with that: isn’t that what a scientific study is *meant* to do? I know that there is a play-off between theory and observation (it’s the stuff of practical theology which is my day-job), but even so …


Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s