Detecting Social Patterns from Shifting Dialects

linguistics, accents — (Image credit: Mouth image via Shutterstock)

This Behind the Scenes article was provided to LiveScience in partnership with the National Science Foundation.

Knowing glances may dot a room when listeners hear the line, "You say tomato, I say tomahto," from the popular Gershwin song "Let's Call the Whole Thing Off." Whether you're from Philadelphia or Fresno, Winnetka or Waco, your dialect often identifies you with a particular locale.

Now using a powerful computer program, researchers at the University of Pennsylvania provide insights into a significant change in the dialect of Philadelphians. In a century's time, the sound of Philadelphia has shifted from a somewhat southern accent to a more northern one. And it's not just a few areas of Philadelphia. The entire city shifted. "The reversal indicates major changes in social patterns," says University of Pennsylvania linguist William Labov.

Latest Videos From

Watch full video here:

Labov and his colleagues developed their conclusions using a program called Forced Alignment & Vowel Extraction (FAVE). It allowed them to automatically analyze vowel sounds on recordings of interviews with speakers from 89 neighborhoods throughout the city whose birth years ranged from 1888 through 1991. The interviews were compiled yearly beginning in 1973 as part of a long-term language study undertaken by Labov and his students.

"We wanted to make automatic what, in the past, was a painfully slow hand process," says Labov of the computer analysis program. Previously, vowel analysis required listening to a digital recording on a computer and physically stopping the audio to make a measurement of a vowel sound. The few automated analysis programs available required quality checks to determine if the program had correctly identified the start and end of a vowel sound.

"When the original algorithm was working correctly, very few errors were found. However, when it was off, it was off by a lot and introduced numerous errors," says Josef Fruehwald, a doctoral student working with Labov. Older analysis programs were also unable to accurately sort through the extraneous noises introduced on the recordings by household sounds such as water running or a television playing in the background.

Two years in the making, the FAVE program follows every word on an interview transcript and looks up the each word's sounds in a pronunciation dictionary. For the word "bat," for instance, the algorithm marks the beginning and end of b, a, and t. It then provides analysis for vowels throughout the entire interview. The program is so efficient that in one hour it provides 7000 measurements for one interview. Before FAVE, an analysis could take 3 days and yield just 300 measurements.

These spectrograms, two of the million measured by a program called the FAVE suite, illustrate a speaker born in 1888 (top) and a speaker born in 1988 (bottom) vocally progressing from the word "make" toward "meek." The vertical bars show the beating of the vocal cords. The horizontal dark bars show the shaping effect of the tongue and the lips.

Sound changes such as those studied here remain a major obstacle to communication, especially when it comes to machine recognition of spontaneous speech. Companies engaged in creating speech recognition programs have used the Atlas of North American English, produced by Labov's research group, to define the range of dialects that must be represented in the data base of sounds used to "train" the speech recognition software. Philadelphia teachers are also using the group's results to refine their classroom plans so that they account for speech variations among students.

Editor's Note: The researchers depicted in Behind the Scenes articles have been supported by the National Science Foundation, the federal agency charged with funding basic research and education across all fields of science and engineering. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author and do not necessarily reflect the views of the National Science Foundation. See the Behind the Scenes Archive.

TOPICS