Research Interests

Dr. Nam proposed a model of how syllable structure that promises to reveal how people put segments together in speech, something that has the potential of revealing a great deal about disorders of speech organization like stuttering and apraxia. Dr. Nam is also well known to his peers as one of the major developers of Haskins Laboratories speech production model (aka TADA model), which is the first full computational model of the speech capacity, one of the most fundamental but least understood human capacities. TADA model has been increasingly employed and cited in the field of speech pathology, speech engineering, sign language studies, second language acquisition, let alone linguistic theories such as phonology and phonetics. Dr. Nam used the model to unlock the mechanics of babbling, essential to understanding why some children learn speech properly, and some do not. His model is further being implemented in a new type of speech recognizer that promises to revolutionize how computers recognize speech.

Representative Publications

Vowel variability in elicited versus spontaneous speech: evidence from Mixtec. Journal of Phonetics, 48, (2015) 45-59. (Coauthored with DiCanio, C., Amith, J. D., García, R. C., & Whalen, D. H)

Hearing tongue loops: Perceptual sensitivity to acoustic signatures of articulatory dynamics. Journal of the Acoustical Society of America, 134(5), (2013) 3808-3817. (Coauthored with Mooshammer, C., Iskarous, K., & Whalen, D. H.)

Using automatic alignment to analyze endangered language data: Testing the viability of untrained alignment. Journal of the Acoustical Society of America, 134(3), (2013) 2235-2246. (Coauthored with DiCanio, C., Whalen, D. H., Timothy Bunnell, H., Amith, J. D., & García, R. C.)

Computational simulation of CV combination preferences in babbling. Journal of Phonetics, 41(2), (2013) 63-77. (Coauthored with Goldstein, L. M., Giulivi, S., Levitt, A. G., & Whalen, D. H.)

A procedure for estimating gestural scores from speech acoustics. Journal of the Acoustical Society of America, 132(6), (2012) 3980-3989. (Coauthored with Mitra, V., Tiede, M., Hasegawa-Johnson, M., Espy-Wilson, C., Saltzman, E., & Goldstein, L.)

Recognizing articulatory gestures from speech for robust speech recognition. Journal of the Acoustical Society of America, 131(3), (2012) 2270-2287. (Coauthored with Mitra, V., Espy-Wilson, C., Saltzman, E., & Goldstein, L.)

Bridging planning and execution: Temporal planning of syllables. Journal of Phonetics, 40(3), 374-389. (2012) (Coauthored with Mooshammer, C., Goldstein, L., McClure, S., Saltzman, E., & Tiede, M.)

Biomechanically preferred consonant-vowel combinations fail to appear in adult spoken corpora. Language and Speech, 55(4), (2012) 503-515. (Coauthored with Whalen, D. H., Giulivi, S., Levitt, A. G., Hallé, P., & Goldstein, L. M.)

Articulatory information for noise robust speech recognition. Audio, Speech, and Language Processing, IEEE Transactions on, 19(7), (2011) 1913-1924. (Coauthored with Mitra, V., Espy-Wilson, C. Y., Saltzman, E., & Goldstein, L.)

An articulatory phonology account of preferred consonant-vowel combinations. Language Learning and Development, 7(3), (2011) 202–225. (Coauthored with Giulivi, S., Whalen, D. H., Goldstein, L. M., & Levitt, A. G.)

Response to macneilage and Davis and to Oller. Language Learning and Development, 7(3), (2011) 243-249. (Coauthored with Whalen, D. H., Giulivi, S., Goldstein, L. M., & Levitt, A. G.)

Retrieving tract variables from acoustics: A comparison of different machine learning strategies. The IEEE Journal of Selected Topics in Signal Processing, 4(6), (2010) 1027-1045. (Coauthored with Mitra, V., Espy-Wilson, C., Saltzman, E., & Goldstein, L.)

Perception of articulatory dynamics from acoustic signatures. Journal of the Acoustical Society of America, 127(6), (2010) 3717–3728. (Coauthored with Iskarous, K. & Whalen, D. H.)

Schwa deletion under varying prosodic conditions: Results of a pilot study. In S. Fuchs & M. Zygis (Eds.), Between the regular and the particular in speech and language (2010) (pp. 145-169). Berlin: Peter Lang Verlag. (Coauthored with Geng, C., Mooshammer, C., & Hoole, P.)

Self-organization of syllable structure: A coupled oscillator model. In F. Pellegrino, E. Marisco, & I. Chitoran (Eds.), Approaches to phonological complexity (2009) (pp. 299-328). Berlin, New York: Mouton de Gruyter. (Coauthored with Goldstein, L., & Saltzman, E.)

Coupled oscillator planning model of speech timing and syllable structure. In G. Fant, H. Fujisaki, & J. Shen (Eds.), Frontiers in phonetics and speech science (2009) (pp. 239-250). Beijing: The Commercial Press. (Coauthored with Goldstein, L., Saltzman, E., & Chitoran, I.)

Syllable-level intergestural timing model: Split-gesture dynamics focusing on positional asymmetry and moraic structure. In J. Cole & J. I. Hualde (Eds.), Laboratory phonology 9 (phonology and phonetics) (2007) (pp. 483-506). Berlin, New York: Walter de Gruyter.

The distinctions between state, parameter and graph dynamics in sensorimotor control and coordination. In Motor control and learning (2006) (pp. 63-73). (Coauthored with Saltzman, E., Goldstein, L., & Byrd, D.)

A competitive, coupled oscillator model of syllable structure. Proceedings of the 15th International Congress of Phonetic Sciences, Barcelona, Spain, (2003) 2253-2256. (Coauthored with Saltzman, E.)

Grants

A-193: Multilingual Gestural Models for Robust Language-Independent
Speech Recognition (Principal Investigator: NSF)

A-93: Links Between Production and Perception in Speech (Investigator: NIH)

A-40: Nature and Acquisition of the Speech Code and Reading (Investigator: NIH)

 

 

 

Contact

Korea University
Dept. of English Language and Literature
145 Anam-ro, Seongbuk-gu, Seoul, Korea

Haskins Laboratories
300 George Street
Suoite 900
New Haven, CT 06511

Haskins Phone:
(203) 865-6163, x235
Yale Phone: (203) 764-9353
Haskins Fax: (203) 865-8963

nam@haskins.yale.edu