ALTA Logo Proceedings of ALTSS/ALTW, Melbourne, December 2003

Speech processing

David Grayden, Bionic Ear Institute, Melbourne


ABSTRACT:

This course is an introduction to the speech signal and how it is processed by humans and by machines. We begin with the production of speech, the properties of the acoustic signal and how it is perceived by humans. Then we look at the methods of analysing the speech signal. Speech signal analysis and human perception are tied together by looking at speech coding, in particular perceptual coding of sound using MPEG-1 psychoacoustic models, such as MP3. We touch on data embedding and watermarking and then look at automatic speech recognition in some detail. Finally there is an introduction to speech synthesis and areas of ongoing speech processing research.

BIO :

Dr David Grayden has been working as a Research Fellow at the Bionic Ear Institute in Melbourne since 1997. His main research involves examination of phoneme confusions made by people using cochlear implants with the view to designing strategies that will improve perception by the users. He is currently developing and evaluation a number of advanced sound processing strategies. He is also involved in other research areas, including automatic speech recognition and speech enhancement using auditory models, auditory physiology, integration of auditory and visual input, and models of spike-timing dependent plasticity for adaptive learning of spatiotemporal patterns. [http://www.bionicear.org/people/graydend/]

RESOURCES: