Integrating Noise and Unified Speech Models for Communication on Mobile Devices in Noisy Environments

Project Overview

An EPSRC funded collaborative research project (Pro. No GR/S30238/01) between Brunel University, Southampton University, University of East Anglia, Norwich and Queen's University Belfast.

The project is divided into three parts:

Mobile phones are often used in noisy outdoor environments such as a noisy street, airports, cafe, or in a moving car/train. The quality and intelligibility of speech can be severely degraded by the ambient noise. Therefore noise reduction is an increasingly important aspect of improving the quality of service (QoS) and reliability of speech communication. With the increasingly deployment of speech recognition and voice-based systems across a wide range of multimedia mobile services, it is important to the users and providers of mobile phones that speech communication and access to voice recognition systems is not impaired by noise. Noise degrades the accuracy of automatic speech recognition even for such modest tasks as name dialling, automatic directory enquiry, or voice control of the accessories in a moving car. Furthermore, the future generation of very low bit rate coders will increasingly depend on correct speech classification and accurate estimation of speech parameters for improved performance. The aim of this proposal is to develop an integrated system for both speech enhancement and speech recognition. This will be achieved through the development of a unified speech model for both speech recognition and synthesis/reconstruction, together with decision-tree predictive models of non-stationary noise sources typically encountered in mobile environments.

Objective

People

Academic Staff

Research Staff and Students

Database

NoiseX_0 NoiseX_1

Aurora

TIDIGITS ADULTS

Cellular Telephone Acoustic-Phonetic Continuous Speech Corpus (CTIMIT)

Progress Report

Project meeting 23/01/2004

Project meeting 25/05/2004

Project meeting 28/07/2004

Project meeting 14/02/2005

Project meeting 06/07/2005

Project meeting 23/11/2005

Links

News

List of MPhil/PhD Projects in MultiMedia Mobile Digtial Signal Processing

Advanced Signal processing and Noise Reduction 3rd Ed, S. Vaseghi, John Wiley 2006

Multimedia Signal Processing: Theory and Applications in Speech, Music and Communications, S. Vaseghi, John Wiley 2007

DeNoise Toolkit

This is a collection of established and new speech denoising methods developed at Brunel in collaboration with Southampton and UEA (Project sponsored by EPSRC).

Voice Morph

VoiceMorph is a software tool developed in our lab for analysis, modelling modification and of voice profile parameters including speaker correlates and Accent correlates of voice.

MSc Digital Signal Processing

Multimedia, communication and inteligent systems.