Speech Enhancement from Additive Noise and Channel Distortion - a Corpus-Based Approach

Ming Ji, Daniel Crookes

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

This paper presents a new approach to single-channel speech enhancement involving both noise and channel distortion (i.e., convolutional noise). The approach is based on finding longest matching segments (LMS) from a corpus of clean, wideband speech. The approach adds three novel developments to our previous LMS research. First, we address the problem of channel distortion as well as additive noise. Second, we present an improved method for modeling noise. Third, we present an iterative algorithm for improved speech estimates. In experiments using speech recognition as a test with the Aurora 4 database, the use of our enhancement approach as a preprocessor for feature extraction significantly improved the performance of a baseline recognition system. In another comparison against conventional enhancement algorithms, both the PESQ and the segmental SNR ratings of the LMS algorithm were superior to the other methods for noisy speech enhancement. Index Terms: corpus-based speech model, longest matching segment, speech enhancement, speech recognition
Original languageEnglish
Title of host publicationInterspeech 2014: Proceedings
Pages2710-2714
Number of pages5
Publication statusPublished - 2014

Fingerprint

Dive into the research topics of 'Speech Enhancement from Additive Noise and Channel Distortion - a Corpus-Based Approach'. Together they form a unique fingerprint.

Cite this