TY - CHAP
T1 - Combining multi-band and frequency-filtering techniques for speech recognition in noisy environments
AU - Jancovic, Peter
AU - Ji, Ming
AU - Hanna, Philip
AU - Stewart, Darryl
AU - Smith, F
PY - 2000
Y1 - 2000
N2 - While current speech recognisers give acceptable performance in carefully controlled environments, their performance degrades rapidly when they are applied in more realistic situations. Generally, the environmental noise may be classified into two classes: the wide-band noise and narrow band noise. While the multi-band model has been shown to be capable of dealing with speech corrupted by narrow-band noise, it is ineffective for wide-band noise. In this paper, we suggest a combination of the frequency-filtering technique with the probabilistic union model in the multi-band approach. The new system has been tested on the TIDIGITS database, corrupted by white noise, noise collected from a railway station, and narrow-band noise, respectively. The results have shown that this approach is capable of dealing with noise of narrow-band or wide-band characteristics, assuming no knowledge about the noisy environment.
AB - While current speech recognisers give acceptable performance in carefully controlled environments, their performance degrades rapidly when they are applied in more realistic situations. Generally, the environmental noise may be classified into two classes: the wide-band noise and narrow band noise. While the multi-band model has been shown to be capable of dealing with speech corrupted by narrow-band noise, it is ineffective for wide-band noise. In this paper, we suggest a combination of the frequency-filtering technique with the probabilistic union model in the multi-band approach. The new system has been tested on the TIDIGITS database, corrupted by white noise, noise collected from a railway station, and narrow-band noise, respectively. The results have shown that this approach is capable of dealing with noise of narrow-band or wide-band characteristics, assuming no knowledge about the noisy environment.
U2 - 10.1007/3-540-45323-7_45
DO - 10.1007/3-540-45323-7_45
M3 - Chapter (peer-reviewed)
SN - 978-3-540-41042-3
VL - 1902
T3 - Lecture Notes in Computer Science
SP - 265
EP - 270
BT - Text, Speech and Dialogue
ER -