TY - GEN
T1 - Modeling the mixtures of known noise and unknown unexpected noise for robust speech recognition
AU - Ming, Ji
AU - Jancovic, Peter
AU - Hanna, Philip
AU - Stewart, Darryl
PY - 2001/1/1
Y1 - 2001/1/1
N2 - Real-world noise may be a mixture of known or trainable noise and unknown unexpected noise. This paper investigates the combination of the conventional noise-reduction techniques with the probabilistic union model to deal with this type of mixed noise for robust speech recognition. In particular, we have developed a multi-environment system to remove the known or trainable acoustic mismatch across different environments. The novelty of this system, in contrast to other multi-environment models, is that the acoustic model for each environment is built upon the probabilistic union model, so that this system is also capable of accommodating further unknown unexpected noise within a specific environment. We have tested the new system for connected digit recognition in different environments, each involving an environment-specific noise and some unknown untrained noise. The results indicate that the new system offers significantly improved performance for the environments involving unknown additional noise, in comparison to a baseline multi-environment system.
UR - http://www.scopus.com/inward/record.url?scp=85009154262&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85009154262
T3 - EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology
SP - 1111
EP - 1114
BT - EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology
A2 - Lindberg, Borge
A2 - Benner, Henrik
A2 - Dalsgaard, Paul
A2 - Tan, Zheng-Hua
PB - International Speech Communication Association
T2 - 7th European Conference on Speech Communication and Technology - Scandinavia, EUROSPEECH 2001
Y2 - 3 September 2001 through 7 September 2001
ER -