Abstract
This paper considers the separation and recognition of overlapped speech sentences assuming single-channel observation. A system based on a combination of several different techniques is proposed. The system uses a missing-feature approach for improving crosstalk/noise robustness, a Wiener filter for speech enhancement, hidden Markov models for speech reconstruction, and speaker-dependent/-independent modeling for speaker and speech recognition. We develop the system on the Speech Separation Challenge database, involving a task of separating and recognizing two mixing sentences without assuming advanced knowledge about the identity of the speakers nor about the signal-to-noise ratio. The paper is an extended version of a previous conference paper submitted for the challenge.
| Original language | English |
|---|---|
| Pages (from-to) | 67-76 |
| Number of pages | 10 |
| Journal | Computer Speech & Language |
| Volume | 24 |
| Issue number | 1 |
| DOIs | |
| Publication status | Published - Jan 2010 |
| Event | Interspeech 2006 - ICSLP - Pittsburgh, United States Duration: 17 Sept 2006 → 21 Sept 2006 |
Fingerprint
Dive into the research topics of 'Combining missing-feature theory, speech enhancement, and speaker-dependent/-independent modeling for speech separation'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver