Abstract
Sign language translation (SLT) is usually seen as a two-step process of continuous sign language recognition (CSLR) and gloss-to-text translation. We propose a novel, Transformer-based architecture to jointly perform CSLR and sign-translation in an end-to-end fashion. We extend the ordinary Transformer decoder with two channels to support multitasking, where each channel is devoted to solving a particular problem. To control the memory footprint of our model, channels are designed to share most of their parameters with each other. However, each channel still has a dedicated set of parameters that is fine-tuned with respect to the channel's task. In order to evaluate the proposed architecture, we focus on translating German signs into English sequences and use the RWTH-PHOENIX-Weather 2014 T corpus in our experiments. Evaluation results along with detailed quantitative and qualitative analyses indicate that the mixture of information provided by the multitask decoder was successful and enabled us to achieve superior performance in comparison to other SLT models.
| Original language | English |
|---|---|
| Title of host publication | 2023 International Joint Conference on Neural Networks (IJCNN): Proceedings |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Number of pages | 8 |
| ISBN (Electronic) | 9781665488679 |
| ISBN (Print) | 9781665488686 |
| DOIs | |
| Publication status | Published - 02 Aug 2023 |
| Externally published | Yes |
| Event | 2023 International Joint Conference on Neural Networks - Gold Coast, Australia Duration: 18 Jun 2023 → 23 Jun 2023 |
Publication series
| Name | Proceedings of the International Joint Conference on Neural Networks |
|---|---|
| ISSN (Print) | 2161-4393 |
| ISSN (Electronic) | 2161-4407 |
Conference
| Conference | 2023 International Joint Conference on Neural Networks |
|---|---|
| Abbreviated title | IJCNN 2023 |
| Country/Territory | Australia |
| City | Gold Coast |
| Period | 18/06/2023 → 23/06/2023 |
Bibliographical note
Publisher Copyright:© 2023 IEEE.
Keywords
- End to end learning
- Multitasking
- Sign Language Translation
ASJC Scopus subject areas
- Software
- Artificial Intelligence