Method | Enc Pre-train | Dec Pre-train | BLEU |
---|
MT system | |||
Transformer MT (Liu et al. 2019) | - | - | 22.91 |
Base ST setting | |||
LSTM ST (B ́erard et al. 2018) | ✗ | ✗ | 12.90 |
+pre-train+multitask (B ́erard et al. 2018) | ✓ | ✓ | 13.40 |
LSTM ST+pre-train (Inaguma et al. 2020) | ✓ | ✓ | 16.68 |
Transformer+pre-train (Liu et al. 2019) | ✓ | ✓ | 14.30 |
+knowledge distillation (Liu et al. 2019) | ✓ | ✓ | 17.02 |
TCEN-LSTM (Wang et al. 2020a) | ✓ | ✓ | 17.05 |
Transformer+ASR pre-train (Wang et al. 2020b) | ✓ | ✗ | 15.97 |
Transformer+curriculum pre-train (Wang et al. 2020b) | ✓ | ✗ | 17.66 |
COSTT without pre-training | ✗ | ✗ | 17.83 |
Expanded ST setting | |||
LSTM+pre-train+SpecAugment (Bahar et al. 2019) | ✓ | ✓ | 17.00 |
Multilingual ST+PT (Inaguma et al. 2019) | ✓ | ✗ | 17.60 |
Transformer+ASR pre-train (Wang et al. 2020b) | ✓ | ✗ | 16.90 |
Transformer+curriculum pre-train (Wang et al. 2020b) | ✓ | ✗ | 18.01 |
COSTT with pre-training | ✓ | ✓ | 18.23 |
Transcript | said the doctor yes |
Target | dit le docteur , oui . |
Base ST | dit le docteur . |
COSTT | <asr> said the doctor yes <ast> dit le doc- teur , oui . |
Transcript | i rushed aboard |
Target | je me pre ́cipitai a` bord. |
Base ST | je me pre ́cipitai vers l’ avant . |
COSTT | <asr> i rushed aboard <ast> je me pre ́cipitai a` bord . |
Transcript | is there any news today |
Target | y a-t-il des nouvelles aujourd’ hui ? |
Base ST | est-ce que j’ ai de ́ja` utilise ́ aujourd’ hui ? |
COSTT | <asr> is there any news to day <ast> y a-t- il des nouvelles aujourd’ hui ? |
@inproceedings{dong2021consecutive, title={Consecutive Decoding for Speech-to-text Translation}, author={Qianqian Dong, Mingxuan Wang, Hao Zhou, Shuang Xu, Bo Xu, Lei Li}, booktitle={Proceedings of the AAAI Conference on Artificial Intelligence}, year={2021} }