Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation
Paper
•
2210.03953
•
Published
For a better experience, please wear headphones.
| Chunk Size 320ms | Chunk Size 2560ms | Offline |
|---|---|---|
| Source Speech Transcript | Reference Text Translation |
|---|---|
| Avant la fusion des communes, Rouge-Thier faisait partie de la commune de Louveigné. | before the fusion of the towns rouge thier was a part of the town of louveigne |
For more examples, please check https://nast-s2x.github.io/.
Check Details 👇
We release French-to-English speech-to-speech translation models trained on the CVSS-C dataset to reproduce results in our paper. You can train models in your desired languages by following the instructions provided below.
| Chunk Size | checkpoint | ASR-BLEU | ASR-BLEU (Silence Removed) | Average Lagging |
|---|---|---|---|---|
| 320ms | checkpoint | 19.67 | 24.90 | -393ms |
| 1280ms | checkpoint | 20.20 | 25.71 | 3330ms |
| 2560ms | checkpoint | 24.88 | 26.14 | 4976ms |
| Offline | checkpoint | 25.82 | - | - |
| Vocoder |
|---|
Before executing all the provided shell scripts, please ensure to replace the variables in the file with the paths specific to your machine.
offline_s2u_infer.shoffline_wav_infer.shSimulEval: b43a7c to evaluate the model in simultaneous inference. This repository is built upon the official SimulEval: a1435b and includes additional latency scorers.streaming_infer.shtrain_ctc.shtrain_nmla.shPlease kindly cite us if you find our papers or codes useful.
@inproceedings{
ma2024nonautoregressive,
title={A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Any Translation},
author={Ma, Zhengrui and Fang, Qingkai and Zhang, Shaolei and Guo, Shoutao and Feng, Yang and Zhang, Min
},
booktitle={Proceedings of ACL 2024},
year={2024},
}
@inproceedings{
fang2024ctcs2ut,
title={CTC-based Non-autoregressive Textless Speech-to-Speech Translation},
author={Fang, Qingkai and Ma, Zhengrui and Zhou, Yan and Zhang, Min and Feng, Yang
},
booktitle={Findings of ACL 2024},
year={2024},
}