Accurate Quality Estimation in Machine Translation

Thursday 27 March 2025


As machine translation technology continues to improve, a major challenge has emerged: how to accurately assess the quality of translations before they’re complete. Current methods struggle when faced with incomplete or partial translations, which are increasingly common in applications like simultaneous speech translation.


Researchers have developed a new approach that addresses this problem by training a special type of model to evaluate partial translations. This model, called Partial COMET, is designed to work alongside traditional machine translation systems to provide early feedback on the quality of translations as they’re being generated.


The key innovation behind Partial COMET is its ability to learn from incomplete translations. Unlike traditional quality estimation models, which are trained on complete translations and struggle with partial ones, Partial COMET is explicitly designed to evaluate translations at various stages of completion. This allows it to provide more accurate assessments even when the translation is not yet finished.


To train Partial COMET, researchers used a large dataset of machine-translated text, along with corresponding human ratings of quality. They then developed an algorithm that takes into account not only the partial translation itself but also the context in which it was generated – such as the language pair and the length of the original source text.


The results are promising: Partial COMET outperforms traditional quality estimation models when evaluating partial translations, with correlations between predicted scores and human ratings reaching up to 25% higher than before. This means that machine translation systems can now make more informed decisions about which candidates to pursue further, reducing computational costs and improving overall performance.


But the benefits of Partial COMET extend beyond just improved accuracy. By providing early feedback on translation quality, this model can also help identify potential issues with the translation process itself – such as errors in segmentation or language modeling. This information can then be used to fine-tune the machine translation system and improve its overall performance over time.


As machine translation becomes increasingly important in a wide range of applications – from customer service chatbots to medical research databases – the need for accurate quality estimation has never been greater. With Partial COMET, researchers have taken an important step towards meeting that challenge head-on, paving the way for more efficient and effective machine translation systems in the future.


The potential impact of this technology is significant, enabling machine translation systems to adapt to changing contexts and provide better results even when faced with incomplete or uncertain information.


Cite this article: “Accurate Quality Estimation in Machine Translation”, The Science Archive, 2025.


Machine Translation, Quality Estimation, Partial Translations, Simultaneous Speech Translation, Model Training, Large Dataset, Human Ratings, Language Pair, Segmentation, Fine-Tuning


Reference: Vilém Zouhar, Maike Züfle, Beni Egressy, Julius Cheng, Jan Niehues, “Early-Exit and Instant Confidence Translation Quality Estimation” (2025).


Leave a Reply