header

An approach for improving statistical translation /

Marwa Nabil Refaie

An approach for improving statistical translation / توجه نحو تحسين الترجمة الاحصائية Marwa Nabil Refaie ; Supervised Ibrahim Farag , Ibrahim Imam - Cairo : Marwa Nabil Refaie , 2017 - 116 Leaves : facsimiles ; 30cm

Thesis (Ph.D.) - Cairo University - Faculty of Computers and Information - Department of Computer Science

A statistical Machine Translation, the state-of-the-art of MT approach nowadays, can learn from a huge amount of data, but originally designed as a batch model. Retraining SMT existing models, using human edits to MT output, are dominating the research field. Traditionally, users feedback is linked to commercial applications, when a review is written or a product is rated similarly, translators feedback is used to improve SMT; therefore, the user could receive better translation learnt from his feedback. This dissertation proposes an online incremental method for statistical machine translation system, in a scenario utilizing experts edit and correction for the SMT output. By updating the model by new translation rules, learning new vocabulary or adapting the MT system to a human translator style. This dissertation presents a new method to improve SMT using post-edits. The proposed method compares the post-edit sentences with the hypotheses translation output in order to automatically detect where the decoder made a mistake and learn from it. Once the errors have been detected, new word alignments are computed between input and post-edit sentences, proposing a set of similarity features, to extract translation units that are then merged online into the system to fix those errors for future translations



Post-Edit Statistical Machine Translation Word alignments