CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

Using Synchronous TAG for Source-Side Reordering in SMT

عنوان مقاله: Using Synchronous TAG for Source-Side Reordering in SMT
شناسه ملی مقاله: JR_ITRC-5-4_006
منتشر شده در در سال 1392
مشخصات نویسندگان مقاله:

Amin Mansouri
Hakimeh Fadaei
Heshaam Faili
Mohsen Arabsorkhi

خلاصه مقاله:
Recent efforts in machine translation try to enrich statistical methods by syntactic information of source and target languages. In this paper we present a hybrid machine translator, which combines rule-based and statistical models in a serial manner. This system uses synchronous tree adjoining grammar (STAG) to benefit the context sensitivity of this formalism. In this system, a set of reordering rules in STAG formalism is automatically extracted from a parallel corpus. These rules are used to change the word orders of the source sentence to match the word ordersin the target language. The restructured sentences are then translated to target language using a statistical approach. Experiments are carried out on three different datasets for English-Persian translation. Experimental results show that the presented reordering method combined with conventional or monotone phrase-based SMT, improves the translation quality respectively by ۱.۸ and ۰.۵۵ points regarding BLEU score.

کلمات کلیدی:
Statistical Machine Translation, Reordering Rules, Tree Adjoining Grammar

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/1426578/