Automatic Construction of Persian ICT WordNet using Princeton WordNet

سال انتشار: 1398
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 337

فایل این مقاله در 11 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_JADM-7-1_010

تاریخ نمایه سازی: 19 تیر 1398

چکیده مقاله:

WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose several automatic methods to extract Information and Communication Technology (ICT)-related data from Princeton WordNet. We, then, add these extracted data to our Persian WordNet. The advantage of automated methods is reducing the interference of human factors and accelerating the development of our bilingual ICT WordNet. In our first proposed method, based on a small subset of ICT words, we use the definition of each synset to decide whether that synset is ICT. The second mechanism is to extract synsets which are in a semantic relation with ICT synsets. We also use two similarity criteria, namely LCS and S3M, to measure the similarity between a synset definition in WordNet and definition of any word in Microsoft dictionary. Our last method is to verify the coordinate of ICT synsets. Results show that our proposed mechanisms are able to extract ICT data from Princeton WordNet at a good level of accuracy.

کلیدواژه ها:

WordNet ، semantic relation ، synset ، part of speech ، Information and Communication Technology

نویسندگان

A. Ahmadi Tameh

Computer Department, Faculty of Engineering, Bu-Ali Sina University, Hamedan, Iran.

M. Nassiri

Computer Department, Faculty of Engineering, Bu-Ali Sina University, Hamedan, Iran.

M. Mansoorizadeh

Computer Department, Faculty of Engineering, Bu-Ali Sina University, Hamedan, Iran.