A Graph-based Approach for Persian Entity Linking

سال انتشار: 1399
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 184

فایل این مقاله در 10 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_ITRC-12-3_006

تاریخ نمایه سازی: 14 فروردین 1401

چکیده مقاله:

Most of the data on the web is in the form of natural language, but natural language is highly ambiguous, especially when it comes to the frequent occurrence of entities. The goal of entity linking is to find entity mentions and link them to their corresponding entities in an external knowledge base. Recently, FarsBase was introduced as the first Persian knowledge base with nearly ۷۵۰,۰۰۰ entities. This research suggested one of the first end-to-end unsupervised entity linking systems specifically for Persian, using context and graph-based features to rank candidate entities. To evaluate the proposed method, we used the first Persian entity-linking dataset created by crawling social media text from some popular Telegram channels. The ParsEL results show that the F-Score of the input data set is ۸۷.۱% and is comparable to any other entity-linking system that supports Persian.

نویسندگان

Majid Asgari-Bidhendi

Computer Engineering School Iran University of Science and Technology Tehran, Iran

Farzane Fakhrian

Computer Engineering School Iran University of Science and Technology Tehran, Iran

Behrouz Minaei-Bidgoli

Computer Engineering School Iran University of Science and Technology Tehran, Iran