A Distributed Clustering Approach for Heterogeneous EnvironmentsUsing Fuzzy Rough Set Theory مقاله

نویسنده: Mozafari، Niloofar ؛ Nikouei Mahani، Mohammad-Ali ؛ Hashemi، Sattar ؛

International Journal Of Information Science And Management July & December 2020, Volume 18 - Number 2 رتبه بین المللی (وزارت علوم/ISC (‎14 صفحه - از 215 تا 228 )

کلیدواژه ها: Distributed Clustering Fuzzy Rough Set Theory Data Distributed Mining

چکیده:

Vast majority of data mining algorithms have been designed to work on centralized data, unfortunately however, almost all of nowadays data sets are distributed both geographically and conceptually. Due to privacy and computation cost, centralizing distributed data sets before analyzing them is undoubtedly impractical. In this paper, we present a framework for clustering distributed data which takes into account privacy and computation cost. To do that, we remove uncertain instances and just send the label of the other instances to the central location. To remove the uncertain instances, we develop a new instance weighting method based on fuzzy and rough set theory. The achieved results on well-known data verify effectiveness of the proposed method compared to previous works.

خلاصه ماشینی:

A Distributed Clustering Approach for Heterogeneous Environments Using Fuzzy Rough Set Theory Niloofar Mozafari Department of Designing & System Operation, Regional Information Center for Science and Technology, RICeST, Shiraz, IRAN Corresponding Author: mozafari Hricest. Samatova, Ostrouchov, Geist & Melechko (2005) presented another hierarchical clustering in distributed environments that send a representative from each cluster to a central location. There are also some methods for distributed clustering in homogeneous data that work well in distributed environments but they do not specifically address the privacy issues (Tasoulis & Vrahatis, 2004), (Dhillon & Modha, 2002). A density based clustering in distributed environments was proposed in (Santos, Syed, Naldi, Campello & Sander, 2019). For selecting the appropriate labels, we propose a new instance weighting method based on fuzzy and rough set theory. Our Fuzzy Rough set Instance Weighting (FRIW) gives weight to each instance and instead of sending the entire data, the label of instances with higher weights are just sent to the central location. Instead of sending the entire instances with all of their features to central location, the labels of selected data are just sent. As it is obvious, in this data set outlier instances which are far from the core (central) of the cluster, have the minimum weights. For example, in Pendig data set; with removing the boundary instances; the number of label of instances in the central location decreases from 44964 to 25526. In order to select the appropriate labels, we propose a new instance weighting method based on fuzzy and rough set theory.

دریافت فایل ارجاع :
(پژوهیار, , , )

دانلود HTML
دانلود PDF

ورود / عضویت

برای مشاهده محتوای مقاله لازم است وارد پایگاه شوید. در صورتی که عضو نیستید از قسمت عضویت اقدام فرمایید.

ورود

عضویت

تحتاج دخول لعرض محتوى المقالة. إذا لم تكن عضوًا ، فتابع من الجزء الاشتراک.
إن كنت لا تقدر علی شراء الاشتراك عبرPayPal أو بطاقة VISA، الرجاء ارسال رقم هاتفك المحمول إلی مدير الموقع عبر webmaster@noormags.com .

You need Sign in to view the content of the article. If you are not a member, proceed from part Sign up.
If you fail to purchase subscription via PayPal or VISA Card, please send your mobile number to the Website Administrator via webmaster@noormags.com .

لینک کوتاه:

1400

1399

1398

1397

1396

1395

1394

1393

1392

1391

1390

1389

1388

1387

1386

1385

1384

1383

1382

1381

A Distributed Clustering Approach for Heterogeneous EnvironmentsUsing Fuzzy Rough Set Theory مقاله