10 اردیبهشت 1403
غلامرضا احمدي

غلامرضا احمدی

مرتبه علمی: مربی
نشانی: دانشکده مهندسی جم - گروه مهندسی کامپیوتر (جم )
تحصیلات: کارشناسی ارشد / فناوری اطلاعات
تلفن: 07737646160
دانشکده: دانشکده مهندسی جم

مشخصات پژوهش

عنوان Semi-supervised hierarchical ensemble clustering based on an innovative distance metric and constraint information
نوع پژوهش مقالات در نشریات
کلیدواژه‌ها
Ensemble clustering ,AHC ,Semi-supervised clustering, Distance metric, Information constraints
مجله ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE
شناسه DOI https://doi.org/10.1016/j.engappai.2023.106571
پژوهشگران باهوا شن (نفر اول) ، جیان جیانگ (نفر دوم) ، فنگ کیان (نفر سوم) ، دائوگو لی (نفر چهارم) ، یانمینگ یه (نفر پنجم) ، غلامرضا احمدی (نفر ششم به بعد)

چکیده

Agglomerative Hierarchical Clustering (AHC) is a bottom-up clustering strategy in which each object is originally a cluster, and more pairs of clusters are formed by traversing the hierarchy. It has been proven that there is no individual AHC clustering algorithm that can be efficient in all situations. In order to address this problem, ensemble clustering techniques have been introduced. These techniques combine the results of several output partitions to achieve a consensus with higher accuracy compared to an individual clustering algorithm. This paper proposes an AHC-based ensemble semi-supervised clustering algorithm to improve performance. In semi-supervised clustering, class membership information is used in some objects. Here, we introduce the Semi-Supervised Ensemble Hierarchical Clustering based on Constraints Information (SSEHCCI) algorithm. SSEHCCI is developed using several individual clustering algorithms based on AHC. SSEHCCI includes a flexible weighting policy to generate base partitions and uses the constraints information to configure the semi-supervised clustering. In addition, SSEHCCI uses an innovative distance measure to calculate the distance between each pair of objects. Experimental results show that SSEHCCI performs better than existing semi-supervised algorithms on some University of California Irvine (UCI) datasets. Specifically, we observed an average accuracy of SSEHCCI compared to SSDC and RSSC of 2.6% and 1.8%, respectively.