The Improving Accurate Candidates for Missing Data Using Benefit Performance of (ML-SOM)

Authors

  • Abeer Abdullah AL-Mohdar Information Technology Department, Faculty of Computers and Information Technology, Hadhramout University
  • Mohamed Abdullah Bamatraf Information Technology Department, Faculty of Computers and Information Technology, Hadhramout University. Received on 4/2/2020 and Accepted for Publication on17/6/2020

Keywords:

Self-organizing-map, Multi-layer self-organizing-map, missing data, Data mining, Neural networks, Machine learning

Abstract

Missing data is one of the major challenges in extracting and analyzing knowledge from dataset. The performance of
training quality was affected by the appearance of missing data in a datasets. For this reason, there is a need for a quick
and reliable method to find possible solutions in order to provide an accurate system. Therefore, the previous studies
provided robust ability of Self Organizing Map (SOM) algorithm to deal with the missing values [6, 20]. However, it has
a drawback such as an error rate(ERR) in the missing values that increase huge dataset. This study is mainly based on the
projection of unsupervised Multilayer SOM (ML-SOM) for missing values. The global methodology presented the
combination of advantages of the proposed ML-SOM to obtain a precise method with various missing rates: 5%, 10%
and 20%. The experiments were conducted by adopting two types of commonly used data benchmarks (IRIS and BreastCancer) from Weka 3.9 machine learning tool. The new proposed method ML-SOM provides a minimum Error Rate
(ERR) and high accuracy (ACC)in small and large datasets compared to other standard classifier types (Bayes-Net, Kmeans and SOM)

Downloads

Published

2023-11-22