- 분류 성능 향상을 위한 지역적 선형 재구축 기반 결측치 대치
- ㆍ 저자명
- 강필성,Kang. Pilsung
- ㆍ 간행물명
- 대한산업공학회지
- ㆍ 권/호정보
- 2012년|38권 4호|pp.276-284 (9 pages)
- ㆍ 발행정보
- 대한산업공학회
- ㆍ 파일정보
- 정기간행물| PDF텍스트
- ㆍ 주제분야
- 기타
Classification algorithms generally assume that the data is complete. However, missing values are common in real data sets due to various reasons. In this paper, we propose to use locally linear reconstruction (LLR) for missing value imputation to improve the classification performance when missing values exist. We first investigate how much missing values degenerate the classification performance with regard to various missing ratios. Then, we compare the proposed missing value imputation (LLR) with three well-known single imputation methods over three different classifiers using eight data sets. The experimental results showed that (1) any imputation methods, although some of them are very simple, helped to improve the classification accuracy; (2) among the imputation methods, the proposed LLR imputation was the most effective over all missing ratios, and (3) when the missing ratio is relatively high, LLR was outstanding and its classification accuracy was as high as the classification accuracy derived from the compete data set.