Title: On fast supervised learning for normal mixture models with missing information
Authors: Lin, TI
Lee, JC
Ho, HJ
統計學研究所
資訊管理與財務金融系 註:原資管所+財金所
Institute of Statistics
Department of Information Management and Finance
Keywords: Bayesian classifier;data augmentation;EM algorithrn;incomplete features;Rao-Blackwellization
Issue Date: 1-Jun-2006
Abstract: It is an important research issue to deal with mixture models when missing values occur in the data. In this paper, computational strategies using auxiliary indicator matrices are introduced for efficiently handling mixtures of multivariate normal distributions when the data are missing at random and have an arbitrary missing data pattern, meaning that missing data can occur anywhere. We develop a novel EM algorithm that can dramatically save computation time and be exploited in many applications, such as density estimation, supervised clustering and prediction of missing values. In the aspect of multiple imputations for missing data, we also offer a data augmentation scheme using the Gibbs sampler. Our proposed methodologies are illustrated through some real data sets with varying proportions of missing values. (c) 2006 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
URI: http://dx.doi.org/10.1016/j.patcog.2005.12.014
http://hdl.handle.net/11536/12196
ISSN: 0031-3203
DOI: 10.1016/j.patcog.2005.12.014
Journal: PATTERN RECOGNITION
Volume: 39
Issue: 6
Begin Page: 1177
End Page: 1187
Appears in Collections:Articles


Files in This Item:

  1. 000236696400015.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.