[ 1 ] A. Nigam, A.K. McCallum, S. Thrun, and T.M. Mitchel, “Text Classification from Labeled and Unlabeled Documents Using Em,” J. Machine Learning, vol. 39, no. 2, pp. 103-134, 2000.
[ 2 ] C. Smyth, “Model Selection for Probabilistic Clustering Using Cross-Validated Likelihood,” Statistics and Computing, vol. 10, no. 1, pp. 63-72, 2000.
[ 3 ] R. Madsen, D. Kauchak, and C. Elkan, “Modeling Word Burstiness Using the Dirichlet Distribution,” Proc. Int’l Conf. Machine Learning, pp. 545-552, 2005.
[ 4 ] C. Elkan, “Clustering Documents with an Exponential-Family Approximation of the Dirichlet Compound Multinomial Distribution,” Proc. Int’l Conf. Machine Learning, pp. 289-296, 2006.
[ 5 ] I. Cheeseman, J. Kelly, M. Self, J. Stutz, W. Taylor, and D.Freedman, “Autoclass: A Bayesian Classification System,” Proc.Int’l Conf. Machine Learning, pp. 54-64, 1988.
[ 6 ] J. Rissanen, “Modeling by Shortest Data Description,” Automatica, vol. 14, pp. 465-471, 1978.
[ 7 ] K. Bozdogan, “Determining the Number of Component Clusters in the Standard Multivariate Normal Mixture Model Using Model-Selection Criteria,” Technical Report UIC/DQM/A83-1,Quantitative Methods Dept., Univ. of Illinois, Chicago, IL, 1983.
[ 8 ] L. Huang, and Z. Wang, “Document Clustering via Dirichlet Process Mixture Model with Feature Selection,” Proc. ACM Int’l Conf. Knowledge Discovery and Data Mining, pp. 763-772, 2010.
[ 9 ] N. Schwarz, “Estimating the Dimension of a Model,” The Annals of Statistics, vol. 6, no. 2, pp. 461-464, 1978.
[ 10 ] U.H.C. Law, M.A.T. Figueiredo, and A.K. Jain, “Simultaneous Feature Selection and Clustering Using Mixture Models,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 26, no. 9, pp. 1154-1166, Sept. 2004.
[ 11 ] Yu, R. Huang, and Z. Wang, “Document Clustering via Dirichlet Process Mixture Model with Feature Selection,” Proc. ACM Int’l Conf. Knowledge Discovery and Data Mining, pp. 763-772, 2010.