W. Lam

Automatic text categorization is an important and useful basic component of text mining. It has many applications, including intelligent document routing and knowledge management. It is more challenging than ordinary classification problems because of high dimensionality, the need for text feature extraction, and skewed class distributions. We have been developing new algorithms for this problem. In addition to algorithmic progress, we intend to seek a more realistic model that captures the inherent properties of text classification.

Department of Systems Engineering and Engineering Management, CUHK