Recognizing Text Genres with Simple Metrics Using Discriminant Analysis

Karlgren, Jussi and Cutting, Douglass (1994) Recognizing Text Genres with Simple Metrics Using Discriminant Analysis. In: Proceedings of the 15th International Conference on Computational Linguistics, Kyoto, Japan.



A simple method for categorizing texts into pre-determined text genre categories using the standard statistical technique of discriminant analysis is demonstrated with application to the Brown corpus. Discriminant analysis makes it possible to use a large number of parameters that may be specific to a certain corpus or information stream, and to combine them into a small number of functions, with the parameters weighted on the basis of how useful they are for discriminating text genres. An application to information retrieval is discussed.
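The idea of combining several simple text metrics into one weighted discriminant function can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes only two metrics (mean word length and mean sentence length, chosen here for illustration), only two genre classes, and a plain two-class Fisher linear discriminant written out by hand. All function names and all feature values below are hypothetical.

```python
import re

def metrics(text):
    """Two illustrative surface metrics: mean word length, words per sentence."""
    words = re.findall(r"[A-Za-z']+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    return [sum(len(w) for w in words) / len(words), len(words) / len(sentences)]

def mean(rows):
    """Column-wise mean of a list of equal-length feature vectors."""
    return [sum(r[i] for r in rows) / len(rows) for i in range(len(rows[0]))]

def fit_fisher(class_a, class_b):
    """Fit a two-class Fisher discriminant on 2-D feature vectors.

    Returns the weight vector w and a threshold placed at the projection
    of the midpoint between the two class means.
    """
    ma, mb = mean(class_a), mean(class_b)
    # Pooled within-class scatter matrix (2x2).
    s = [[0.0, 0.0], [0.0, 0.0]]
    for rows, m in ((class_a, ma), (class_b, mb)):
        for r in rows:
            d = [r[0] - m[0], r[1] - m[1]]
            for i in range(2):
                for j in range(2):
                    s[i][j] += d[i] * d[j]
    det = s[0][0] * s[1][1] - s[0][1] * s[1][0]
    if det == 0:
        raise ValueError("scatter matrix is singular; add more varied samples")
    inv = [[s[1][1] / det, -s[0][1] / det],
           [-s[1][0] / det, s[0][0] / det]]
    diff = [ma[0] - mb[0], ma[1] - mb[1]]
    # w = S^-1 (mean_a - mean_b): metrics that separate the classes well
    # relative to their within-class spread receive larger weights.
    w = [inv[0][0] * diff[0] + inv[0][1] * diff[1],
         inv[1][0] * diff[0] + inv[1][1] * diff[1]]
    mid = [(ma[0] + mb[0]) / 2, (ma[1] + mb[1]) / 2]
    threshold = w[0] * mid[0] + w[1] * mid[1]
    return w, threshold

def classify(w, threshold, x, label_a="informative", label_b="imaginative"):
    """Project x onto w and compare the score with the threshold."""
    score = w[0] * x[0] + w[1] * x[1]
    return label_a if score > threshold else label_b

# Invented feature vectors [mean word length, words per sentence];
# these are not measurements from the Brown corpus.
informative = [[5.2, 24.0], [5.0, 20.0], [5.5, 22.0]]
imaginative = [[4.0, 12.0], [4.2, 15.0], [3.9, 10.0]]
w, threshold = fit_fisher(informative, imaginative)
```

With the fitted `w` and `threshold`, a new text would be scored by calling `classify(w, threshold, metrics(text))`. Handling more than two genres or more than two metrics would need a general multi-class discriminant analysis, which this sketch does not attempt.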

Item Type: Conference or Workshop Item (Paper)
Subjects: H. Information Systems > H.3 INFORMATION STORAGE AND RETRIEVAL
ID Code: 56
Deposited By: Userware Researcher
Deposited On: 25 Oct 2005
Last Modified: 18 Nov 2009 15:51
