An experiment in automatic hierarchical document classification |
| |
Authors: | Kathleen Garland |
| |
Affiliation: | School of Communication, Information and Library Studies, Rutgers—The State University of New Jersey, New Brunswick, NJ 08903, U.S.A. |
| |
Abstract: | A method of automatic document classification was developed as part of a larger research project in materials selection. Documents classed as QA by the Library of Congress classification system were clustered at six thresholds by keyword using the single link technique. The automatically generated clusters were then compared to the Library of Congress subclasses to which the documents had been assigned by human classifiers. Finally, a partial classified hierarchy was formed from the individual document clusters within a single threshold. Implications of the utility of grouping documents for on-line searching are discussed. |
| |
Keywords: | |
本文献已被 ScienceDirect 等数据库收录! |
|