Designing and implementing a system of automatic classification of documents for a finite set of languages

Zaitsev V.G., Lan Chunlin

The paper investigates practical aspects of building a system for automatic language recognition and document classification. Basic methods for automatic language identification and document classification are proposed. The architecture of the proposed automatic text classification system for a multilingual environment is described.
