Translator Disclaimer
23 February 2005 Using Kohonen networks for WWW document classification
Author Affiliations +
This paper presents flexible solutions to the clustering and classification of World Wide Web documents. The method proposed in this paper applies the self-organizing Kohonen network known also a self-organizing map (SOM) with two-layer architecture. In this architecture documents become mapped as points on the SOM, in a geometric order that describes the similarity of their contents. This network has been learned by means of unsupervised training technique. After learning process has been completed, the network visualizes semantic relationship between input documents as two-dimensional semantic map. This map is a retrieval interface for an online WWW documents classification system. In this paper, first, the main idea of solution based on SOM has been presented. Next, the operation of this method has been illustrated with the us of synthetic data set. Finally, this technique has been tested by means of real-life WWW documents set.
© (2005) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Filip Rudzinski, Adam Gluszek, and Michal Kekez "Using Kohonen networks for WWW document classification", Proc. SPIE 5775, Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments III, (23 February 2005);


Visual exploration of large data sets
Proceedings of SPIE (April 21 1996)
Supporting online learning with games
Proceedings of SPIE (April 08 2007)
SIP-CCME cooperation in multimedia VoIP network
Proceedings of SPIE (October 11 2006)

Back to Top