Journal of Northeastern University ›› 2007, Vol. 28 ›› Issue (1): 44-48.DOI: -

• OriginalPaper • Previous Articles     Next Articles

On the WCSW: Website classification system wrapper

Gao, Ke-Ning (1); Wang, Bo (1); Zhang, Bin (1); You, Zhen (1)   

  1. (1) School of Information Science and Engineering, Northeastern University, Shenyang 110004, China
  • Received:2013-06-27 Revised:2013-06-27 Online:2007-01-15 Published:2013-06-24
  • Contact: Gao, K.-N.
  • About author:-
  • Supported by:
    -

Abstract: In a website, various information is organized by its own navigation system, which involves the semantic characteristics of classification. In order to fulfill effective extraction of Web information, the WCSW (website classification system wrapper) based on HTML page blocking algorithm is proposed aiming at the classification system of websites. WCSW deals with navigation information blocks involving semantic classification in accordance to extraction rules, which the whole website as an object based on the blocking algorithm and analysis of semantic characteristics, the experimental result shows high-accuracy level classification in extracted websites with good practicability.

CLC Number: