TY - GEN
T1 - Using symbolic objects to cluster web documents
AU - Meneses, Esteban
AU - Rodríguez-Rojas, Oldemar
PY - 2006
Y1 - 2006
N2 - Web Clustering is useful for several activities in the WWW, from automatically building web directories to improve retrieval performance. Nevertheless, due to the huge size of the web, a linear mechanism must be employed to cluster web documents. The k-means is one classic algorithm used in this problem. We present a variant of the vector model to be used with the k-means algorithm. Our representation uses symbolic objects for clustering web documents. Some experiments were done with positive results and future work is optimistic.
AB - Web Clustering is useful for several activities in the WWW, from automatically building web directories to improve retrieval performance. Nevertheless, due to the huge size of the web, a linear mechanism must be employed to cluster web documents. The k-means is one classic algorithm used in this problem. We present a variant of the vector model to be used with the k-means algorithm. Our representation uses symbolic objects for clustering web documents. Some experiments were done with positive results and future work is optimistic.
KW - Symbolic data analysis
KW - Web clustering
UR - http://www.scopus.com/inward/record.url?scp=34250634473&partnerID=8YFLogxK
U2 - 10.1145/1135777.1135968
DO - 10.1145/1135777.1135968
M3 - Contribución a la conferencia
AN - SCOPUS:34250634473
SN - 1595933239
SN - 9781595933232
T3 - Proceedings of the 15th International Conference on World Wide Web
SP - 967
EP - 968
BT - Proceedings of the 15th International Conference on World Wide Web
T2 - 15th International Conference on World Wide Web
Y2 - 23 May 2006 through 26 May 2006
ER -