Clustering newsgroups data using k-means