Approximation Algorithms for Co-Clustering
Source:
PODS, Vancouver (2008)
Abstract:
Co-clustering is the simultaneous partitioning of the rows and columns
of a matrix such that the blocks induced by the row/column partitions
are good clusters. Motivated by several applications in text mining,
market-basket analysis, and bioinformatics, this problem has attracted
severe attention in the past few years. Unfortunately, to date, most of
the algorithmic work on this problem has been heuristic in nature.
In this work we obtain the first approximation algorithms for the
co-clustering problem. Our algorithms are simple and
obtain constant-factor approximation solutions to the optimum.
We also show that co-clustering is NP-hard, thereby complementing our
algorithmic result.