Kagan Tumer's Publications

Display Publications by [Year] [Type] [Topic]


Ensemble Clustering with Voting Active Labels. A. K. Agogino and K. Tumer. Pattern Recognition Letters, 29(14):1947–1953, 2008.

Abstract

Clustering is an integral part of pattern recognition problems and is connected to both the data reduction and the data understanding steps. Combining multiple clusterings into an ensemble clustering is critical in many real world applications, particularly for domains with large data sets, high-dimensional feature sets and proprietary data. This paper presents Voting Active Clusters (VACs), a method for combining multiple ``base'' clusterings into a single unified ``ensemble'' clustering that is robust against missing data and does not require all the data to be collected in one central location. In this approach, separate processing centers produce many base clusterings based on some portion of the data. The clusterings of such separate processing centers are then pooled to produce a unified ensemble clustering through a voting mechanism. The major contribution of this work is in providing an adaptive voting method by which the clusterings (e.g., spatially distributed processing centers) update their votes in order to maximize an overall quality measure. Our results show that this method achieves comparable or better performance than traditional cluster ensemble methods in noise-free conditions, and remains effective in noisy scenarios where any traditional methods are inapplicable.

Download

(unavailable)

BibTeX Entry

@article{tumer-agogino_prl09,
	author = {A. K. Agogino and K. Tumer},
	title = {Ensemble Clustering with Voting Active Labels},
	journal = {Pattern Recognition Letters},
	Volume = {29},
	Number = {14},
	Pages = {1947-1953},
	bib2html_pubtype = {Journal Articles},
	bib2html_rescat = {Classifier Ensembles, Multiagent Systems},
	abstract ={
	Clustering is an integral part of pattern recognition problems and is connected to both the data reduction and the data understanding steps.  Combining multiple clusterings into an ensemble clustering is critical in many real world applications, particularly for domains with large data sets, high-dimensional feature sets and proprietary data. This paper presents Voting Active Clusters (VACs), a method for combining multiple ``base'' clusterings into   a single unified ``ensemble'' clustering that is robust against missing data  and does not require all the data to be collected in one central location.  In this approach, separate processing centers produce many base clusterings based on some portion of the data.  The clusterings of such separate processing  centers are then pooled to produce a unified ensemble clustering through a  voting mechanism.  The major contribution of this work is in providing an adaptive voting method by which the clusterings (e.g., spatially distributed processing centers) update their  votes in order to maximize an overall quality measure. Our results show that  this method achieves comparable or better performance than traditional cluster ensemble methods in noise-free conditions, and remains effective in noisy  scenarios where  any  traditional methods are inapplicable.},
	year = {2008}
} 

Generated by bib2html.pl (written by Patrick Riley ) on Tue Jun 26, 2018 19:10:42