Emergent Mind

Linear-Time Approximation Scheme for k-Means Clustering of Affine Subspaces

(2106.14176)
Published Jun 27, 2021 in cs.CG and cs.DS

Abstract

In this paper, we present a linear-time approximation scheme for $k$-means clustering of \emph{incomplete} data points in $d$-dimensional Euclidean space. An \emph{incomplete} data point with $\Delta>0$ unspecified entries is represented as an axis-parallel affine subspace of dimension $\Delta$. The distance between two incomplete data points is defined as the Euclidean distance between the two closest points in the axis-parallel affine subspaces corresponding to the data points. We present an algorithm for $k$-means clustering of axis-parallel affine subspaces of dimension $\Delta$ that yields a $(1+\epsilon)$-approximate solution in $O(nd)$ time. The constants hidden behind $O(\cdot)$ depend only on $\Delta$, $\epsilon$, and $k$. This improves the $O(n^2 d)$-time algorithm by Eiben et al. [SODA'21] by a factor of $n$.
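
The distance defined in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm, only the distance between two incomplete points. It assumes an incomplete point is stored as a list in which `None` marks an unspecified entry; the function name `subspace_distance` and this representation are hypothetical choices made for illustration. Because an unspecified coordinate may take any value, it can always be matched to the other point's coordinate, so only coordinates specified in both points contribute to the distance.

```python
import math
from typing import List, Optional

# Hypothetical representation: None marks an unspecified entry.
IncompletePoint = List[Optional[float]]


def subspace_distance(a: IncompletePoint, b: IncompletePoint) -> float:
    """Euclidean distance between the closest points of two
    axis-parallel affine subspaces given as incomplete points."""
    if len(a) != len(b):
        raise ValueError("points must have the same dimension d")
    total = 0.0
    for x, y in zip(a, b):
        # A coordinate that is free in either point can be set to match
        # the other point, so it adds nothing to the distance.
        if x is not None and y is not None:
            total += (x - y) ** 2
    return math.sqrt(total)


# Only the first coordinate is specified in both points: |1 - 4| = 3.
print(subspace_distance([1.0, None, 3.0], [4.0, 2.0, None]))
```

With fully specified points the function reduces to the ordinary Euclidean distance, and each evaluation takes $O(d)$ time.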
