PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.
KMeans is one of most popular algorithms in data mining (ranking number 2) and has be widely used in many fields. KMeans uses Euclidean distance to compare two data. However Euclidean distance is sensitive to linear transform in data collection process. Due to these linear transforms, the distance between two data points for the same class (intra-class distance) may larger than those for different classes (inter-class distance) that may cause low clustering performance for KMeans algorithm. In this paper, we propose simple linear regression approach for data clustering. Instead of using Euclidean distance to measure the difference, we recommend using the goodness of fitting (or normalized cross correlation) to measure the similarity and compare two data points. Using this new data comparison technique, we introduce linear regression approach for data clustering and demonstrate that the proposed method has higher performance and low computational cost than KMeans methods.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.
The alert did not successfully save. Please try again later.
Bingcheng Li, "Simple linear regression model based data clustering," Proc. SPIE 10988, Automatic Target Recognition XXIX, 109880A (14 May 2019); https://doi.org/10.1117/12.2518037