Simple linear regression model based data clustering

Bingcheng Li

doi:10.1117/12.2518037

14 May 2019 Simple linear regression model based data clustering

Bingcheng Li

Proceedings Volume 10988, Automatic Target Recognition XXIX; 109880A (2019) https://doi.org/10.1117/12.2518037
Event: SPIE Defense + Commercial Sensing, 2019, Baltimore, MD, United States

Abstract

KMeans is one of most popular algorithms in data mining (ranking number 2) and has be widely used in many fields. KMeans uses Euclidean distance to compare two data. However Euclidean distance is sensitive to linear transform in data collection process. Due to these linear transforms, the distance between two data points for the same class (intra-class distance) may larger than those for different classes (inter-class distance) that may cause low clustering performance for KMeans algorithm. In this paper, we propose simple linear regression approach for data clustering. Instead of using Euclidean distance to measure the difference, we recommend using the goodness of fitting (or normalized cross correlation) to measure the similarity and compare two data points. Using this new data comparison technique, we introduce linear regression approach for data clustering and demonstrate that the proposed method has higher performance and low computational cost than KMeans methods.

Conference Presentation

Citation Download Citation

Bingcheng Li "Simple linear regression model based data clustering", Proc. SPIE 10988, Automatic Target Recognition XXIX, 109880A (14 May 2019); https://doi.org/10.1117/12.2518037

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available