Paper
13 April 2009 Estimating the threshold for maximizing expected gain in supervised discrete Bayesian classification
Author Affiliations +
Abstract
When mining discrete data to train supervised discrete Bayesian classifiers, it is often of interest to determine the best threshold setting for maximizing performance. In this work, we utilize a discrete Bayesian classification model, and a gain function, to determine the best threshold setting for a given number of training data under each class. Results are demonstrated for simulated data by plotting the expected gain versus threshold settings for different numbers of discrete training data. In general, it is shown that the expected gain reaches a maximum at a certain threshold. Further, this maximum point varies with the overall quantization of the data. Additional results are also shown for different gain functions on the decision variable.
© (2009) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Robert S. Lynch Jr. and Peter K. Willett "Estimating the threshold for maximizing expected gain in supervised discrete Bayesian classification", Proc. SPIE 7344, Data Mining, Intrusion Detection, Information Security and Assurance, and Data Networks Security 2009, 734408 (13 April 2009); https://doi.org/10.1117/12.819067
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Data modeling

Quantization

Computer simulations

Data mining

Error analysis

Algorithm development

Mining

Back to Top