Paper
15 October 2021 Research on Chinese classification based on TF-IDF
Liang Xiao, Nianmin Yao
Author Affiliations +
Proceedings Volume 11933, 2021 International Conference on Neural Networks, Information and Communication Engineering; 119330B (2021) https://doi.org/10.1117/12.2615301
Event: 2021 International Conference on Neural Networks, Information and Communication Engineering, 2021, Qingdao, China
Abstract
Chinese text classification has been in the research stage, there are many machine learning algorithms that can be used, such as logical regression, SVM, KNN, naive Bayes, random forest, neural network and so on. In this paper, taking Chinese modern novels as an example, we use various algorithms for classification and comparison, and choose the best algorithm for naive Bayes and neural network. After adjusting the TF-IDF algorithm and processing the participle according to the TF-IDF value, the accuracy of classification is improved obviously. The logistic regression with the lowest accuracy can increase about 6.7%,while the simple Bias and neural network can reach 100%.
© (2021) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Liang Xiao and Nianmin Yao "Research on Chinese classification based on TF-IDF", Proc. SPIE 11933, 2021 International Conference on Neural Networks, Information and Communication Engineering, 119330B (15 October 2021); https://doi.org/10.1117/12.2615301
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Data modeling

Neural networks

Evolutionary algorithms

Machine learning

Library classification systems

Feature extraction

Information technology

Back to Top