Paper
22 May 2024 Variant bad text recognition based on ChineseBERT-BiGRU
Huawei Song, Yuping Wei, Fangjie Wan, Shengqi Li
Author Affiliations +
Proceedings Volume 13176, Fourth International Conference on Machine Learning and Computer Application (ICMLCA 2023); 1317625 (2024) https://doi.org/10.1117/12.3029068
Event: Fourth International Conference on Machine Learning and Computer Application (ICMLCA 2023), 2023, Hangzhou, China
Abstract
The bad text containing variant words seriously harms the health of the network environment. The existing methods for the recognition of bad variant text do not take into account the importance of the phonetic and positional information for the recognition of bad variant text. In this paper, a ChineseBERT-BiGRU variant bad text recognition model is proposed. This model first learns and trains the word vector of the text by inputting the pinyin information, font information and character information of the text, and then combines the position information of the text. Then the word vector of the text is input into BiGRU to learn richer semantic information of the text. Finally, the bad text containing variant words is identified through Softmax classification. The accuracy, accuracy and F1 values of ChineseBERT-BiGRU in the data set in this paper are compared with those of other models.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Huawei Song, Yuping Wei, Fangjie Wan, and Shengqi Li "Variant bad text recognition based on ChineseBERT-BiGRU", Proc. SPIE 13176, Fourth International Conference on Machine Learning and Computer Application (ICMLCA 2023), 1317625 (22 May 2024); https://doi.org/10.1117/12.3029068
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Semantics

Education and training

Data modeling

Deformation

Classification systems

Target recognition

Tunable filters

Back to Top