Research on end-to-end animal behavior speech classification based on Wav2Vec2.0
21 July 2023
ZhenBang Kuang, Shang Jiang, HaoZhang Huang, Yixian Liu, Xinran Li
Proceedings Volume 12717, 3rd International Conference on Artificial Intelligence, Automation, and High-Performance Computing (AIAHPC 2023); 127173H (2023) https://doi.org/10.1117/12.2684696
Event: 3rd International Conference on Artificial Intelligence, Automation, and High-Performance Computing (AIAHPC 2023), 2023, Wuhan, China
Abstract
Research on animal vocal behavior spans many fields, including biomedical science, ecology, and behavioral science. Understanding animal language and behavior remains a major focus of scientific research, and considerable effort continues in this direction. With advances in artificial intelligence, deep learning has become a major method for understanding animal behavior sounds. This article develops an end-to-end animal behavior sound classification model based on Wav2Vec2.0, referred to as W2VCM. The Wav2Vec2.0 model is combined with the CatMeows dataset to obtain a pre-trained model that extracts speech features and performs animal behavior sound classification. Experimental results show that W2VCM achieves higher accuracy than the six other deep learning models tested, on both the original and the re-divided datasets. W2VCM reached an accuracy of 97.97%, which is 2.03% higher than the best result of the other models on the original dataset and 3.13% higher on the re-divided dataset.
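The abstract does not describe how W2VCM turns Wav2Vec2.0 features into a class label, so the following is only a minimal sketch of one common design, not the authors' implementation. It assumes that frame-level feature vectors (here plain Python lists standing in for encoder outputs) are averaged over time and passed through a single linear layer with a softmax. The label names, the functions `mean_pool`, `linear`, `softmax`, `classify`, and `accuracy`, and all weights are hypothetical and chosen for illustration.

```python
import math

# Hypothetical label set, for illustration only (not taken from the paper).
LABELS = ["brushing", "isolation", "waiting_for_food"]


def mean_pool(frames):
    """Average frame-level feature vectors over time into one clip-level vector."""
    n = len(frames)
    dim = len(frames[0])
    return [sum(frame[d] for frame in frames) / n for d in range(dim)]


def linear(x, weights, bias):
    """Compute one logit per class: dot(weights[c], x) + bias[c]."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def softmax(logits):
    """Convert logits to probabilities, subtracting the max for stability."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify(frames, weights, bias):
    """Pool the frames, apply the linear head, and return (label, probabilities)."""
    probs = softmax(linear(mean_pool(frames), weights, bias))
    best = max(range(len(probs)), key=probs.__getitem__)
    return LABELS[best], probs


def accuracy(predicted, actual):
    """Fraction of clips whose predicted label matches the reference label."""
    return sum(p == a for p, a in zip(predicted, actual)) / len(actual)


if __name__ == "__main__":
    # Two made-up 3-dimensional frames standing in for encoder outputs.
    frames = [[1.0, 0.0, 0.0], [3.0, 0.0, 0.0]]
    # Hand-picked weights (identity matrix) so the example is deterministic.
    weights = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    bias = [0.0, 0.0, 0.0]
    label, probs = classify(frames, weights, bias)
    print(label, probs)
```

In a real system the frames would come from a Wav2Vec2.0 encoder and the weights would be learned during fine-tuning; the `accuracy` helper mirrors the metric the abstract reports.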
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
ZhenBang Kuang, Shang Jiang, HaoZhang Huang, Yixian Liu, and Xinran Li "Research on end-to-end animal behavior speech classification based on Wav2Vec2.0", Proc. SPIE 12717, 3rd International Conference on Artificial Intelligence, Automation, and High-Performance Computing (AIAHPC 2023), 127173H (21 July 2023); https://doi.org/10.1117/12.2684696
KEYWORDS
Data modeling, Animals, Animal model studies, Deep learning, Sampling rates, Machine learning, Feature extraction