To address this challenge, this paper proposes a pretrained OpenL3-SVM transfer learning framework for the automatic recognition of multi-class voice disorders. The framework combines a pre-trained convolutional neural network, OpenL3, and a support vector machine (SVM) classifier. The Mel spectrum ...
The paper presents the implementation of the speech recognition engine for a voice command controller for fixed-wing Unmanned Aerial Vehicle (UAV) using a deep Convolutional Neural Network that process and classifies voice samples. The architecture was an adaptation of an image processing CNN, ...
The human speech contains paralinguistic information used in many speech recognition applications like automatic speech recognition, speaker recognition, and verification. Gender from voice is considered as one of the essential tasks to be detected for s
we bombard a potential listener with information about ourselves. Voice recognition plays a major part in our social interaction and Pascal Bella, a psychologist at Glazco University, is currently trying to uncover the cerebral architecture of this process by using MRI (magnetic resonance imaging) sc...
CNN: A Speaker Recognition System using a Cascaded Neural Network This work includes the design and implementation of both conventional, and neural network approaches to recognition of the speakers templates which are int... M Zaki,A Ghalwash,AA Elkouny - 《Multidimensional Systems & Signal Processi...
A hybrid model for pathological voice recognition of post-stroke dysarthria by using 1DCNN and double-LSTM networks Post-stroke dysarthria (PSD) is a common and persistent sequela of stroke. To assist objective assessment of dysarthria, the pathological voice recognition... W Ye,Z Jiang,Q Li,...
García, “Improved voice activity detection using contextual multiple hypothesis testing for robust speech recognition,” IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2177–2189, Nov. 2007.[41]R. Tahmasbi and S. Rezaei, “A soft voice activity detection using GARCH ...
展开 关键词: Deepfakes Reviews Forensics Computational modeling Authentication Cloning Speech recognition Feature extraction Artificial intelligence Spectrogram 会议名称: 2024 8th International Conference on Computing, Communication, Control and Automation (ICCUBEA) 主办单位: IEEE 收藏...
Neural network-based models, particularly deep learning models like Recurrent Neural Networks (RNN) and Convolutional Neural Networks (CNN), have revolutionized the field of speech recognition in recent years. These models utilize artificial neural networks to process raw audio data, extract relevant fea...
Emotion Recognition in Speech Using Convolutional Neural Networks This paper aims to implement and analyse the performance of Convolutional Neural Networks (CNNs) in detecting and labelling emotion in speech based on the features used to describe the speech. CNNs are often associated with natural lang...