If you need a model focusing on spech emotion representation, refer to emotion2vec: universal speech emotion representation model.emotion2vec+ seed: Fine-tuned with academic speech emotion data from EmoBox emotion2vec+ base: Fine-tuned with filtered large-scale pseudo-labeled data to obtain the...
multilingual python ai pytorch speech-recognition speech-to-text asr cross-lingual speech-emotion-recognition audio-event-classification aigc llm gpt-4o Updated Jan 8, 2025 Python NexaAI / nexa-sdk Star 4.4k Code Issues Pull requests Discussions Nexa SDK is a comprehensive toolkit for supporti...
Emotion Recognitionswbd_sentimentMacro F161.4link Emotion Recognitionslue_voxcelebMacro F144.0link If you want to check the results of the other recipes, please checkegs2/<name_of_recipe>/asr1/RESULTS.md. CTC Segmentation demo ESPnet1 CTC segmentationdetermines utterance segments within audio files....
Emotion Recognition API Demo - Microsoft Proof of concept for loading Caffe models in TensorFlow YOLO: Real-Time Object Detection AlphaGo - A replication of DeepMind's 2016 Nature publication, "Mastering the game of Go with deep neural networks and tree search" ...
Emotion Recognition API Demo - Microsoft Proof of concept for loading Caffe models in TensorFlow YOLO: Real-Time Object Detection AlphaGo - A replication of DeepMind's 2016 Nature publication, "Mastering the game of Go with deep neural networks and tree search" ...
Enhanced TTS emotion control. Experiment with changing SoVITS token inputs to probability distribution of vocabs. Improve English and Japanese text frontend. Develop tiny and larger-sized TTS models. Colab scripts. Try expand training dataset (2k hours -> 10k hours). ...
multilingualpythonaipytorchspeech-recognitionspeech-to-textasrcross-lingualspeech-emotion-recognitionaudio-event-classificationaigcllmgpt-4o UpdatedMar 23, 2025 Python Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language mo...
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation speech-emotion-recognitionpytorch-implementationiemocapspeech-representation UpdatedDec 23, 2024 ...
pythonopencvemotionfaceface-detectiongender-recognitiongenderemotion-recognition UpdatedAug 24, 2018 Python Load more… Improve this page Add a description, image, and links to thegendertopic page so that developers can more easily learn about it. ...
multilingualpythonaipytorchspeech-recognitionspeech-to-textasrcross-lingualspeech-emotion-recognitionaudio-event-classificationaigcllmgpt-4o UpdatedJan 8, 2025 Python wenet-e2e/wenet Star4.4k Code Issues Pull requests Production First and Production Ready End-to-End Speech Recognition Toolkit ...