CMU Multimodal Opinion Sentiment and Emotion Intensity (CMU-MOSEI) is the largest dataset of sentence-level sentiment analysis and emotion recognition in online videos. CMU-MOSEI contains over 12 hours of annotated video from over 1000 speakers and 250 t
data_path = 'path_to_mosi_dataset' texts, videos, audios, labels = load_dataset(data_path) 特征提取 对每种模态信息进行特征提取。例如,文本可以使用TF-IDF或BERT进行编码,视频和音频可以使用预训练的卷积神经网络(CNN)或循环神经网络(RNN)进行特征提取。 # 文本特征提取 text_features = extract_text_fe...
To use this repository in publications, please cite the original paper for CMU-MOSEI dataset: @inproceedings{zadeh2018multi, title={Multi-attention recurrent network for human communication comprehension}, author={Zadeh, Amir and Liang, Paul Pu and Poria, Soujanya and Vij, Prateek and Cambria, Er...