Request for a demo on using emotion2vec with Speech + Text modalities
Hello there! I'm currently trying to use emotion2vec for sentiment analysis tasks and appreciate your work. After reading the related papers and documentation, I noticed that you provide instructions on how to predict using speech or text modality data separately.
However, I am also interested in how to combine speech and text data (i.e., Speech + Text) for multimodal emotion prediction. From my reading of the literature, this seems like an important application scenario.
Therefore, could you please provide a simple example demonstrating how to integrate these two modalities of data and run the model? I believe this would be highly beneficial for other users as well.
Thank you!
I was wondering the same thing. Any results yet, please?
You can refer to the papers by Shi et al. (2020, 2023). We reproduced their methods, and our results align with their reported numbers.
Is there a plan to open source the speech+text model?
Sorry, we don't have plans to do so at the moment. You can reproduce it yourself from the papers above.
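For anyone else looking to reproduce this, below is a minimal late-fusion sketch of what a speech + text setup could look like. It assumes you have already extracted utterance-level speech embeddings with emotion2vec and sentence-level embeddings with some pretrained text encoder; the class name `LateFusionClassifier`, the embedding dimensions, and the class count are placeholders, not the authors' released code or the exact configuration used in Shi et al.

```python
# Minimal late-fusion sketch (not the authors' released code).
# Assumes speech embeddings were extracted beforehand with emotion2vec
# (utterance-level) and text embeddings with any pretrained text encoder.
import torch
import torch.nn as nn

SPEECH_DIM = 768   # assumed emotion2vec utterance embedding size
TEXT_DIM = 768     # assumed text-encoder sentence embedding size
NUM_CLASSES = 4    # e.g., a four-class emotion setup

class LateFusionClassifier(nn.Module):
    """Concatenate speech and text embeddings, then classify with an MLP."""
    def __init__(self, speech_dim, text_dim, num_classes, hidden=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(speech_dim + text_dim, hidden),
            nn.ReLU(),
            nn.Dropout(0.1),
            nn.Linear(hidden, num_classes),
        )

    def forward(self, speech_emb, text_emb):
        fused = torch.cat([speech_emb, text_emb], dim=-1)
        return self.net(fused)

# Example with random tensors standing in for real embeddings.
model = LateFusionClassifier(SPEECH_DIM, TEXT_DIM, NUM_CLASSES)
speech_emb = torch.randn(8, SPEECH_DIM)  # batch of emotion2vec utterance embeddings
text_emb = torch.randn(8, TEXT_DIM)      # batch of text-encoder embeddings
logits = model(speech_emb, text_emb)
print(logits.shape)  # torch.Size([8, 4])
```

Concatenation followed by a small MLP is only the simplest fusion baseline; the cited papers may use a more elaborate fusion scheme, so treat this as a starting point rather than a faithful reproduction.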