Mixed Signals: Understanding Model Disagreement in Multimodal Empathy Detection

ACL ARR 2025 July Submission 1000 Authors

29 Jul 2025 (modified: 03 Sept 2025), ACL ARR 2025 July Submission, Readers: Everyone, License: CC BY 4.0

Abstract: Multimodal models play a key role in empathy detection, but their performance can suffer when modalities provide conflicting cues. To understand these failures, we examine cases where unimodal and multimodal predictions diverge. Using fine-tuned models for text, audio, and video, along with a gated fusion model, we find that such disagreements often reflect underlying ambiguity, as evidenced by annotator uncertainty. Our analysis shows that dominant signals in one modality can mislead fusion when unsupported by others. We also observe that humans, like models, do not consistently benefit from multimodal input. These insights position disagreement as a useful diagnostic signal for identifying challenging examples and improving empathy system robustness.
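The disagreement analysis described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `find_disagreements`, the binary labels, and the toy predictions are all assumptions made for illustration. It only shows one simple way to flag examples where some unimodal prediction diverges from the fused prediction.

```python
from typing import Dict, List


def find_disagreements(unimodal: Dict[str, List[int]], fused: List[int]) -> List[int]:
    """Return indices where at least one unimodal prediction differs from the fused one.

    `unimodal` maps a modality name (e.g. "text") to its per-example labels;
    `fused` holds the multimodal model's labels for the same examples.
    Both are hypothetical inputs, not outputs of the paper's models.
    """
    n = len(fused)
    for name, preds in unimodal.items():
        if len(preds) != n:
            raise ValueError(f"modality {name!r} has {len(preds)} predictions, expected {n}")
    return [
        i
        for i in range(n)
        if any(preds[i] != fused[i] for preds in unimodal.values())
    ]


# Toy predictions (invented for illustration): 1 = empathic, 0 = not empathic.
toy_unimodal = {
    "text": [1, 0, 1],
    "audio": [1, 1, 1],
    "video": [1, 0, 0],
}
toy_fused = [1, 0, 1]

flagged = find_disagreements(toy_unimodal, toy_fused)
print(flagged)
```

Flagged indices could then be compared against annotator uncertainty, which is the kind of relationship the abstract reports.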

Paper Type: Short

Research Area: Multimodality and Language Grounding to Vision, Robotics and Beyond

Research Area Keywords: empathy detection, multimodality, model interpretability

Contribution Types: Model analysis & interpretability, Data analysis

Languages Studied: English

Previous URL: https://openreview.net/forum?id=dzXdX93sGq

Explanation Of Revisions PDF: pdf

Reassignment Request Area Chair: No, I want the same area chair from our previous submission (subject to their availability).

Reassignment Request Reviewers: Yes, I want a different set of reviewers

Justification For Not Keeping Action Editor Or Reviewers: Certain reviewers were unresponsive in the author response period. We hope to get more engagement in the upcoming round.

A1 Limitations Section: This paper has a limitations section.

A2 Potential Risks: N/A

B Use Or Create Scientific Artifacts: Yes

B1 Cite Creators Of Artifacts: Yes

B1 Elaboration: All previous work, models, and tools we reference and use have been properly cited in all sections and listed in the References section.

B2 Discuss The License For Artifacts: Yes

B2 Elaboration: All previous work, models, and tools we reference and use have been properly cited in all sections and listed in the References section.

B3 Artifact Use Consistent With Intended Use: Yes

B3 Elaboration: We fine-tune publicly available, open-source models (e.g., RoBERTa, HuBERT, VideoMAE) strictly for analysis purposes, as described in Section 3.

B4 Data Contains Personally Identifying Info Or Offensive Content: N/A

B4 Elaboration: We reuse a publicly available, previously published dataset whose authors had already moderated its content.

B5 Documentation Of Artifacts: Yes

B5 Elaboration: We will release all code and experimental resources at https://anonymous.4open.science/r/multimodal-empathy-disagreement-F48B to support reproducibility.

B6 Statistics For Data: Yes

B6 Elaboration: Relevant statistics are included in appendix sections A and B.

C Computational Experiments: Yes

C1 Model Size And Budget: Yes

C1 Elaboration: Details are included in section 3 and appendix section B.

C2 Experimental Setup And Hyperparameters: Yes

C2 Elaboration: Details are included in section 3 and appendix section B.

C3 Descriptive Statistics: Yes

C3 Elaboration: Details are included in Sections 3.1 and 4.

C4 Parameters For Packages: Yes

C4 Elaboration: Details are included in Section 3.1.

D Human Subjects Including Annotators: Yes

D1 Instructions Given To Participants: Yes

D1 Elaboration: Details are included in appendix section C.

D2 Recruitment And Payment: Yes

D2 Elaboration: See ethics statement and appendix section C.

D3 Data Consent: Yes

D3 Elaboration: We provide detailed information on what we ask the annotators to annotate and how we plan to use the data. The annotators willingly agreed to participate with full knowledge of the task. We provide the full annotation instructions in appendix section C.

D4 Ethics Review Board Approval: N/A

D5 Characteristics Of Annotators: Yes

D5 Elaboration: Details are provided in appendix section C.

E Ai Assistants In Research Or Writing: No

E1 Information About Use Of Ai Assistants: No

Author Submission Checklist: yes

Submission Number: 1000
