Multimodal large language models have shown powerful abilities to understand and reason across text and images, but their ...
Bipolar Disorder, Digital Phenotyping, Multimodal Learning, Face/Voice/Phone, Mood Classification, Relapse Prediction, T-SNE, Ablation Share and Cite: de Filippis, R. and Al Foysal, A. (2025) ...
Abstract: Convolutional neural networks (CNNs) have emerged as a preferred approach for medical image analysis. The dimensionality of images is a principal factor in CNN models, as they are designed ...
We present a domain-independent topic segmentation algorithm for multi-party speech. Our feature-based algorithm combines knowledge about content using a text-based algorithm as a feature and about ...
Abstract: Multimodal aspect-based sentiment analysis (MABSA) is a challenging task that predicts sentiment polarity for specific aspect terms based on inputs across modalities. Existing approaches ...