multimodal 3

[24.08.13 / ECCV 22'] Modality Synergy Complement Learning with Cascaded Aggregation for Visible-Infrared Person Re-Identification

https://www.ecva.net/papers/eccv_2022/papers_ECCV/papers/136740450.pdfAbstractVisible-Infrared Re-Identification $($VI-ReID$)$ is challenging in image retrievals. The modality discrepancy will easily make huge intraclass variations. Most existing methods either bridge different modalities through modality-invariance or generate the intermediate modality for better performance. Differently, this ..

[24.07.29 / CVPR24'] Multimodal Representation Learning by Alternating Unimodal Adaptation

https://openaccess.thecvf.com/content/CVPR2024/papers/Zhang_Multimodal_Representation_Learning_by_Alternating_Unimodal_Adaptation_CVPR_2024_paper.pdfAbstract Multimodal learning, which integrates data from diverse sensory modes, plays a pivotal role in artificial intelligence. However, existing multimodal learning methods often struggle with challenges where some modalities appear more dominant ..

[24.07.28 / CVPR24'] Querying as Prompt: Parameter-Effcient Learning for Multimodal Language Model

https://openaccess.thecvf.com/content/CVPR2024/papers/Liang_Querying_as_Prompt_Parameter-Efficient_Learning_for_Multimodal_Language_Model_CVPR_2024_paper.pdfAbstractRecent advancements in language models pre-trained on large-scale corpora have signifcantly propelled developments in the NLP domain and advanced progress in multimodal tasks. In this paper, we propose a Parameter Effcient multimodal..