https://www.ecva.net/papers/eccv_2022/papers_ECCV/papers/136740450.pdf
Abstract
Visible-Infrared Re-Identification (VI-ReID) is a challenging image-retrieval task: the discrepancy between the visible and infrared modalities easily introduces large intra-class variations. Most existing methods either bridge the two modalities through modality-invariant features or generate an intermediate modality for better performance. In contrast, this paper proposes a novel framework, the Modality Synergy Complement Learning Network (MSCLNet) with Cascaded Aggregation. Its basic idea is to synergize the two modalities to construct diverse representations with identity-discriminative semantics and less noise, and then to complement the synergistic representations by exploiting the advantages of each modality. Furthermore, we propose a Cascaded Aggregation strategy for fine-grained optimization of the feature distribution, which progressively aggregates feature embeddings at the subclass, intra-class, and inter-class levels. Extensive experiments on the SYSU-MM01 and RegDB datasets show that MSCLNet outperforms the state of the art by a large margin. On the large-scale SYSU-MM01 dataset, our model achieves 76.99% Rank-1 accuracy and 71.64% mAP.

- Task: Visible-Infrared Re-Identification
- Problem Definition: the modality discrepancy between visible and infrared images, which causes large intra-class variations
- Approach: synergize the two modalities to construct diverse representations with identity-discriminative semantics and less noise, then complement them using each modality's strengths
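The Cascaded Aggregation idea (progressively aggregating embeddings at the subclass, intra-class, and inter-class levels) can be illustrated with a minimal sketch. This is an assumption-laden toy version, not the paper's actual loss: it treats each (identity, modality) group as a subclass, pulls samples toward subclass centers, pulls subclass centers toward identity centers, and pushes identity centers apart by a hypothetical margin.

```python
import numpy as np

def cascaded_aggregation_sketch(feats, ids, modalities, margin=0.3):
    """Illustrative sketch (assumed formulation, not the paper's exact loss):
    aggregate feature embeddings at three cascaded levels."""
    losses = []
    id_centers = []
    for pid in np.unique(ids):
        mask = ids == pid
        sub_centers = []
        for m in np.unique(modalities[mask]):
            sub = feats[mask][modalities[mask] == m]
            c = sub.mean(axis=0)
            # subclass level: pull samples toward their (identity, modality) center
            losses.append(((sub - c) ** 2).sum(axis=1).mean())
            sub_centers.append(c)
        sub_centers = np.stack(sub_centers)
        idc = sub_centers.mean(axis=0)
        # intra-class level: pull modality sub-centers toward the identity center
        losses.append(((sub_centers - idc) ** 2).sum(axis=1).mean())
        id_centers.append(idc)
    id_centers = np.stack(id_centers)
    # inter-class level: push identity centers at least `margin` apart
    diff = id_centers[:, None, :] - id_centers[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    off_diag = ~np.eye(len(id_centers), dtype=bool)
    losses.append(np.clip(margin - dist[off_diag], 0, None).mean())
    return float(np.mean(losses))

# Toy usage: 8 embeddings, 2 identities, 2 modalities (visible=0, infrared=1)
rng = np.random.default_rng(0)
feats = rng.normal(size=(8, 16))
ids = np.array([0, 0, 0, 0, 1, 1, 1, 1])
mods = np.array([0, 0, 1, 1, 0, 0, 1, 1])
loss = cascaded_aggregation_sketch(feats, ids, mods)
```

The three terms mirror the cascade described in the abstract: aggregation tightens progressively from subclass to intra-class, while the inter-class term keeps identities separated.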