Visual Basic Language

TransVG++: End-to-End Visual Grounding With Language Conditioned Vision Transformer

Abstract: In this work, we explore neat yet effective Transformer-based frameworks for visual grounding. The previous methods generally address the core problem of visual grounding, i.e., multi-modal ...

IEEE

Visual and Language Collaborative Learning for RGBT Object Tracking

Abstract: Despite the extensive research on RGBT object tracking, there are still several challenges and issues in practical applications, such as modality differences, lighting variations and ...
