Abstract: Visual grounding in remote sensing images aims to localize objects referenced by natural language descriptions. While most prior work focuses on single-target grounding via cross-modal ...