Abstract: CLIP, a foundational vision-language model, has emerged as a powerful tool for open-vocabulary semantic segmentation. While freezing the text encoder preserves its powerful embeddings, ...
Abstract: Domain-adaptive remote sensing image (RSI) semantic segmentation mitigates the overfitting problem that affects the effectiveness of segmentation, which results from the scarcity of ...