Abstract: Vision foundation models (VFMs) based on self-supervised learning are highly valued in the analysis of RGB imagery for their ability to generalize well without labeled data. Foundation ...