Abstract: Cross-domain federated learning (FL), as an emerging paradigm of distributed machine learning, enables collaborative training across multiple parties while preserving data privacy. However, ...
Abstract: We introduce VideoComp, a benchmark and learning framework for advancing video-text compositionality understanding, aimed at improving vision-language models (VLMs) in fine-grained temporal ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results