LMMS Keyboard - Search News

Enabling the finetuning of the latest Large Multimodal Models

More and more large multimodal models (LMMs) are being released from time to time, but the finetuning of these models is not always straightforward. This codebase aims to provide a unified, minimal ...

GitHub

Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency

Understanding real-world videos with complex semantics and long temporal dependencies remains a fundamental challenge in computer vision. Recent progress in multimodal large language models (MLLMs) ...

IEEE

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

Abstract: Existing Large Multimodal Models (LMMs) generally focus on only a few regions and languages. As LMMs continue to improve, it is increasingly important to ensure they understand cultural ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Enabling the finetuning of the latest Large Multimodal Models

Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

Trending now