We introduce VLM-Mamba, the first Vision-Language Model built entirely on State Space Models (SSMs), specifically leveraging the Mamba architecture.
ai ml transformers pytorch state-space attention ssm mamba vision-transformer vision-language-model vlms vision-ssm language-ssm
-
Updated
Sep 29, 2025 - Python