PyTorch implementation of Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities.
audio-captioning multimodal-large-language-models audio-language-models audio-question-answering audio-reasoning
Updated Mar 7, 2025 - Python