From 5f26c3c79e37d2f96d9496c94e129a3b68a12ae4 Mon Sep 17 00:00:00 2001 From: Yuan Gong Date: Sat, 9 Dec 2023 21:14:51 -0500 Subject: [PATCH] add code --- README.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.md b/README.md index b56a6b6..aae4df8 100644 --- a/README.md +++ b/README.md @@ -13,6 +13,8 @@ --- ## Introduction +

Illustration of CAV-MAE.

+ This repository contains the official implementation (in PyTorch), pretrained checkpoints, and datasets of LTU and LTU-AS. LTU and LTU-AS are first generation of audio and speech large language models that bridges audio perception with understanding. They not only achieve SOTA on multiple closed-ended audio and speech tasks, but can answer any question based on the given audio.