From 36752b0afb540e6455850cece85443f39f856c5f Mon Sep 17 00:00:00 2001
From: Ziyang Ma
Date: Mon, 20 May 2024 16:48:43 +0000
Subject: [PATCH] update README

---
 README.md | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/README.md b/README.md
index 60631c30..5055cf2c 100644
--- a/README.md
+++ b/README.md
@@ -27,6 +27,7 @@ developers to train custom multimodal large language model (MLLM), focusing on <
 5. [Acknowledge](#acknowledge)
 
 # News
+- [Update May. 21, 2024] Recipes for [Spatial Audio Understanding](examples/seld_spatialsoundqa/README.md) has been supported.
 - [Update May. 20, 2024] Recipes for [music caption (MC)](examples/mc_musiccaps/README.md) has been supported.
 - [Update May. 8, 2024] Recipes for [visual speech recognition (VSR)](examples/vsr_LRS3/README.md) has been supported.
 - [Update May. 4, 2024] Recipes for [zero-shot text-to-speech (TTS)](examples/vallex/README.md) has been supported.
@@ -67,6 +68,7 @@ We provide reference implementations of various LLM-based speech, audio, and mus
     - [Visual Speech Recognition (VSR)](examples/vsr_LRS3/README.md)
 - **Audio Task**
     - [Automated Audio Captioning (AAC)](examples/aac_audiocaps/README.md)
+    - [Spatial Audio Understanding](examples/seld_spatialsoundqa/README.md)
 - **Music Task**
     - [Music Caption (MC)](examples/mc_musiccaps/README.md)
 