From 8efc5307ee8472909cd4d4c18c642f0341c83b68 Mon Sep 17 00:00:00 2001 From: Rex Cheng Date: Sun, 22 Dec 2024 22:40:30 +0000 Subject: [PATCH] add evaluation --- README.md | 6 ++++-- 1 file changed, 4 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index 50b4cb6..91aa732 100644 --- a/README.md +++ b/README.md @@ -189,7 +189,9 @@ Work in progress. ## Evaluation -Work in progress. +You can access the precomputed results on VGGSound, AudioCaps, and MovieGen here: https://huggingface.co/datasets/hkchengrex/MMAudio-precomputed-results + +We have shared our evaluation code here: https://github.com/hkchengrex/av-benchmark ## Training Datasets @@ -198,7 +200,7 @@ MMAudio was trained on several datasets, including [AudioSet](https://research.g ## Citation ```bibtex -@inproceedings{cheng2024putting, +@inproceedings{cheng2024taming, title={Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis}, author={Cheng, Ho Kei and Ishii, Masato and Hayakawa, Akio and Shibuya, Takashi and Schwing, Alexander and Mitsufuji, Yuki}, booktitle={arXiv},