Shivam Mehta shivammehta25

Hello I am Shivam Mehta

I am a Researcher in Department of Speech Music and Hearing at KTH Royal Institute of Technology!

🎯 Currently, I am in Menlo Park, CA as a Research Scientist Intern at Meta.
🎯 Summer 2024 I was in Seattle, WA as a Research Intern at Microsoft Research.
🔭 I work on Speech synthesis with probabilistic generative models
💬 Ask me about: Python, Deep Learning, Machine Learning and Generative modelling
📫 Reach me or read my blog at: https://shivammehta25.github.io/
💬 Open for collaborations and interesting projects!

My recent works:
⚡ MAGI: Multimodal Audio and Gesture, Integrated: https://shivammehta25.github.io/MAGI/
⚡ 🍵 Matcha-TTS: https://shivammehta25.github.io/Matcha-TTS/
⚡ Unified speech and gesture synthesis using flow matching: https://shivammehta25.github.io/Match-TTSG/
⚡ Diff-TTSG: https://shivammehta25.github.io/Diff-TTSG/
⚡ OverFlow: https://shivammehta25.github.io/OverFlow
⚡ Neural HMM TTS: https://shivammehta25.github.io/Neural-HMM

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shivam Mehta shivammehta25

Achievements

Achievements

Highlights

Block or report shivammehta25

Hello I am Shivam Mehta

I am a Researcher in Department of Speech Music and Hearing at KTH Royal Institute of Technology!

GitHub Stats:

Pinned Loading