HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.
Updated Sep 28, 2025 - Python
[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.
[IJCV] FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI Foley master: adds vivid, synchronized sound effects to your silent videos 😝
Workshop on Detection and Classification of Acoustic Scenes and Events
Generate synchronized foley audio for video within ComfyUI, powered by Tencent's HunyuanVideo-Foley model.
This project was developed in the context of the "Computer Music: Languages and Systems" course of the Master's Degree in Music and Acoustic Engineering at Politecnico di Milano.
Commercial Sound Design developed in the context of the "Music Production Technologies" course of the Politecnico di Milano Master's Degree in Music and Acoustic Engineering.