FunCineForge: Revolutionizing Zero-Shot Movie Dubbing with Open-Source Power

The Future of Automated Dubbing is Here

FunCineForge from FunAudioLLM represents a breakthrough in AI-driven movie dubbing. This comprehensive open-source project delivers both a unified dataset pipeline and a multimodal LLM-based dubbing model that excels across diverse cinematic scenes – from monologues and narration to complex multi-speaker dialogues.

What Makes FunCineForge Special?

🎬 End-to-End Dataset Pipeline

The pipeline transforms raw video footage into production-ready dubbing datasets:

Video Normalization & Trimming (normalize_trim.py)
Speech Separation (vocals from background music)
Video Clipping with bilingual subtitle support (Chinese/English)
Speaker Diarization using multimodal active speaker detection
CoT Correction with MLLMs (Gemini-3-Pro) for 0.94% CER accuracy

🤖 State-of-the-Art Dubbing Model

Superior audio quality
Perfect lip synchronization
Seamless timbre transitions
Excellent instruction following

🚀 Quick Start Guide

git clone [email protected]:FunAudioLLM/FunCineForge.git
conda create -n FunCineForge python=3.10
conda activate FunCineForge
python setup.py

Dataset Processing:

python normalize_trim.py --root datasets/raw_zh --intro 10 --outro 10
cd speech_separation && python run.py --root datasets/clean/zh
cd ../video_clip && bash run.sh --stage 1 --stop_stage 2 --lang zh

Inference:

cd exps
bash infer.sh

📊 Key Results

CineDub-CN: First large-scale Chinese TV dubbing dataset
CER reduced from 4.53% → 0.94%
Speaker diarization error: 8.38% → 1.20%
Consumer-grade GPU inference

🎯 Who Should Use This?

AI Researchers building speech/video datasets
Content Creators needing automated dubbing
Film Studios exploring localization solutions
Developers working on multimodal TTS

Recent Updates (March 2026)

✅ Open-sourced inference code + checkpoints
✅ English dataset (CineDub-EN) released
✅ Bilingual pipeline support
✅ Demo samples available at funcineforge.github.io

📚 Citation

@misc{liu2026funcineforgeunifieddatasettoolkit,
title={FunCineForge: A Unified Dataset Toolkit...},
author={Jiaxuan Liu and Yang Xiang...}

⭐ Star the repo and join the Tongyi Lab Speech Team's mission to make professional dubbing accessible to everyone.

Explore FunCineForge on GitHub | Dataset Demos