Trending Open Source Projects
Discover trending open source projects with rapid star growth. AI summaries help you stay ahead of the curve.
FunCineForge: Zero-Shot Movie Dubbing Pipeline
Discover FunCineForge, the groundbreaking open-source toolkit for creating large-scale movie dubbing datasets and deploying zero-shot dubbing models. This end-to-end pipeline handles video processing, speech separation, speaker diarization, and multimodal corrections using MLLMs. Build CineDub-CN/EN datasets from raw footage and generate high-quality dubs with perfect lip-sync and timbre matching. Includes inference code, demo samples, and supports both Chinese and English. Perfect for AI researchers and content creators.
VoiceChanger: Open‑Source Real‑Time Voice Conversion
Discover how VoiceChanger lets you transform speech on‑the‑fly using cutting‑edge AI models like Beatrice and RVC. This open‑source project features a cross‑platform GUI, Docker support, network‑mode, and tutorials for AMD Linux and Google Colab. Whether you’re a game developer, streamer, or hobbyist, learn how to install, configure, and upgrade the software in minutes and explore the exciting world of real‑time voice manipulation.
EasyOCR: A Fast, Multilingual OCR Library for Python
EasyOCR brings 80+ language support right into your Python projects. With a quick pip install, lightweight model downloads, and an intuitive API, you can extract text from images in seconds. This guide covers everything from basic usage and custom language sets to Docker deployment and Hugging Face Space integration. Whether you’re building a photo‑management tool or a data‑entry pipeline, EasyOCR gives you the speed and accuracy you need.
VibeVoice: Microsoft’s Open‑Source Voice AI Suite
Explore VibeVoice, Microsoft’s cutting‑edge open‑source toolkit that brings long‑form ASR, multi‑speaker TTS, and real‑time streaming to developers and researchers. Learn how to harness its 60‑minute ASR pipeline, 90‑minute TTS, and lightweight real‑time model, and discover integration with Hugging Face Transformers for seamless deployment.
RCLI: On‑Device Voice AI for macOS – Zero‑Cloud, Fast
RCLI turns your Mac into a fully‑local voice assistant and document explorer. Powered by Apple Silicon’s MetalRT GPU engine, it runs state‑of‑the‑art STT, LLM, and TTS locally—no cloud, no API keys. Discover how to install with Homebrew, control 38 macOS actions, embed PDFs with sub‑4 ms RAG, and benchmark MetalRT against llama.cpp. Whether you’re a developer, power user, or AI enthusiast, RCLI brings the most cutting‑edge local AI to your desktop with minimal setup. Find out why this repo is a must‑try for anyone building voice‑driven macOS tools.
LiveTalking: Real-Time AI Digital Human with Lip Sync
Discover LiveTalking, the open-source powerhouse for creating real-time interactive digital humans. This Python project supports multiple models (wav2lip, musetalk, ernerf) with voice cloning, WebRTC streaming, and interruption handling. Deploy via Docker, run on GPU with 60+ FPS performance, and create commercial-grade talking avatars. Perfect for streamers, educators, and AI developers seeking production-ready lip-sync solutions.
Edict: Ancient Empire AI Agents on OpenClaw
Discover Edict, a groundbreaking OpenClaw system inspired by China's 1300-year-old 'Three Departments Six Ministries' imperial structure. 12 specialized AI agents (Prince, Zhongshu, Menxia, Shangshu + 6 ministries) collaborate with institutional checks-and-balances that surpass CrewAI and AutoGen. Features real-time Kanban dashboard, remote skills management, model switching, and one-click Docker demo. Experience ancient wisdom meets modern AI orchestration.
oMLX: Mac Menu Bar LLM Server with SSD Cache
Discover oMLX, the ultimate local LLM server for Apple Silicon Macs. Run LLMs, VLMs, and embeddings from your menu bar with continuous batching, tiered KV caching (RAM + SSD), and multi-model serving. Features admin dashboard, OpenAI API compatibility, Claude Code optimization, and one-click model downloads from Hugging Face. Install via DMG, Homebrew, or source – perfect for developers wanting production-grade local AI without cloud costs.
40 Verified OpenClaw AI Agent Use Cases Guide
Discover 40 battle-tested OpenClaw AI agent use cases that automate your work and life. From Chinese ecosystem integrations like Feishu, DingTalk, and WeCom bots to global scenarios for content creation, DevOps self-healing servers, and multi-agent teams. Beginner-friendly with copy-paste prompts, setup guides, and difficulty ratings. Transform OpenClaw into your 24/7 AI employee today!
Siftly: Self-Hosted AI Twitter Bookmark Organizer
Transform your Twitter/X bookmarks into a powerful local knowledge base with Siftly. This open-source Next.js app runs a 4-stage AI pipeline: import bookmarks without extensions, extract entities, analyze images with vision AI, and auto-categorize with semantic tags. Features AI natural language search, interactive mindmaps, filtered browsing, and exports. One-command setup with Claude CLI integration—no cloud required. Perfect for developers managing bookmark overload.