AIBit - Discover Open Source Projects

Page Agent: Control Web UIs with Natural Language

March 18, 2026

Tags: Web Automation, AI Agent, Alibaba, page-agent, gui-agent

Discover Page Agent, Alibaba's in-page GUI agent that lets users drive web interfaces with natural language. It needs no browser extensions, no Python, and no headless browsers: it runs as plain JavaScript in the page. With 10.5k GitHub stars and an MIT license, this TypeScript project supports SaaS AI copilots, smart form filling, accessibility enhancements, and multi-page automation. Integrate it in one line of code and run commands like 'Click the login button'. A good fit for developers building intelligent web experiences.

Practical Open Source Projects

FunCineForge: Zero-Shot Movie Dubbing Pipeline

March 17, 2026

Tags: movie dubbing, dataset pipeline, zero-shot AI, speech diarization, multimodal LLM

Discover FunCineForge, an open-source toolkit for creating large-scale movie dubbing datasets and deploying zero-shot dubbing models. This end-to-end pipeline handles video processing, speech separation, speaker diarization, and multimodal corrections using MLLMs. Build CineDub-CN/EN datasets from raw footage and generate high-quality dubs with lip-sync and timbre matching. It includes inference code and demo samples, and supports both Chinese and English. A good fit for AI researchers and content creators.

Practical Open Source Projects

VoiceChanger: Open‑Source Real‑Time Voice Conversion

March 15, 2026

Tags: Open Source, AI, Docker, voice conversion, gaming

Discover how VoiceChanger lets you transform speech on‑the‑fly using cutting‑edge AI models like Beatrice and RVC. This open‑source project features a cross‑platform GUI, Docker support, network‑mode, and tutorials for AMD Linux and Google Colab. Whether you’re a game developer, streamer, or hobbyist, learn how to install, configure, and upgrade the software in minutes and explore the exciting world of real‑time voice manipulation.

Practical Open Source Projects

EasyOCR: A Fast, Multilingual OCR Library for Python

March 15, 2026

Tags: Open Source, Python, OCR, Multilingual, easyocr

EasyOCR brings 80+ language support right into your Python projects. With a quick pip install, lightweight model downloads, and an intuitive API, you can extract text from images in seconds. This guide covers everything from basic usage and custom language sets to Docker deployment and Hugging Face Space integration. Whether you’re building a photo‑management tool or a data‑entry pipeline, EasyOCR gives you the speed and accuracy you need.

Practical Open Source Projects

VibeVoice: Microsoft’s Open‑Source Voice AI Suite

March 15, 2026

Tags: Open Source, Microsoft, tts, Voice AI, ASR

Explore VibeVoice, Microsoft’s cutting‑edge open‑source toolkit that brings long‑form ASR, multi‑speaker TTS, and real‑time streaming to developers and researchers. Learn how to harness its 60‑minute ASR pipeline, 90‑minute TTS, and lightweight real‑time model, and discover integration with Hugging Face Transformers for seamless deployment.

Practical Open Source Projects

RCLI: On‑Device Voice AI for macOS – Zero‑Cloud, Fast

March 13, 2026

Tags: macOS, Voice AI, on‑device, MetalRT, RCLI

RCLI turns your Mac into a fully‑local voice assistant and document explorer. Powered by Apple Silicon’s MetalRT GPU engine, it runs state‑of‑the‑art STT, LLM, and TTS locally, with no cloud and no API keys. Discover how to install with Homebrew, control 38 macOS actions, embed PDFs with sub‑4 ms RAG, and benchmark MetalRT against llama.cpp. Whether you’re a developer, power user, or AI enthusiast, RCLI brings cutting‑edge local AI to your desktop with minimal setup. Find out why this repo is a must‑try for anyone building voice‑driven macOS tools.

Practical Open Source Projects

LiveTalking: Real-Time AI Digital Human with Lip Sync

March 11, 2026

Tags: WebRTC, digital-human, lip-sync, wav2lip, musetalk

Discover LiveTalking, the open-source powerhouse for creating real-time interactive digital humans. This Python project supports multiple models (wav2lip, musetalk, ernerf) with voice cloning, WebRTC streaming, and interruption handling. Deploy via Docker, run on GPU with 60+ FPS performance, and create commercial-grade talking avatars. Perfect for streamers, educators, and AI developers seeking production-ready lip-sync solutions.

OpenClaw Use Cases

Edict: Ancient Empire AI Agents on OpenClaw

March 10, 2026

Tags: Agent Framework, Multi-Agent, AI Orchestration, OpenClaw, Kanban Dashboard

Discover Edict, an OpenClaw system inspired by China's 1300-year-old 'Three Departments Six Ministries' imperial structure. Its 12 specialized AI agents (Prince, Zhongshu, Menxia, Shangshu + 6 ministries) collaborate under institutional checks and balances that the project positions as stronger than those of CrewAI and AutoGen. Features include a real-time Kanban dashboard, remote skills management, model switching, and a one-click Docker demo. Ancient wisdom meets modern AI orchestration.

Practical Open Source Projects

oMLX: Mac Menu Bar LLM Server with SSD Cache

March 10, 2026

Tags: Apple Silicon, MLX, oMLX, LLM Server, Mac AI

Discover oMLX, a local LLM server for Apple Silicon Macs. Run LLMs, VLMs, and embeddings from your menu bar with continuous batching, tiered KV caching (RAM + SSD), and multi-model serving. Features include an admin dashboard, OpenAI API compatibility, Claude Code optimization, and one-click model downloads from Hugging Face. Install via DMG, Homebrew, or source. A good fit for developers who want production-grade local AI without cloud costs.

OpenClaw Use Cases

40 Verified OpenClaw AI Agent Use Cases Guide

March 10, 2026

Tags: AI Agents, Automation, OpenClaw, Use Cases, Chinese AI

Discover 40 battle-tested OpenClaw AI agent use cases that automate your work and life. They range from Chinese ecosystem integrations like Feishu, DingTalk, and WeCom bots to global scenarios for content creation, DevOps self-healing servers, and multi-agent teams. The guide is beginner-friendly, with copy-paste prompts, setup guides, and difficulty ratings. Transform OpenClaw into your 24/7 AI employee today!
