AIBit-Discover Open Source Projects

June 28, 2025

Bilingual Book Maker: AI-Powered Epub/Txt/SRT Translation

Discover bilingual_book_maker, an open-source AI translation tool leveraging various large language models like ChatGPT, Gemini, and Claude to create bilingual EPUB, TXT, and SRT files. Ideal for translating public domain books and subtitle files, this project simplifies the process of creating multi-language content. Learn how to install, configure, and use this powerful tool for your translation needs, supporting a wide array of models and offering features like context-aware translation, prompt tweaking, and Docker support. Enhance your reading experience and language learning with automatically generated bilingual books.

  • Jun 28, 2025

    Monica AI: Your All-in-One AI Assistant for Daily Tasks

    Discover Monica, the versatile AI assistant designed to streamline your daily digital tasks across chat, search, writing, and coding. Available as a Chrome/Edge extension, and on mobile/desktop, Monica leverages leading AI models like GPT-4o and Claude 3.7. Learn how this powerful tool can enhance productivity with features like AI chat, summarization, writing assistance, and web enhancement, trusted by over 10 million users globally.

  • Jun 27, 2025

    MarkItDown: Microsoft's Open-Source Tool for LLM Data Prep

    Discover MarkItDown, Microsoft's powerful open-source Python utility designed to bridge the gap between diverse document formats and Large Language Models (LLMs). This tool intelligently converts files like PDFs, Word documents, Excel sheets, images, audio, and even YouTube URLs into clean, structured Markdown. Ideal for developers and AI practitioners, MarkItDown ensures document content is optimized for LLM consumption, preserving critical structure while maximizing token efficiency. Learn how this practical project can streamline your data preparation workflows for AI applications and text analysis.

  • Jun 27, 2025

    LLaMA-Factory: Unified Fine-Tuning for 100+ LLMs & VLMs

    Fine-tuning large language models can be a complex and resource-intensive task. LLaMA-Factory emerges as a game-changer, offering a unified and highly efficient platform for the fine-tuning of over 100 Large Language Models (LLMs) and Vision Language Models (VLMs). This open-source project, recognized at ACL 2024, simplifies complex AI development workflows with its zero-code command-line interface and intuitive Web UI. Trusted by industry giants like Amazon and NVIDIA, LLaMA-Factory empowers developers and researchers to enhance model performance across diverse tasks, from multi-turn dialogue to multimodal understanding, using advanced techniques like QLoRA and FlashAttention-2. Explore how this powerful tool can accelerate your AI projects.

  • Jun 27, 2025

    Unsloth: Dramatically Speed Up LLM Fine-tuning & Save VRAM

    Discover Unsloth, the open-source library revolutionizing Large Language Model (LLM) fine-tuning. Achieve up to 2x faster training and reduce GPU VRAM consumption by up to 80% compared to standard methods. Unsloth supports a wide range of models like Llama, Qwen, Gemma, and Mistral, along with Text-to-Speech and Vision models. Its user-friendly approach allows for free fine-tuning via beginner-friendly notebooks, enabling efficient training even on limited hardware. Dive into efficient LLM development with Unsloth's powerful features and robust performance.

  • Jun 27, 2025

    Magenta RT: Realtime AI Music Generation Library by Google

    Discover Magenta RT, Google DeepMind's new open-source Python library designed for streaming music audio generation directly on your local device. This innovative project offers real-time capabilities for music creation, serving as a powerful companion to existing AI music platforms. Explore its core features, including chunk-by-chunk generation, dynamic style blending with MusicCoCa, and high-fidelity audio tokenization via SpectroStream. Get started easily with the official Colab demo or through local installation, and unlock new possibilities for AI-powered music production with this Apache 2.0 licensed tool.

  • Jun 27, 2025

    Mastering GRPO: Train Reasoning LLMs with Unsloth Efficiently

    Dive into the world of Reinforcement Learning (RL) and discover how advanced techniques like GRPO have revolutionized AI model training. This article breaks down core RL concepts, explains the difference between PPO and GRPO, and reveals how Unsloth’s cutting-edge optimizations slash GPU VRAM requirements by over 90%. Learn to train powerful reasoning Large Language Models (LLMs) on consumer-grade hardware, optimize your training workflow, and design effective reward functions. From foundational principles to practical implementation tips, unlock the secrets to building smarter, more efficient AI with Unsloth.

  • Jun 27, 2025

    AI-Powered Manga Image Translator for Seamless Reads

    Dive into the world of manga and comics without language barriers! Manga Image Translator is an innovative open-source tool that harnesses advanced AI, including OCR, text detection, and image inpainting, to seamlessly translate text directly within images. Whether you're a fan of Japanese manga, Chinese comics, or any image-based content, this project empowers you to understand previously inaccessible material. It supports over 20 languages and offers versatile installation options, from local Python setups to Docker containers and web interfaces, making powerful translation capabilities accessible to everyone. Discover how this project removes text from images and replaces it with accurate translations, preserving the original artwork.

  • Jun 27, 2025

    Dango-Translator: Real-Time OCR & Comic Translation Software

    Dive into Dango-Translator, an open-source OCR-based tool designed to break language barriers in real time. Whether you're playing foreign games, browsing untranslated websites, or reading raw comics, this powerful Windows software instantly captures and translates text from your screen. Featuring advanced image processing for comics (including text recognition, erasure, and re-embedding), support for 15 diverse translation sources, and cloud-saved settings, Dango-Translator offers a seamless and efficient solution for handling 'raw' content. Discover how this practical project can transform your digital experience, making inaccessible content instantly understandable and enhancing your engagement with multilingual media.

  • Jun 27, 2025

    Defuddle: Your Open-Source Solution for Clean Web Content

    Tired of cluttered web pages? Introducing Defuddle, an innovative open-source JavaScript library designed to extract the main content from any webpage, removing unnecessary elements like ads, comments, and sidebars. This powerful tool provides a clean, standardized HTML output, making it ideal for web clippers, content archiving, and data processing. Defuddle offers advantages over traditional readability tools by being more forgiving in its cleaning process, providing consistent output for various elements, and extracting rich metadata. Whether you're building a web application or need to process online articles programmatically, Defuddle streamlines content acquisition, ensuring you get only the most relevant information without the noise.

  • Jun 27, 2025

    ICONIC: Bubble Skill Icons for Your Developer Portfolio

    Elevate your GitHub READMEs, personal portfolios, and resumes with ICONIC, an open-source library offering a vibrant collection of sleek, bubble-shaped skill icons. Designed for clarity and aesthetic appeal, these icons come with both light and dark theme variants and are incredibly easy to embed using simple HTML snippets. Discover how ICONIC can help you visually showcase your technical proficiencies effectively and attractively.

  • Jun 27, 2025

    Bark: Custom Push Notifications for iOS Devices

    Discover Bark, an innovative open-source iOS application that lets you send custom push notifications directly to your iPhone. Leveraging Apple's APNs, Bark is free, secure, and highly customizable, offering features like grouped notifications, custom icons, sounds, and time-sensitive alerts. It even supports self-hosted servers and encrypted pushes for enhanced privacy. Learn how to integrate Bark into your workflows, from simple URL requests to advanced API parameters, making it an essential tool for developers and users needing tailored notification solutions.


Curated AI tools, open source projects, tutorials, and resources for developers building with artificial intelligence.
