Posts tagged with: LLM
Skyvern AI: Automate Browser Workflows with LLMs & Vision
Discover Skyvern, an open-source project that automates browser workflows using large language models (LLMs) and computer vision. Unlike traditional automation scripts, which tend to break when a page's layout changes, Skyvern adapts to website changes, making automation more robust. Learn how it can streamline tasks like form filling, data extraction, and multi-step workflows across various websites. Get started with a quick installation and explore its features for applications ranging from invoice downloading to job applications.
Microsoft Agent Lightning: Train AI Agents with Zero Code Change
Discover Agent Lightning, Microsoft's open-source framework for training and optimizing AI agents. It lets you optimize an agent built on any framework with almost no code changes. Using algorithms such as Reinforcement Learning and Automatic Prompt Optimization, Agent Lightning helps developers and researchers improve agent performance, turning agents into 'optimizable beasts.' Explore its core features, understand its architecture, and learn how to integrate this trainer into your projects to accelerate AI agent development.
Build Your Own ChatGPT: Nanochat - The $100 LLM
Discover Nanochat, Andrej Karpathy's open-source project offering a full-stack implementation of a ChatGPT-like LLM. Learn how to train, fine-tune, and run your own large language model on a budget, requiring as little as $100 in compute. This minimal, hackable codebase is designed for accessibility, allowing enthusiasts to experiment with LLM development from scratch. Dive into the world of micro-models and understand the journey from tokenization to web serving, all within a single, cohesive project. Get hands-on with practical scripts and gain insights into the costs and complexities of training your own AI assistant.
Nanobrowser: AI Web Automation Chrome Extension
Discover Nanobrowser, an open-source Chrome extension revolutionizing AI-powered web automation. This free alternative to OpenAI Operator offers a multi-agent system, flexible LLM options, and complete privacy. Run complex workflows directly in your browser using your own API keys, ensuring zero subscription fees and full control over your data. Learn how to install, configure, and leverage Nanobrowser for tasks like news summaries, GitHub research, and shopping automation.
RAG-Anything: The All-in-One Multimodal RAG Framework
Discover RAG-Anything, an innovative open-source framework that revolutionizes Retrieval-Augmented Generation (RAG) by offering comprehensive support for multimodal documents. This cutting-edge system processes text, images, tables, and equations seamlessly, overcoming the limitations of traditional RAG. Learn how RAG-Anything, built on LightRAG, provides an end-to-end pipeline for document ingestion, analysis, and intelligent querying, making it an indispensable tool for academic research, technical documentation, and enterprise knowledge management.
Tongyi DeepResearch: Alibaba's Open-Source AI Agent
Explore Tongyi DeepResearch, Alibaba's open-source AI agent. This 30.5-billion-parameter model activates only 3.3 billion parameters per token and is built for long-horizon, deep information-seeking tasks. It reports state-of-the-art performance on agentic search benchmarks such as Humanity's Last Exam and BrowseComp, and builds on advancements from the WebAgent project. Discover its features, including automated synthetic data generation, continual pre-training on agentic data, and reinforcement learning techniques. Learn how to set up and run the model for your own deep research needs, using either the ReAct or the Heavy inference paradigm.
Build Your Own LLM Server in a Week
Dive into the world of Large Language Models with Tiny LLM, a practical, open-source course designed for systems engineers. Learn to build and optimize LLM serving infrastructure from scratch using MLX on Apple Silicon. This week-long journey covers everything from fundamental matrix operations to advanced C++/Metal kernels and request batching for high throughput. Whether you're curious about LLM internals or aiming to deploy your own, Tiny LLM offers clear guidance and community support to demystify LLM serving.
Koog: Kotlin Framework for AI Agents
Discover Koog, the official Kotlin framework for building robust, scalable, and production-ready AI agents. This open-source project empowers developers to create AI agents across diverse platforms, including backend services, Android, iOS, JVM, and web browsers. Leveraging JetBrains' AI expertise, Koog offers solutions for complex LLM challenges, featuring pure Kotlin implementation, MCP integration, embedding capabilities, custom tool creation, and a flexible graph workflow system. Learn how Koog can streamline your AI agent development with its multiplatform support and array of ready-to-use components.
LangExtract: LLM Text Structuring Made Easy
Discover LangExtract, a powerful Python library designed to transform unstructured text into structured data using Large Language Models (LLMs). This tool offers precise source grounding for extracted information, interactive visualizations, and flexible LLM support, including Gemini and Ollama. Whether you're working with clinical notes, reports, or literature, LangExtract simplifies complex data extraction tasks, enabling reliable results with just a few examples. Learn how to install, configure API keys, and leverage its capabilities for your projects, including handling long documents and visualizing extracted entities in an easy-to-understand HTML format.
Crush: Your Terminal's AI Coding Companion
Discover Crush, the revolutionary AI coding agent designed to supercharge your terminal workflow. This open-source project integrates seamlessly with your favorite LLMs, offering a powerful, flexible, and extensible solution for developers. Learn how Crush enhances your coding experience with features like multi-model support, session management, LSP integration, and broad compatibility across operating systems. Installation is a breeze via various package managers, and customization options allow you to tailor Crush to your specific needs. Dive into the future of terminal-based AI assistance with Crush.