Computer Vision - Open Source Projects

FastSAM: 50x Faster Segment Anything Model

April 09, 2026

Tags:

Computer Vision FastSAM Segment Anything YOLOv8 Image Segmentation

Discover FastSAM, the revolutionary CNN-based Segment Anything Model that delivers SAM-level performance at 50x speed using just 2% of SA-1B dataset. This open-source powerhouse supports everything/text/box/points prompts with Python inference, Gradio UI, HuggingFace demos, and YOLOv8 integration. Run it locally in 40ms inference time on RTX 3090 - perfect for real-time applications like anomaly detection, salient object detection, and building extraction.

Supervision: Your Reusable Computer Vision Toolkit

August 21, 2025

Tags:

Computer Vision Open Source Python Object Detection Roboflow

Discover Supervision, a powerful open-source Python library designed to streamline your computer vision workflows. From efficient data loading and annotation to seamless integration with popular models like YOLO and Transformers, Supervision simplifies complex tasks. This article explores its core features, including model-agnostic connectors, versatile annotators, and robust dataset utilities for formats like COCO and YOLO. Learn how to accelerate your computer vision projects with this indispensable tool.

Animate Any Portrait: Introducing LivePortrait, Your Open-Source AI Animator

June 04, 2025

Tags:

Computer Vision AI Tools Open Source Python

Animate still portraits of humans, cats, & dogs with LivePortrait, an open-source PyTorch implementation. Driven by video, images, or templates, it offers fine-grained control & a user-friendly Gradio interface.

"Create Professional ID Photos Instantly: Your Free Online Tool for All Document Types"

June 03, 2025

Tags:

Computer Vision AI Tools Screen Parsing Open Source Online Utilities

Create professional ID photos instantly with HivisionIDPhotos - our free online tool automatically generates standardized photos for passports, visas, and IDs with perfect specifications and background options. Upload, select, download!

OmniParser: Revolutionizing Screen Understanding for Vision-Based GUI Agents

June 03, 2025

Tags:

GUI Automation Computer Vision AI Tools Screen Parsing Open Source

OmniParser revolutionizes screen parsing for vision-based GUI agents by transforming interface screenshots into structured data, enhancing model interaction capabilities, and providing powerful tools for AI researchers and developers building GUI automation solutions.

Categories

Posts tagged with: Computer Vision

FastSAM: 50x Faster Segment Anything Model

Supervision: Your Reusable Computer Vision Toolkit

Animate Any Portrait: Introducing LivePortrait, Your Open-Source AI Animator

"Create Professional ID Photos Instantly: Your Free Online Tool for All Document Types"

OmniParser: Revolutionizing Screen Understanding for Vision-Based GUI Agents