Posts tagged with: Computer Vision
FastSAM: 50x Faster Segment Anything Model
Discover FastSAM, a CNN-based Segment Anything Model that delivers SAM-level performance at 50x the speed while training on just 2% of the SA-1B dataset. This open-source project supports everything, text, box, and point prompts, and ships with Python inference, a Gradio UI, Hugging Face demos, and YOLOv8 integration. With an inference time of 40 ms on an RTX 3090, it suits real-time applications such as anomaly detection, salient object detection, and building extraction.
Supervision: Your Reusable Computer Vision Toolkit
Discover Supervision, a powerful open-source Python library designed to streamline your computer vision workflows. From efficient data loading and annotation to seamless integration with popular models like YOLO and Transformers, Supervision simplifies complex tasks. This article explores its core features, including model-agnostic connectors, versatile annotators, and robust dataset utilities for formats like COCO and YOLO. Learn how to accelerate your computer vision projects with this indispensable tool.
Animate Any Portrait: Introducing LivePortrait, Your Open-Source AI Animator
Animate still portraits of humans, cats, & dogs with LivePortrait, an open-source PyTorch implementation. Driven by video, images, or templates, it offers fine-grained control & a user-friendly Gradio interface.
Create Professional ID Photos Instantly: Your Free Online Tool for All Document Types
Create professional ID photos instantly with HivisionIDPhotos. Our free online tool automatically generates standardized photos for passports, visas, and IDs, with the correct specifications and a choice of backgrounds. Upload, select, download!
OmniParser: Revolutionizing Screen Understanding for Vision-Based GUI Agents
OmniParser turns interface screenshots into structured data for vision-based GUI agents, improving how models interact with on-screen elements and giving AI researchers and developers tools for building GUI automation solutions.