Posts tagged with: Alibaba
Content related to Alibaba
Zvec: Lightning-Fast In-Process Vector DB from Alibaba
Discover Zvec, Alibaba's open-source vector database that embeds directly into your apps with no server to set up. It is built to search billions of vectors in milliseconds, supports dense and sparse embeddings as well as hybrid search, and runs anywhere from notebooks to edge devices. The latest release, v0.3.0, adds Windows and Android support, RabitQ quantization, and a C API for AI agents. Install it via pip or npm and start building RAG apps with this lightweight, production-grade library, which has 9.3k GitHub stars.
Page Agent: Control Web UIs with Natural Language
Discover Page Agent, Alibaba's in-page GUI agent that lets users drive web interfaces with natural language. It needs no browser extensions, no Python, and no headless browsers, only JavaScript running in the page. Written in TypeScript and MIT-licensed, with 10.5k GitHub stars, it supports SaaS AI copilots, smart form filling, accessibility enhancements, and multi-page automation. Integrate it with one line of code and execute commands like 'Click the login button'. A good fit for developers building intelligent web experiences.
Qwen3-ASR: Alibaba’s Open‑Source 52‑Language ASR Model
Alibaba Cloud’s latest release, Qwen3‑ASR, brings state‑of‑the‑art multilingual speech recognition to the open‑source community. The two models, at 1.7B and 0.6B parameters, support 52 languages and 22 Chinese dialects, perform strongly on benchmarks, and rival commercial APIs. The repo ships with a full inference toolkit that works with transformers or the high‑performance vLLM backend, automatic timestamping via the Qwen3‑ForcedAligner, and a ready‑to‑run Gradio demo. Whether you’re a researcher, developer, or hobbyist, this guide walks you through downloading, setting up, benchmarking, and deploying Qwen3‑ASR in Docker or directly on a GPU, so you can start transcribing speech, music, and songs. Key highlights: multilingual support, streaming inference, forced alignment, quick‑start scripts, Docker deployment, and integration with OpenAI‑compatible API endpoints.
Tongyi DeepResearch: Alibaba's Open-Source AI Agent
Explore Tongyi DeepResearch, Alibaba's open-source AI agent. The model has 30.5 billion parameters but activates only 3.3 billion per token, and it is designed for long-horizon, deep information-seeking tasks. It reports state-of-the-art performance on agentic search benchmarks such as Humanity's Last Exam and BrowseComp, and builds on advances from the WebAgent project. Discover its features, including automated synthetic data generation, continual pre-training on agentic data, and reinforcement learning. Learn how to set up and run the model for your own deep research needs, using either the ReAct or the Heavy inference paradigm.