Categories
- All Posts 549
- Practical Open Source Projects 478
- Tutorial Articles 22
- Online Utilities 13
- AI news 7
- Tiny Startups Showcase 7
- Claude Code Skills 6
- Prompt Templates 5
- Hugging Face Spaces 3
- OpenClaw Use Cases 3
- LLM Learning Resources 1
- Online AI Image Tools 1
- OpenClaw Master Skills Collection 1
- Rust Training Resources 1
- AI Short Drama Tools 1
- My Favorites 0
Posts tagged with: chunking
Content related to chunking
rag‑chunk: CLI Tool to Benchmark and Optimize RAG Chunking
Rag‑chunk is a lightweight, Python‑based command‑line utility that lets data scientists and ML engineers test, benchmark, and refine chunking strategies for Retrieval‑Augmented Generation (RAG). With support for fixed‑size, sliding‑window, paragraph, and even recursive character splitting, you can compare recall scores, tune token‑accurate boundaries using tiktoken, and export results in tables, JSON or CSV. This article walks through installation, key features, real‑world examples, and tips to choose the best strategy for your markdown documents. Whether you’re prototyping a new RAG pipeline or fine‑tuning a production read‑time system, rag‑chunk gives you the data you need to make informed decisions.