Categories
- All Posts 549
- Practical Open Source Projects 478
- Tutorial Articles 22
- Online Utilities 13
- AI news 7
- Tiny Startups Showcase 7
- Claude Code Skills 6
- Prompt Templates 5
- Hugging Face Spaces 3
- OpenClaw Use Cases 3
- LLM Learning Resources 1
- Online AI Image Tools 1
- OpenClaw Master Skills Collection 1
- Rust Training Resources 1
- AI Short Drama Tools 1
- My Favorites 0
Posts tagged with: Speech-to-Text
Content related to Speech-to-Text
SpeechRecognition: Ultimate Python Speech-to-Text Library
Discover SpeechRecognition, the most comprehensive Python library for converting speech to text. Supports offline engines like CMU Sphinx, Vosk, and OpenAI Whisper, plus cloud APIs from Google, OpenAI, Groq, and Cohere. Install with one pip command and start transcribing microphone input or audio files instantly. Perfect for voice assistants, transcription apps, and meeting recorders. Includes detailed setup guides for PyAudio, PocketSphinx, and troubleshooting tips.
WhisperLiveKit: Real-time Local Speech-to-Text
Discover WhisperLiveKit, a powerful open-source project enabling real-time, fully local speech-to-text, translation, and speaker diarization. It leverages state-of-the-art research like SimulStreaming and WhisperStreaming for unparalleled accuracy and low latency, overcoming the limitations of traditional audio chunk processing. With a user-friendly server and web UI, WhisperLiveKit is ideal for applications ranging from meeting transcriptions and accessibility tools to content creation and customer service analysis. The project offers straightforward installation via pip, various configuration options for different models and backends, and robust deployment guides for both CPU and GPU environments using Docker.