Speech & Transcription
Speech-to-text and audio processing
Skills
addis-assistant-stt
Provides Speech-to-Text (STT) and text.
agent-voice
Command-line blogging platform for AI agents.
akaunting
Interact with Akaunting open-source accounting software via REST API.
alexa-cli
Control Amazon Alexa devices and smart home via the `alexacli` CLI.
announcer
Announce text throughout the house via AirPlay speakers using Airfoil +.
assemblyai-transcribe
Transcribe audio/video with AssemblyAI.
audio-gen
Generate audiobooks, podcasts, or educational audio content.
audio-reply
Generate audio replies using TTS.
auto-whisper-safe
RAM-safe voice transcription with auto-chunking; works on 16 GB machines without crashes.
brw-de-ai-ify
Remove AI-generated jargon and restore human voice to text.
chichi-speech
A RESTful service for high-quality text-to-speech using Qwen3.
claw-voice
You are connected to a live user session via voice.
clonev
Clone any voice and generate speech using Coqui XTTS v2.
critical-article-writer
Generate draft articles, outlines.
cult-of-carcinization
Give your agent a voice — and ears.
deepdub-tts
Generate speech audio using Deepdub and attach it as a MEDIA.
deepgram
Command-line interface for Deepgram speech-to-text.
dellight-cro-revenue-ops
DELLIGHT.AI is an AI startup in DIFC, Dubai.
documents-ai
Real-time OCR and data extraction API by Veryfi.
doubao-api-open-tts
Text-to-Speech service using Doubao (Volcano Engine).
duby
Convert text to speech using Duby.so API.
eachlabs-voice-audio
TTS, STT, voice conversion using ElevenLabs, Whisper, RVC.
easyverein-api
Work with the easyVerein v2.0 REST API.
elevenlabs-agents
Create, manage, and deploy ElevenLabs.
elevenlabs-media
ElevenLabs music generation.
elevenlabs-transcribe
Transcribe audio to text using ElevenLabs.
elevenlabs-tts
ElevenLabs TTS - the best ElevenLabs integration for OpenClaw.
elevenlabs-voices
High-quality voice synthesis with 18 personas, 32.