name: openai-whisper-api
description: Transcribe audio to text via the OpenAI Audio Transcriptions API (Whisper).
homepage: https://platform.openai.com/docs/guides/speech-to-text
metadata: { "openclaw": { "emoji": "☁️", "requires": { "bins": ["curl"], "env": ["OPENAI_API_KEY"] }, "primaryEnv": "OPENAI_API_KEY" } }
OpenAI Whisper API (curl)
Transcribe an audio file via OpenAI's /v1/audio/transcriptions endpoint.
Quick start
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a
Defaults:
- Model: whisper-1
- Output: <input filename>.txt
Common options
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
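For orientation, the options above correspond roughly to multipart form fields of the transcription request. The sketch below is not the contents of transcribe.sh; it is a minimal illustration under stated assumptions. The function name `transcribe`, its positional parameters, and the `DRY_RUN` switch are hypothetical, and mapping `--json` to `response_format=json` is a guess at what the script does. The endpoint and the field names (`file`, `model`, `language`, `prompt`, `response_format`) come from the OpenAI API.

```shell
#!/bin/sh
# Sketch only: the option-to-field mapping is an assumption, not taken from transcribe.sh.

# transcribe FILE [MODEL] [LANGUAGE] [PROMPT] [RESPONSE_FORMAT]
# The body runs in a subshell, so its variables do not leak into the caller.
transcribe() (
  file=$1
  model=${2:-whisper-1}     # default model named in this document
  language=${3:-}
  prompt=${4:-}
  format=${5:-text}         # assumed: plain text unless JSON is requested

  if [ -z "$file" ]; then
    echo "usage: transcribe FILE [MODEL] [LANGUAGE] [PROMPT] [RESPONSE_FORMAT]" >&2
    exit 2
  fi

  # Required parts of the request.
  set -- -sS https://api.openai.com/v1/audio/transcriptions \
    -H "Authorization: Bearer ${OPENAI_API_KEY:-}" \
    -F "file=@${file}" \
    -F "model=${model}" \
    -F "response_format=${format}"

  # Optional fields are sent only when set. --form-string passes the value
  # literally, so ';' or a leading '@' in a prompt is not interpreted by curl.
  if [ -n "$language" ]; then
    set -- "$@" --form-string "language=${language}"
  fi
  if [ -n "$prompt" ]; then
    set -- "$@" --form-string "prompt=${prompt}"
  fi

  # DRY_RUN=1 (hypothetical switch) prints the arguments instead of calling the API.
  if [ "${DRY_RUN:-0}" = 1 ]; then
    printf '%s\n' "$@"
  else
    curl "$@"
  fi
)
```

With `OPENAI_API_KEY` exported, `transcribe /path/to/audio.m4a "" en > /tmp/transcript.txt` would send the request with the default model and a language hint. Setting `DRY_RUN=1` first is a way to inspect the arguments without making a network call.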
API key
Set the OPENAI_API_KEY environment variable, or configure the key in ~/.openclaw/openclaw.json:
{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE",
    },
  },
}