Name: OpenAIWhisperAPI音频转录工具Skill
Rating: 5 (66 reviews)
Author: openclaw

OpenAIWhisperAPI音频转录工具Skill openai-whisper-api

这是一个基于OpenAI Whisper模型的音频转录技能，通过命令行调用API，可将音频文件（如m4a、ogg等格式）快速、准确地转换为文字文本或JSON格式的转录稿。支持指定语言、添加提示词以优化识别效果，是语音识别、字幕生成、会议记录自动化的高效工具。关键词：语音转文字，音频转录，Whisper API，OpenAI，命令行工具，语音识别，字幕生成。

NLP 0 次安装 66 次浏览更新于 2/24/2026

name: openai-whisper-api description: 通过OpenAI音频转录API（Whisper）进行音频转文字。 homepage: https://platform.openai.com/docs/guides/speech-to-text metadata: { “openclaw”: { “emoji”: “☁️”, “requires”: { “bins”: [“curl”], “env”: [“OPENAI_API_KEY”] }, “primaryEnv”: “OPENAI_API_KEY”, }, }

OpenAI Whisper API (curl)

通过OpenAI的/v1/audio/transcriptions端点转录音频文件。

快速开始

{baseDir}/scripts/transcribe.sh /path/to/audio.m4a

默认设置：

模型：whisper-1
输出：<输入文件名>.txt

常用参数

{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "说话人姓名：Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json

API密钥

设置OPENAI_API_KEY环境变量，或在~/.openclaw/openclaw.json中配置：

{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE",
    },
  },
}