Which audio formats can this skill process?

It can process any audio format supported by the OpenAI Audio API, including .m4a, .mp3, .wav, and .ogg files.

What are the requirements to use the OpenAI Whisper API skill?

You need the 'curl' binary installed on your system and a valid OpenAI API key configured in your environment or configuration file.

How do I improve transcription accuracy for specific terms?

You can use the --prompt flag to provide the model with context, such as specific technical terms or speaker names, to guide the transcription logic.

Can I get the transcription in a machine-readable format?

Yes, by using the --json flag, the skill will output the transcription in a structured JSON format instead of plain text.

OpenAI Whisper Transcriber

Name: OpenAI Whisper Transcriber
Author: HarleyCoops

byHarleyCoops

0•

Data Science & ML

Transcribes audio files into text or JSON using OpenAI's Whisper API via simple terminal commands.

The OpenAI Whisper API skill integrates high-accuracy speech-to-text capabilities directly into your development workflow. It enables the automated transcription of various audio formats such as M4A, OGG, and MP3 using the whisper-1 model. This tool is particularly useful for developers who need to process voice data, as it supports custom prompts for context, language specification for improved accuracy, and flexible output options including plain text or structured JSON for further programmatic manipulation.

Key Features

01Structured JSON output for data processing

02Customizable language and context prompts

03Support for multiple audio formats via curl

04High-accuracy audio-to-text transcription

05Lightweight integration using the whisper-1 model

060 GitHub stars

Use Cases

01Generating searchable transcripts for meetings and interviews

02Automating the creation of subtitles and captions for media files

03Converting voice notes into structured data for NLP applications

What are Skills?·How to Install

Install with 🐟 Skill.Fish

npx skillfish add harleycoops/clawdbot openai-whisper-api

For use in Claude.ai and ChatGPT

Key Features

01Structured JSON output for data processing

02Customizable language and context prompts

03Support for multiple audio formats via curl

04High-accuracy audio-to-text transcription

05Lightweight integration using the whisper-1 model

060 GitHub stars

Use Cases

01Generating searchable transcripts for meetings and interviews

02Automating the creation of subtitles and captions for media files

03Converting voice notes into structured data for NLP applications

What are Skills?·How to Install

Install with 🐟 Skill.Fish

npx skillfish add harleycoops/clawdbot openai-whisper-api

For use in Claude.ai and ChatGPT