I built a little browser-based tool for playing with the audio output from the OpenAI GPT-4o audio preview model - you can set a system prompt and a regular prompt, play the resulting audio, download the wav file and also export out the underlying JSON

Simon Willison

Tool is here (you'll need to provide your own OpenAI API key, stored in localStorage): https://tools.simonwillison.net/openai-audio-output

Notes on how I built it (with Claude) here: https://simonwillison.net/2024/Oct/28/prompt-gpt-4o-audio/

Simon Willison

Bonus tool: if you save the raw API JSON as a Gist you can add the Gist ID to this URL to serve up a page that lets other people play back your audio

Here's my example with a system prompt specifying a "thick French accent":

Gist Audio Player

(tools.simonwillison.net)