I built a little browser-based tool for playing with the audio output from the OpenAI GPT-4o audio preview model - you can set a system prompt and a regular prompt, play the resulting audio, download the wav file and also export out the underlying JSON
-
I built a little browser-based tool for playing with the audio output from the OpenAI GPT-4o audio preview model - you can set a system prompt and a regular prompt, play the resulting audio, download the wav file and also export out the underlying JSON
Tool is here (you'll need to provide your own OpenAI API key, stored in localStorage): https://tools.simonwillison.net/openai-audio-output
Notes on how I built it (with Claude) here: https://simonwillison.net/2024/Oct/28/prompt-gpt-4o-audio/
-
Simon Willisonreplied to Simon Willison last edited by
Bonus tool: if you save the raw API JSON as a Gist you can add the Gist ID to this URL to serve up a page that lets other people play back your audio
Here's my example with a system prompt specifying a "thick French accent":
Copyright © 2024 NodeBB | Contributors