https://github.com/openai/openai-realtime-console but its on ozempic. simplest possible frontend only VAD server implementation, because the original was way too bloated. Ripped out SCSS, added Tailwind.
You can see the diffs here - we are at -1200 LOC deleted now. - more now that we can use the library versions of wavtools and 3rd party realtime api sdk.
see diff
i've also suppressed all the less useful event spam into the console and made the transcripts log nicely
i've also fixed the memory injection so it starts off with a little memory.
also added a mute button.
This is a React project created using create-react-app
that is bundled via Webpack.
Install it by extracting the contents of this package and using;
$ npm i
Start your server with:
$ npm start
It should be available via localhost:3000
.
The console requires an OpenAI API key (user key or project key) that has access to the
Realtime API. You'll be prompted on startup to enter it. It will be saved via localStorage
and can be
changed at any time from the UI.
To start a session you'll need to connect. This will require microphone access. We only have vad (Voice Activity Detection) conversation mode here.
There is only one function enabled;
set_memory
: You can ask the model to remember information for you, and it will store it in a JSON blob on the left. We've added some basic initial memory for the user to mess with.
You can freely interrupt the model at any time.