Question
What are the specs for gpt-realtime image input? I've been having where my WebRTC data channel closes on me silently and I couldn't figure out what was wrong.. Until I drastically compressed the image I'm sending.
FWIW I was able to reproduce this with the realtime-next example and by adding the option to upload an image (rather than camera capture)