The big improvement over GPT-3.5 is that OpenAI's fourth-generation language model is multimodal, which means it can process text, images and audio. This means you can show it images and it will respond to them alongside a text prompt. An early example of this, noted by The New York Times, involved giving GPT-4 a photo of some fridge