With the recent announcement of GPT-4o, Open AI has crossed the boundaries with AI. Not only text but now GPT can response taking input in form of video, image and live feed. Just open up camera and start asking questions and GPT doesn't only response by text by a very natural like voice with emotions. Isn't that crossing boundaries?
Key Features of OpenAI's GPT-4o :
- GPT now has vision that means it can now see live video feed or your screen and response.
- GPT can response in text as well as very natural voice.
- Human like conversation and complex answers.
- Speech recognition, text to speech conversion, audio analysis and generation
- video, image and audio/music generation.
- Free to use.
- Very fast response time.
- Multilingual and real time response.
- You can interrupt GPT when it is responding.
Just open phone GPT camera then interact using videocall and you can live ask
question by voice. It is same like talking to some in videocall just so fast ,amazing and very natural with emotions. You can watch demo video here:
What are your thoughts on it?

0 Comments