10.5 C
Sunday, May 19, 2024

OpenAI may debut a multimodal AI digital assistant quickly

Must read

- Advertisement -

OpenAI has been exhibiting a few of its clients a brand new multimodal AI mannequin that may each speak to you and acknowledge objects, in keeping with a new report from The Information. Citing unnamed sources who’ve seen it, the outlet says this might be a part of what the corporate plans to show on Monday.

The brand new mannequin reportedly gives quicker, extra correct interpretation of photos and audio than what its current separate transcription and text-to-speech fashions can do. It could apparently have the ability to assist customer support brokers “higher perceive the intonation of callers’ voices or whether or not they’re being sarcastic,” and “theoretically,” the mannequin will help college students with math or translate real-world indicators, writes The Info.

The outlet’s sources say the mannequin can outdo GPT-4 Turbo at “answering some sorts of questions,” however continues to be inclined to confidently getting issues incorrect.

It’s attainable OpenAI can be readying a brand new built-in ChatGPT capability to make telephone calls, in keeping with Developer Ananay Arora, who posted the above screenshot of call-related code. Arora additionally spotted evidence that OpenAI had provisioned servers supposed for real-time audio and video communication.

None of this is able to be GPT-5, if it’s being unveiled subsequent week. CEO Sam Altman has explicitly denied that its upcoming announcement has something to do with the mannequin that’s purported to be “materially better” than GPT-4. The Info writes GPT-5 could also be publicly launched by the top of the 12 months.

- Advertisement -

Source link

More articles

- Advertisement -

Latest article