OpenAI's new Realtime API lets developers add realistic conversations to their apps

OpenAI announced several new features for app developers at its DevDay conference. The company is now offering tools to integrate AI-generated voices and fine-tune GPT-4o with images.

The new "Realtime API" lets developers add six AI voices to their apps. These voices are different from those used in ChatGPT. To avoid legal issues, developers can't use third-party voices.

OpenAI showed off a travel planning app built on the Realtime API. Users could talk to an AI assistant about a London trip and get quick spoken responses. In the demo, the assistant could also mark restaurant suggestions on a map.

The technology also works over phone calls, for example to place orders. The API does not automatically disclose to the person on the line that they are hearing an AI voice; OpenAI is leaving that disclosure up to developers for now.

New functions for GPT-4o and cost savings

Other Updates:

Developers can use images to fine-tune GPT-4o
New prompt caching to cut costs and speed up responses
"Model distillation" to improve smaller models like GPT-4o mini
Doubled rate limit for the new o1 model

OpenAI says its prompt caching works automatically and can cut the cost of reused (cached) input tokens by up to 50%. "Stored completions" let developers save model interactions on OpenAI's platform for later fine-tuning. The company also released new evaluation tools.
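
To make the caching claim concrete, here is a rough back-of-the-envelope sketch in Python. It assumes a flat 50% discount on cached input tokens; the function name, the example price per million tokens, and the token counts are illustrative assumptions, not OpenAI figures.

```python
def estimate_input_cost(prompt_tokens: int,
                        cached_tokens: int,
                        price_per_million: float,
                        cached_discount: float = 0.5) -> float:
    """Estimate the input-token cost of one request in dollars.

    cached_tokens is the part of the prompt served from the cache.
    cached_discount=0.5 is an assumed 50% discount on those tokens.
    """
    if not 0 <= cached_tokens <= prompt_tokens:
        raise ValueError("cached_tokens must be between 0 and prompt_tokens")
    uncached_tokens = prompt_tokens - cached_tokens
    # Cached tokens are billed at a reduced rate, uncached ones at full price.
    billable = uncached_tokens + cached_tokens * (1 - cached_discount)
    return billable * price_per_million / 1_000_000


# Hypothetical request: 10,000 prompt tokens, 8,000 of them cached,
# at an assumed price of $2.50 per million input tokens.
with_cache = estimate_input_cost(10_000, 8_000, 2.50)
without_cache = estimate_input_cost(10_000, 0, 2.50)
print(f"with cache: ${with_cache:.4f}, without: ${without_cache:.4f}")
```

Under these assumptions the saving reaches the full 50% only when the entire prompt is served from the cache, which is why the figure is stated as "up to".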