OpenAI delays ChatGPT’s new Voice Mode

Image Credits: OpenAI

In May, when OpenAI first demoed advanced Voice Mode, a far more realistic, nearly real-time conversational experience for its AI-powered chatbot platform ChatGPT, the company said that the feature would roll out to paying ChatGPT users within a few weeks.

More than a month later, OpenAI says that it needs more time.

In a post on its official Discord server, OpenAI says that it had planned to start rolling out advanced Voice Mode in alpha to a small group of ChatGPT Plus users in late June, but that lingering issues forced it to postpone the launch to sometime in July.

“For example, we’re improving the model’s ability to detect and refuse certain content,” OpenAI writes. “We’re also working on improving the user experience and preparing our infrastructure to scale to millions while maintaining real-time responses. As part of our iterative deployment strategy, we’ll start the alpha with a small group of users to gather feedback and expand based on what we learn.”

Advanced Voice Mode might not launch for all ChatGPT Plus customers until the fall, OpenAI says, depending on whether it passes certain internal safety and reliability checks. However, the company says the delay won’t affect the rollout of the new video and screen sharing capabilities demoed separately during OpenAI’s spring press event.

Those capabilities include solving a math problem from a picture of it and explaining the settings menus on a device. They’re designed to work across ChatGPT on smartphones as well as desktop clients, like the ChatGPT app for macOS, which became available to all ChatGPT users earlier today.

“ChatGPT’s advanced Voice Mode can understand and respond with emotions and nonverbal cues, moving us closer to real-time, natural conversations with AI,” OpenAI writes. “Our mission is to bring these new experiences to you thoughtfully.”

On stage at the launch event, OpenAI employees showed off ChatGPT responding almost instantly to requests such as solving a math problem on a piece of paper placed in front of a researcher’s smartphone camera.

OpenAI’s advanced Voice Mode generated considerable controversy over the default voice’s similarity to actress Scarlett Johansson’s. Johansson later released a statement saying that she had hired legal counsel to inquire about the voice and get exact details about how it was developed, and that she’d refused repeated entreaties from OpenAI to license her voice for ChatGPT.

OpenAI denied using Johansson’s voice without permission or hiring a soundalike, but later removed the offending voice.
