ChatGPT’s groundbreaking Advanced Voice Mode is transforming how users interact with AI chatbots. Launched in late July 2024 for select ChatGPT Plus subscribers, this innovative feature allows for real-time, human-like conversations without the constraints of text prompts or back-and-forth audio exchanges. Originally showcased at OpenAI’s Spring Update event, Advanced Voice Mode promises a more natural and dynamic conversational experience.
Availability and Rollout of Advanced Voice Mode
OpenAI initially released the Advanced Voice feature to a limited group of ChatGPT Plus subscribers. While the exact size of this initial rollout remains undisclosed, OpenAI has committed to expanding access in the coming weeks, with the goal of reaching all Plus subscribers by fall 2024. Users eager to experience this new feature will receive an email invitation and an in-app notification upon gaining access.
Device Requirements for Advanced Voice Mode
the advanced voice alert on the chatgpt app
Beyond a ChatGPT Plus subscription, specific device requirements apply. Android users need a handset running app version 1.2024.206 or later, while iPhone users require iOS 16.4 or later and the same app version. Meeting these requirements doesn’t guarantee access to the alpha release, as OpenAI’s selection criteria remain undisclosed. Selected users will receive an email notification and an in-app tooltip to activate the feature.
Data Usage and Privacy in Advanced Voice Mode
OpenAI utilizes audio data from Advanced Voice Mode conversations during the alpha release to further train its models. Users who wish to opt out can easily disable data sharing within the app’s settings. Navigate to the Data Controls tab in the Settings menu and deselect “Improve voice for everyone.”
Usage Limits and Conversation Duration
OpenAI implements daily usage limits for both input and output in Advanced Voice Mode. While specific durations haven’t been officially announced and are subject to change, user experiences suggest conversations can last up to 10 minutes. The AI provides a 3-minute warning before concluding the conversation and reverting to the standard voice interface.
Capabilities and Limitations of Advanced Voice Mode
Khan!!!!!! pic.twitter.com/xQ8NdEojSX
— Ethan Sutin (@EthanSutin) July 30, 2024
Advanced Voice Mode leverages the same GPT-4o large language model used for text-based queries, offering a new interaction method with familiar functionalities. Early users have explored diverse applications, from beatboxing and storytelling to rapid calculations. However, certain limitations exist. Users cannot create new memories, utilize custom instructions, or access GPTs within Advanced Voice Mode. While the AI retains context from prior Advanced Voice conversations, it doesn’t yet access previous text-based or standard voice mode chats. Furthermore, respecting creators’ rights, OpenAI has implemented filters to prevent Advanced Voice Mode from generating musical content, including singing.
Conclusion
ChatGPT’s Advanced Voice Mode represents a significant advancement in conversational AI, offering a more natural and engaging user experience. While currently limited to select Plus subscribers, OpenAI’s planned expansion promises wider access in the near future. This innovative feature opens up new possibilities for human-AI interaction, paving the way for more intuitive and dynamic conversations.