ChatGPT is ready for more natural audio conversations. OpenAI said Tuesday that its popular chatbot now has an enhanced voice feature for those who pay for the premium service. The gadget enables more fluid talks.
The introduction of this enhanced voice feature marks a significant leap forward in human-computer interaction, blurring the lines between artificial intelligence and human conversation. This advancement not only showcases OpenAI's commitment to pushing the boundaries of AI technology but also raises intriguing questions about the future of communication in an increasingly digital world.
The release will continue throughout the week. The company stated that it is not yet available in EU countries, Iceland, Liechtenstein, Norway, Switzerland, or the United Kingdom.
OpenAI unveiled the new capability in May. The release received a lot of attention because of a voice called Sky, which resembled Scarlett Johansson's in the 2013 film "Her." Legal counsel for Johansson sent OpenAI letters stating the firm lacked the right to use the nearly identical voice, prompting OpenAI to halt its use in its products.
This incident highlights the complex legal and ethical considerations surrounding AI-generated voices, particularly when they closely mimic real individuals. It underscores the need for clear guidelines and permissions in the rapidly evolving field of AI-generated content, especially when it comes to replicating the likeness of public figures.
In recent months, people have been able to arrange ChatGPT to speak to them in different voices via a free tier. The upgraded version replies faster and will pause and listen if you interrupt it. There are now nine voices to pick from, and you can provide voice chat instructions in the Customizations section of the app's settings.
"Hope you think it was worth the wait," OpenAI's co-founder and CEO, Sam Altman, wrote in an X post Tuesday. OpenAI, supported by Microsoft, faces rising competition in this field.
Google has recently released its English-language Gemini Live voice tool for Android devices. According to Reuters, Meta will soon offer celebrity voices via Facebook, Instagram, and WhatsApp.
The intensifying competition in the AI voice assistant market is driving rapid innovation and improvement in these technologies. As tech giants vie for dominance, consumers stand to benefit from increasingly sophisticated and user-friendly AI interactions. However, this race also raises concerns about data privacy and the potential for misuse of AI-generated voices in areas such as deepfakes and fraud.
When OpenAI introduced ChatGPT in late 2022, it gained a competitive advantage in the generative AI chatbot market. In August, OpenAI informed media outlets that ChatGPT had more than 200 million weekly active users.
The advanced option is only available to OpenAI Plus, Team, and Enterprise subscribers. The Plus tier is the most cheap option, costing $20 per month. Here is what to do.
If you're paying, it's simple to get started, provided OpenAI has allowed access to your device.
First, check sure you have the most recent version of the app on your phone. Open the ChatGPT application.
According to OpenAI, you will receive a notification in the app once access to the new functionality is enabled. Click the continue button to get started.
Swipe right or tap the two-line icon in the top left corner to start a new chat, then select ChatGPT from the menu above. A sound wave icon should appear to the right of the "Message" text field and microphone icon. Tap that and ensure your sound is turned on.
After a few seconds, you'll hear a small "bump" sound, and the circle in the center of the screen will transform into a fluid sky-like blue and white animation. Start speaking. You should get an answer fast. Expect some audio breakup.
OpenAI claimed to have improved accents in several foreign languages and accelerated interactions. If you don't like what you're hearing, you can tell ChatGPT to speak differently. You can tell it to speed up, for example, or to use a Southern accent.
The ability to customize ChatGPT's voice and accent opens up exciting possibilities for language learning and cultural exchange. Users can now practice conversations in various dialects and accents, potentially enhancing their language skills and cultural understanding. This feature could prove particularly valuable for international business communications, travel preparation, or simply broadening one's linguistic horizons. With advanced speech mode, you can have ChatGPT read you a bedtime tale, prepare you for a job interview, or even practice your foreign language skills.
Even if you pay, you won't get limitless access to sophisticated voice mode. After using it for approximately a half hour on Tuesday, I noticed "15 minutes left" at the bottom of the screen. OpenAI did not immediately respond to a request for information on the time limit.
The implementation of a time limit on the advanced speech mode raises questions about the resource intensity of this feature and OpenAI's capacity to provide unlimited access. It also suggests that as AI technologies become more sophisticated and resource-demanding, we may see more tiered access models in the future, potentially creating new disparities in access to cutting-edge AI tools.