Amazon has taken a new step in artificial intelligence by introducing Nova Sonic, a voice AI model. With its ability to produce natural, human-like speech, the model performs at a level that rivals offerings from OpenAI and Google. Amazon's ambitions for Nova Sonic centre on understanding voice commands and responding in natural-sounding speech.
Amazon unveiled its new voice model Nova Sonic
Amazon says Nova Sonic is the most cost-effective AI voice model on the market. According to the company, Nova Sonic costs approximately 80 per cent less to run than OpenAI’s GPT-4o model.

Nova Sonic not only understands voice commands, but also recognises the tone, style and flow of speech. The model remains highly accurate even in noisy environments or when users misspeak. In tests conducted in English, French, German, Italian and Spanish, its word error rate was only 4.2 per cent.
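For readers unfamiliar with the metric, word error rate is conventionally the number of word substitutions, deletions and insertions needed to turn a transcript into the reference text, divided by the number of words in the reference. The sketch below illustrates that standard definition only; Amazon has not described how its own figure was computed, and the function name `word_error_rate` and the sample sentences are hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Standard WER: word-level edit distance divided by reference length.

    Illustrative only; not Amazon's evaluation code.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from transcript
            insertion = curr[j - 1] + 1  # extra word in transcript
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)


# One wrong word out of five gives a 20 per cent error rate.
print(word_error_rate("turn on the kitchen lights", "turn on the kitchen light"))
```

On this definition, a 4.2 per cent rate corresponds to roughly four word-level errors per hundred reference words.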
Another important feature is speed: the model has an average response time of 1.09 seconds, which makes it faster than OpenAI’s real-time API.
Amazon sees Nova Sonic as more than a voice model. In the future, the technology is expected to understand image and video data as well, making it one of the first examples of a multimodal model.