Amazon’s New Nova Sonic AI Model Features a ‘More Human-like Voice’
Amazon Nova Canvas is a foundation model for developers to create high-quality images. Image: Amazon

Amazon is the latest tech giant to unveil a voice AI model. According to Amazon, Nova Sonic is “a new foundation model that unifies speech understanding and speech generation into a single model, to enable more human-like voice conversations in AI applications.” Nova Sonic will compete with similar AI models from OpenAI, Google, and other tech companies.

Nova Sonic understands more than words

Nova Sonic doesn’t just understand the speaker’s words; it can also process their tone, style, and pace. The AI voice generator adapts to the conversation context, so dialogue flows more naturally than it did with the more stilted models from the first generations of Alexa.

Nova Sonic can do this because it combines multiple speech processing and generation functions into a single AI model instead of using several separate models. Traditionally, AI voice tools involved running multiple models in sequence: a speech recognition model would convert speech to text, then a large language model (LLM) …









