Google's new AI can perfectly imitate human speech
Google has developed a new text-to-speech AI called Tacotron 2, which uses deep neural networks to imitate human speech. In a research paper describing the system, Google reports that it can read text aloud in a generated female voice that is nearly indistinguishable from an actual human reading the same text.
Tacotron 2 is the second official generation of Google’s technology and consists of two deep neural networks. The first network translates the text into a spectrogram, a visual representation of audio frequencies over time. That spectrogram is then fed into WaveNet, a system from Alphabet’s AI research lab DeepMind, which reads the spectrogram and generates the corresponding audio.
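For readers unfamiliar with spectrograms, the short Python sketch below shows what that intermediate representation looks like in practice. It is not Google's code and does not reproduce Tacotron 2: it simply turns a synthetic 1 kHz tone into a grid of frequency bins over time using NumPy. The function name `spectrogram`, the frame length, the hop size, and the sample rate are all illustrative assumptions, not settings from the paper.

```python
import numpy as np

def spectrogram(signal, frame_len=256, hop=128):
    """Magnitude spectrogram: rows are frequency bins, columns are time frames.

    Hypothetical helper for illustration only; frame_len and hop are
    arbitrary choices, not Tacotron 2 parameters.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    # Slice the signal into overlapping, windowed frames.
    frames = np.stack([
        signal[i * hop : i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    # One FFT per frame, then transpose so time runs along the columns.
    return np.abs(np.fft.rfft(frames, axis=1)).T

sample_rate = 8000                          # assumed sample rate in Hz
t = np.arange(sample_rate) / sample_rate    # one second of timestamps
tone = np.sin(2 * np.pi * 1000 * t)         # synthetic 1 kHz test tone

spec = spectrogram(tone)
peak_bin = int(spec.mean(axis=1).argmax())  # strongest frequency bin on average
peak_hz = peak_bin * sample_rate / 256      # convert bin index back to Hz
print(spec.shape, peak_hz)
```

In a two-stage system like the one described above, the first network would predict a grid of this general kind from text, and the second would turn it back into a waveform.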