Obsessed with technology?
Subscribe to the latest tech news as well as exciting promotions from us and our partners!
By subscribing, you indicate that you have read & understood the SPH's Privacy Policy and PDPA Statement.
News Categories

Google's new AI can perfectly imitate human speech

By James Lu - on 27 Dec 2017, 10:53am

Google's new AI can perfectly imitate human speech

Google has created a new text-to-speech AI called Tacotron 2, which uses deep neural networks to perfectly imitate human speech. Google published a research paper about the system, which is able to read text in a generated female voice that is indistinguishable from an actual human reading the same text.

The system is Google’s second official generation of the technology, which consists of two deep neural networks. The first network translates the text into a spectrogram, a visual way to represent audio frequencies over time. That spectrogram is then fed into WaveNet, a system from Alphabet’s AI research lab DeepMind, which reads the chart and generates the corresponding audio elements accordingly.

You can check out audio samples of Tacotron 2 and compare it to an actual human speaking the same text here.

Google via Quartz