Try to disconnect any other audio hardware and just use one audio. If you have speaker and headset, try to disconnect the headset.
And make sure you set the audio to the default audio device, with every volume maxed out.
If you’re using Azure, make sure you are online, and also make sure you have downloaded English US language pack including the Text to Speech in your Windows 10 Language settings.