The SpeechSynthesisUtterance interface of the Web Speech API represents a speech request. It contains the content the speech service should read and information about how to read it (e.g. language, pitch and volume.)
The SpeechSynthesisUtterance interface of the Web Speech API represents a speech request. It contains the content the speech service should read and information about how to read it (e.g. language, pitch and volume.)
SpeechSynthesisUtterance()Returns a new SpeechSynthesisUtterance object instance.
SpeechSynthesisUtterance also inherits properties from its parent interface, EventTarget.
SpeechSynthesisUtterance.langGets and sets the language of the utterance.
SpeechSynthesisUtterance.pitchGets and sets the pitch at which the utterance will be spoken at.
SpeechSynthesisUtterance.rateGets and sets the speed at which the utterance will be spoken at.
SpeechSynthesisUtterance.textGets and sets the text that will be synthesized when the utterance is spoken.
SpeechSynthesisUtterance.voiceGets and sets the voice that will be used to speak the utterance.
SpeechSynthesisUtterance.volumeGets and sets the volume that the utterance will be spoken at.
Listen to these events using addEventListener() or by assigning an event listener to the oneventname property of this interface.
boundary Fired when the spoken utterance reaches a word or sentence boundary. Also available via the onboundary property.
end Fired when the utterance has finished being spoken. Also available via the onend property.
error Fired when an error occurs that prevents the utterance from being successfully spoken. Also available via the onerror property
mark Fired when the spoken utterance reaches a named SSML "mark" tag. Also available via the onmark property.
pause Fired when the utterance is paused part way through. Also available via the onpause property.
resume Fired when a paused utterance is resumed. Also available via the onresume property.
start Fired when the utterance has begun to be spoken. Also available via the onstart property.
In our basic Speech synthesizer demo, we first grab a reference to the SpeechSynthesis controller using window.speechSynthesis. After defining some necessary variables, we retrieve a list of the voices available using SpeechSynthesis.getVoices() and populate a select menu with them so the user can choose what voice they want.
Inside the inputForm.onsubmit handler, we stop the form submitting with preventDefault(), use the constructor to create a new utterance instance containing the text from the text <input>, set the utterance's voice to the voice selected in the <select> element, and start the utterance speaking via the SpeechSynthesis.speak() method.
js
const synth = window.speechSynthesis; const inputForm = document.querySelector("form"); const inputTxt = document.querySelector("input"); const voiceSelect = document.querySelector("select"); let voices; function loadVoices() { voices = synth.getVoices(); for (let i = 0; i < voices.length; i++) { const option = document.createElement("option"); option.textContent = `${voices[i].name} (${voices[i].lang})`; option.value = i; voiceSelect.appendChild(option); } } // in Google Chrome the voices are not ready on page load if ("onvoiceschanged" in synth) { synth.onvoiceschanged = loadVoices; } else { loadVoices(); } inputForm.onsubmit = (event) => { event.preventDefault(); const utterThis = new SpeechSynthesisUtterance(inputTxt.value); utterThis.voice = voices[voiceSelect.value]; synth.speak(utterThis); inputTxt.blur(); };
| Specification |
|---|
| Web Speech API # speechsynthesisutterance |
| Desktop | Mobile | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Chrome | Edge | Firefox | Internet Explorer | Opera | Safari | WebView Android | Chrome Android | Firefox for Android | Opera Android | Safari on IOS | Samsung Internet | |
SpeechSynthesisUtterance |
33 | 14 | 49 | No | 21 | 7 | No | 33 | 62 | No | 7 | 3.0 |
SpeechSynthesisUtterance |
33 | 14 | 49 | No | 21 | 7 | No | 33 | 62 | No | 7 | 3.0 |
boundary_event |
14 | 49 | No | 21 | 7 | No | 62 | No | 7 | 3.0 | ||
end_event |
33 | 14 | 49 | No | 21 | 7 | No | 33 | 62 | No | 7 | 3.0 |
error_event |
33 | 14 | 49 | No | 21 | 7 | No | 33 | 62 | No | 7 | 3.0 |
lang |
33 | 14 | 49 | No | 21 | 7 | No | 33 | 62 | No | 7 | 3.0 |
mark_event |
33 | 14 | 49 | No | 21 | 7 | No | 33 | 62 | No | 7 | 3.0 |
pause_event |
33 | 14 | 49 | No | 21 | 7 | No | 33 | 62 | No | 7 | 3.0 |
pitch |
33 | 14 | 49 | No | 21 | 7 | No | 33 | 62 | No | 7 | 3.0 |
rate |
33 | 14 | 49 | No | 21 | 7 | No | 33 | 62 | No | 7 | 3.0 |
resume_event |
33 | 14 | 49 | No | 21 | 7 | No | 33 | 62 | No | 7 | 3.0 |
start_event |
33 | 14 | 49 | No | 21 | 7 | No | 33 | 62 | No | 7 | 3.0 |
text |
33 | 14 | 49 | No | 21 | 7 | No | 33 | 62 | No | 7 | 3.0 |
voice |
33 | 14 | 49 | No | 21 | 7 | No | 33 | 62 | No | 7 | 3.0 |
volume |
33 | 14 | 49 | No | 21 | 7 | No | 33 | 62 | No | 7 | 3.0 |
© 2005–2023 MDN contributors.
Licensed under the Creative Commons Attribution-ShareAlike License v2.5 or later.
https://developer.mozilla.org/en-US/docs/Web/API/SpeechSynthesisUtterance