Solved: Re: Anyone have an example of Amazon Polly Speech...

bvanbens · ‎2021-11-16

Zoom team - do you have a full example of how to use SSML for text to speech.

The help link goes directly to a AWS web page with table of tags.

For a person not familiar with the tags it's confusing.

can you provide an example how to use it in a typical text to speech like shown below ?

naveen · ‎2021-11-18

@bvanbens - Generally no need to use any of SSML TAG and it will convert all plan text to speech (wave file)...

E.g. If you want to add more pause time in between option1 to option2, then need to use SSML TAGs. (here adding additional 3 seconds delay)

<speak> Thank you for calling the IT Helpdesk <break time="3s"/> To open a new ticket press1 <break time="3s"/> To check existing ticket press 2</speak>

More saemple tags, plz refer to: supportedtags

View solution in original post

naveen · ‎2021-11-18

@bvanbens - Generally no need to use any of SSML TAG and it will convert all plan text to speech (wave file)...

E.g. If you want to add more pause time in between option1 to option2, then need to use SSML TAGs. (here adding additional 3 seconds delay)

<speak> Thank you for calling the IT Helpdesk <break time="3s"/> To open a new ticket press1 <break time="3s"/> To check existing ticket press 2</speak>

More saemple tags, plz refer to: supportedtags

JoshRyder · ‎2022-11-10

The SSML tags are not working for me. The voice simply reads the tags and their contents.

For example the following:

Please wait while we connect your call.
<break time="3s"/>
A representative will be with you shortly.

Is read as:

Please wait while we connect your call.
Break time equals 3's. Greater than
A representative will be with you shortly

JoshRyder · ‎2022-11-10

Solved.

Any text outside of the <speak> tags appears to break the code and they are read literally.

Dave-Myers · ‎2022-01-26

Adding onto this thread, in the Amazon docs there is a column in the table for "Availability with Neural Voices", with values of "full availability", "partial availability" and "not available". It looks like Zoom supports the ones marked as "full availability" but not "partial availability", is this correct?

The one I'm trying to get working is <say-as> to pronounce digits correctly as well as say text characters:

<say-as interpret-as=\"digits\">123456</say-as>

<say-as interpret-as=\"characters\">ID</say-as>

When I add these to the message to play field, I'm getting an error of "Invalid SSML request".

thanks!

Dave-Myers · ‎2022-01-26

My bad - looks like <say-as> is supported, I escaped the ". This works:

TCSDIRZCC · ‎2024-12-23

Curious, does this only work when published. If I use the play function in the widget, it still reads the SSML as text.

Communication

Productivity

Apps & Integration

Spaces

Employee Engagement

Effective January 9th, 2026 through January 22nd, 2026: The Zoom Community is currently in read-only mode with login disabled, to deliver you a new and improved community experience!

Anyone have an example of Amazon Polly Speech Synthesis Markup Language (SSML) in text to speech

Workplace

Communication

Productivity

Apps & Integration

Spaces

Employee Engagement

Business Services

Customer Care

Sales

Marketing

Ecosystems

Enterprise

Download the Zoom app

Zoom Virtual Backgrounds

Business Services

Community User Groups

Programs

Location and Language

Industry

Community Help

All Support

Community Resources

Community Events

Community Events

Effective January 9th, 2026 through January 22nd, 2026: The Zoom Community is currently in read-only mode with login disabled, to deliver you a new and improved community experience!

Anyone have an example of Amazon Polly Speech Synthesis Markup Language (SSML) in text to speech