3 Best Practices for Designing Empathetic Voice Experiences

One of the biggest challenges with voice assistants is that they sound like humans…almost.

Will Hall
Chief Creative Officer

One of the biggest challenges with voice assistants is that they sound like humans… almost. The almost problem is more profound than it may at first seem, though. Because the voices are almost human, we as users tend to graft human expectations onto these interfaces that simply aren’t yet up to the task of conversing intuitively, understanding emotions or building true empathy. As architects of these experiences, we must find ways to set appropriate user expectations and bridge the gap with thoughtful design. The following are three best practices to do so:

Mind the Gaps:

In voice experiences, users insert their emotions most into the negative spaces of conversations.

For example, if a consumer used a customer service skill that had long pauses after each question, the user will channel “are they even listening?!” into that gap. Understanding that these moments will spark the most emotion means that we must anticipate and respond accordingly.


We don’t always say exactly what we mean.

As designers, we must anticipate and predict intent wherever possible.

For example, if a user checks their balance with a banking app, they might just want to know if they can purchase something instead of the actual available funds. If a user asks Alexa about the weather, they might want to know whether they should bring an umbrella or not, as opposed the the actual temperature etc. The more we can enable the tech to intuit intent and anticipate requests, the more frictionless and empathetic our experiences will become.


We think of voice as a “focus group of one” because of the specific nature of conversation.

Users will tell you specifically what they want in voice.

This is a powerful truth that can help voice experiences drive personalization for users and glean qualitatively rich insights for brands.

Portions of this content originally appeared as part of SoundHound’s Finding Your Brand Voice: 6 Ways to Build a Better VUI Guide

What’s a Rich Text element?

The rich text element allows you to create and format headings, paragraphs, blockquotes, images, and video all in one place instead of having to add and format them individually. Just double-click and easily create content.

Static and dynamic content editing

A rich text element can be used with static or dynamic content. For static content, just drop it into any page and begin editing. For dynamic content, add a rich text field to any collection and then connect a rich text element to that field in the settings panel. Voila!

How to customize formatting for each rich text

Headings, paragraphs, blockquotes, figures, images, and figure captions can all be styled after a class is added to the rich text element using the "When inside of"

nested selector system.

About RAINdrops 

Created and curated by our team of experts, RAINdrop articles cover the many ways voice technology is transforming your industry.

See All Articles

Get Voice on Voice 

Every Tuesday, our industry leading briefing covers the latest updates on voice and beyond. Join over 12,000 subscribers and sign up today.

Voice on Voice

Don't miss another briefing