In a previous post, Streamline your UX research with customer experience analytics, we discussed the overall benefits and guidelines that UserTesting’s Research team keeps in mind as they incorporate survey-style questions into their studies. One tool they really like to use—and recommend you do, too—are rating scale questions.
There are a lot of best practices to keep in mind for effective UX research questions. The good news is, that it’s not as hard to design effective rating scale questions as you might think. Keep reading to learn how to get the data-backed support for the qualitative findings you need.
Rating scale questions gather answers that represent a range of choices along with particular themes, like satisfaction level, how likely a respondent is to do something, and the extent to which a respondent agrees with a particular statement.
Here are a few examples:
Rating scale questions help you quantify abstract or intangible concepts with approximate answers constrained to a single, logical value set (e.g., very likely to not at all likely, 1-7, 1-5).
As the most utilized type of metric question, rating scale questions:
There are four main types of rating scales.
Ordinal scales deliver answer sets that occur in a logical, systematic order and have a relational link. For example:
Likert scales assess the degree to which a respondent agrees or disagrees about a given statement. For example:
Semantic differential scales gauge emotional attitudes toward a topic by asking respondents to rate a product, company, service, etc. within the frames of a multi-point rating option representing opposite adjectives at each end. For example:
Interval scales feature answer sets where each interval represents a deeper meaning, not just an ordered grouping. For example:
Now that you understand the basics, here are some tips to help you get the most out of the rating scale questions you write.
Rating scales are highly adaptable and can be used to measure many different things—but only if the people taking your test understand what you’re asking them to measure. So make sure your rating scale involves two ends of a spectrum (which you effectively communicate) and assign clear labels to each one.
If you ask users how easy it was to accomplish something, their inclination is to say it was easy—even if they struggled—because people are naturally eager to please. You can avoid introducing bias by making sure you include both endpoints in your question (e.g., the good/the bad, how hard/how easy, etc.).
Pro tip: If you include both sides of the scale in the body of your question, you’ll also reinforce the endpoints you’ve defined and lower the chance a user will misread your question.
We tend to consider low numbers to be bad and high numbers to be good. Looked at another way: we automatically associate higher numbers with good news and lower numbers with bad news. So do yourself — and your users — a favor, and always label 1 as the “pain” point, and 5 as the “positive” point.
Pro tip: If you set up a rating scale question and realize the two endpoints are neutral and/or the positive point is right in the middle, consider turning it into a multiple-choice question. Instead of asking users to rate something on a scale from (1) “Too small” to (5) “Too big,” for example, change the question to a multiple-choice format featuring three answer options — “Too small,” “Just right,” and “Too big.”
You want respondents to differentiate their answers as much as possible, but you also don’t want to provide so many points that your rating scale becomes confusing or unreliable. It’s a delicate balance. So here are two tried-and-true guidelines that can help.
For ideas that range from positive to negative, use a 1-7 point scale that includes a middle or neutral point.
And for ideas that range from zero to positive, use a 1-5 point scale.
5-point scales are popular because they strike a balance between giving respondents enough options to express their feelings or opinions and not overwhelming them with too many choices. This scale typically includes two positive options, two negative options, and a neutral/middle option. It’s straightforward and helps to prevent respondent fatigue, which can lead to more reliable data.
7-point scales provide a finer gradation, which can be useful when you need more sensitivity in responses. They can capture slight variations in attitudes or opinions more effectively than fewer points. This scale is advantageous when the distinction between response categories needs to be more nuanced, allowing for a more precise measurement of respondents' feelings or opinions.
The choice between these scales depends largely on the level of granularity needed in the data and the specific context of the survey:
Imagine you easily download a new app and then spend seven extremely frustrating minutes trying to figure out how to create a new account.
Then you’re confronted with the following question:
By asking about two different elements—downloading the app and setting up an account—this question is trying to measure two distinct elements. And it will yield invalid, inconclusive, and misleading results So be sure to write separate, individual rating scale questions for each task your users perform.
Pro tip: If you’re worried about users mistaking the second question as a duplicate, you can always capitalize the task you’re asking users to rate, for additional emphasis:
Careful design means your user research questions can provide deeper understanding of your customers' contexts. Allowing respondents to expand upon their rating answers can help you understand why they answered the way they did. It can also alert you to problems and opportunities with your offering. So include a free-text answer box underneath each rating scale question where respondents can input any additional information they want to share about why they chose their answer.
Interpreting data collected from rating scales involves several statistical measures that can provide valuable insights into the opinions or behaviors of your survey respondents. You'll likely be familiar with mean, median, and mode, but just in case, let's take a quick refresher course. understanding how to calculate and interpret these measures can help you make informed decisions and identify trends in your data.
Mean (Average):
Median:
Mode:
Identifying trends: By tracking changes in the mean and median over time, you can identify trends in attitudes or satisfaction levels among your respondents. For example, if the mean satisfaction rating consistently increases after certain changes or improvements, this can indicate a positive trend in customer perception.
Making informed decisions: Understanding the mode can help decision-makers focus on the most common needs or preferences of their audience. For instance, if the mode of a product feature rating is low, it might indicate a need for improvement in that area.
Analyzing response distribution: Comparing the mean, median, and mode can provide insights into the distribution of your data. If the mean and median are close but the mode is significantly different, this might suggest a skewed distribution or the presence of outliers. This can lead to deeper investigations into why certain responses differ from the general trend.
By effectively calculating and interpreting these measures, you can gain a comprehensive understanding of your rating scale data. This not only helps in assessing the current state of respondent attitudes or behaviors but also aids in planning future actions based on informed, data-driven insights.
As you can tell, rating scales are a versatile tool, but there are times you'll want to avoid this approach universally applicable to all types of survey questions. Understanding when and how to use rating scales can greatly enhance their effectiveness
Rating scales are ideal for questions that seek to measure:
While rating scales are useful, some survey questions require different approaches:
Choosing the right type of question format is crucial for gathering meaningful data in surveys. Rating scales are highly effective for capturing the degree of respondents' feelings or behaviors and are best used when nuances in intensity or frequency need to be quantified. However, for clear-cut decisions, qualitative insights, or comparative preferences, other question types like binary yes/no, open-ended, multiple-choice, or ranking questions might be more appropriate. By aligning the question type with the goal of your survey, you can optimize the collection of actionable and accurate data.