A confirmatory factorial analysis of the Chatbot Usability Scale: A multilanguage validation

Simone Borsci, Martin Schmettow, Alessio Malizia, Alan Chamberlain*, Frank Van Der Velde

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

6 Downloads (Pure)

Abstract

The Bot Usability Scale (BUS) is a standardised tool to assess and compare the satisfaction of users after interacting with chatbots to support the development of usable conversational systems. The English version of the 15-item BUS scale (BUS-15) was the result of an exploratory factorial analysis; a confirmatory factorial analysis tests the replicability of the initial model and further explores the properties of the scale aiming to optimise this tool seeking for the stability of the original model, the potential reduction of items, and testing multiple language versions of the scale. BUS-15 and the usability metrics for user experience (UMUX-LITE), used here for convergent validity purposes, were translated from English to Spanish, German, and Dutch. A total of 1292 questionnaires were completed in multiple languages; these were collected from 209 participants interacting with an overall pool of 26 chatbots. BUS-15 was acceptably reliable; however, a shorter and more reliable solution with 11 items (BUS-11) emerged from the data. The satisfaction ratings obtained with the translated version of BUS-11 were not significantly different from the original version in English, suggesting that the BUS-11 could be used in multiple languages. The results also suggested that the age of participants seems to affect the evaluation when using the scale, with older participants significantly rating the chatbots as less satisfactory, when compared to younger participants. In line with the expectations, based on reliability, BUS-11 positively correlates with UMUX-LITE scale. The new version of the scale (BUS-11) aims to facilitate the evaluation with chatbots, and its diffusion could help practitioners to compare the performances and benchmark chatbots during the product assessment stage. This tool could be a way to harmonise and enable comparability in the field of human and conversational agent interaction.
Original languageEnglish
Number of pages14
JournalPersonal and ubiquitous computing
Early online date4 Aug 2022
DOIs
Publication statusE-pub ahead of print/First online - 4 Aug 2022

Fingerprint

Dive into the research topics of 'A confirmatory factorial analysis of the Chatbot Usability Scale: A multilanguage validation'. Together they form a unique fingerprint.

Cite this