
Gesturing Toward Abstraction: Multimodal Convention Formation in Collaborative Physical Tasks

  • Kiyosu Maeda
  • William P. McCarthy
  • Ching Yi Tsai
  • Jeffrey Mu
  • Haoliang Wang
  • Robert Hawkins
  • Judith E. Fan
  • Parastoo Abtahi

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

A quintessential feature of human intelligence is the ability to create ad hoc conventions over time to achieve shared goals efficiently. We investigate how communication strategies evolve through repeated collaboration as people coordinate on shared procedural abstractions. To this end, we conducted an online unimodal study (n = 98) using natural language to probe abstraction hierarchies. In a follow-up lab study (n = 40), we examined how multimodal communication (speech and gestures) changed during physical collaboration. Pairs used augmented reality to isolate their partner's hand and voice; one participant viewed a 3D virtual tower and sent instructions to the other, who built the physical tower. Participants became faster and more accurate by establishing linguistic and gestural abstractions and using cross-modal redundancy to emphasize key changes from previous interactions. Based on these findings, we extend probabilistic models of convention formation to multimodal settings, capturing shifts in modality preferences. Our findings and model provide building blocks for designing convention-aware intelligent agents situated in the physical world.

Original language: English (US)
Title of host publication: CHI 2026 - Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems
Editors: Nuria Oliver, David A. Shamma, Heloisa Candello, Pablo Cesar, Pedro Lopes, Alessandro Bozzon, Thomas Kosch, Vera Liao, Xiaojuan Ma, Valentino Artizzu, Fiona Draxler, Gustavo Lopez, Anke V. Reinschluessel, Xin Tong, Phoebe O. Toups Dugas
Publisher: Association for Computing Machinery
ISBN (Electronic): 9798400722783
DOIs
State: Published - Apr 13 2026
Event: 2026 CHI Conference on Human Factors in Computing Systems, CHI 2026 - Barcelona, Spain
Duration: Apr 13 2026 → Apr 17 2026

Publication series

Name: Conference on Human Factors in Computing Systems - Proceedings

Conference

Conference: 2026 CHI Conference on Human Factors in Computing Systems, CHI 2026
Country/Territory: Spain
City: Barcelona
Period: 4/13/26 → 4/17/26

All Science Journal Classification (ASJC) codes

  • Human-Computer Interaction
  • Computer Graphics and Computer-Aided Design
  • Software

Keywords

  • abstraction
  • augmented reality
  • complementary
  • hand gestures
  • modality preference
  • Multimodal conventions
  • rational speech act
