EMNLP 2026 Workshop on
Multimodal Interaction in Face-to-Face Dialogue (MINT)

Workshop date TBA (co-located with EMNLP 2026, October 24th-29th, 2026)

EMNLP 2026 in Budapest, Hungary

News

2026-02-20
MINT has been accepted as a workshop at EMNLP 2026.

About

MINT: Multimodal Interaction in Face-to-Face Dialogue is a new workshop on modelling, generating, and understanding multimodal verbal and non-verbal behaviour in face-to-face communication.

Human communication is inherently multimodal: meaning is conveyed not only through words and speech, but also through prosody, gesture, facial expression, gaze, and body pose. Yet much multimodal NLP still treats these signals as peripheral context or as outputs generated only after linguistic content has already been decided. MINT addresses this gap by focusing on multimodal interaction as a core part of communication.

The workshop aims to bring together researchers across computational linguistics, NLP, computer vision, cognitive science, HCI, and robotics to study the resources, modelling approaches, processing methods, and evaluation strategies needed for face-to-face dialogue.

We are particularly interested in work that connects insights about human multimodal communication with the development of AI systems that interact with humans or simulate human-like behaviour. This includes research on new datasets, multimodal learning methods, embodied agents, social robots, and evaluation frameworks for naturalistic interaction.

Call for papers

We invite submissions to MINT: Multimodal Interaction in Face-to-Face Dialogue. The workshop brings together researchers from computational linguistics, NLP, computer vision, HCI, robotics, and cognitive science working on multimodal face-to-face communication.

We welcome work on topics including:

  • computational models that integrate verbal and non-verbal cues such as speech, text, gesture, facial expression, gaze, and body pose;
  • cognitive and linguistic insights about face-to-face communication that can inform AI systems;
  • multimodal datasets with synchronised speech, video, and motion data;
  • evaluation methods for multimodal interaction;
  • applications and tools for embodied conversational agents, social robots, annotation, and behavioural analysis.

We welcome both archival and non-archival contributions. Details on submission routes, formatting, review, and presentation are provided below.

Important dates

Please consult the workshop website and submission portals for the most up-to-date deadlines and timezone details.

Paper submission deadline Monday, June 1st, 2026
Reviewing deadline TBD
Author notification TBD
Camera-ready deadline Saturday, August 1st, 2026
Submission information
Submission routes and portals
  • ARR commitment route. Authors may submit through the current ACL Rolling Review (ARR) cycle and commit their paper to MINT. Please consult the ARR author guidelines for current procedures and requirements.
  • Direct submission route. MINT will also accept direct submissions. This route will have a separate deadline, and its dedicated OpenReview portal will be announced soon.

Papers should be prepared using the official ACL formatting guidelines and the official ACL style files.

Paper categories and length
  • Archival papers. Original, unpublished research to appear in the workshop proceedings. These may be submitted as long papers (up to 8 pages plus references) or short papers (up to 4 pages plus references).
  • Non-archival papers. Previously published work, preliminary research, or demos to be presented at the workshop.
Anonymization

Archival submissions should be anonymized for double-blind review. Please remove author names, affiliations, acknowledgements, and other identifying information from the manuscript and supplementary material where appropriate.

Review criteria

Each paper will receive three reviews. Archival submissions will be evaluated primarily for originality, technical soundness, completeness, and relevance to the workshop. Non-archival and work-in-progress submissions will be evaluated primarily for relevance to the workshop theme and their potential to stimulate discussion.

Presentation

Accepted contributions will be invited for presentation at the MINT workshop co-located with EMNLP 2026 in Budapest, Hungary. Presentation formats may include posters and selected talks, depending on the final program.

Invited speakers

Judith Holler
Judith Holler

Radboud University & Max Planck Institute for Psycholinguistics

Vera Demberg
Vera Demberg

Saarland University

Organizers

Raquel Fernández
Raquel Fernández

University of Amsterdam

Esam Ghaleb
Esam Ghaleb

Max Planck Institute for Psycholinguistics

Asli Ozyurek
Asli Ozyurek

Max Planck Institute for Psycholinguistics / Radboud University

Bulat Khaertdinov
Bulat Khaertdinov

Maastricht University

Ece Takmaz
Ece Takmaz

Utrecht University

Zerrin Yumak
Zerrin Yumak

Utrecht University

Program committee

  • Casey Kennington (Boise State University)
  • Catherine Pelachaud (Sorbonne University)
  • Chinmaya Mishra (Max Planck Institute for Psycholinguistics)
  • David Schlangen (University of Potsdam)
  • Dirk Heylen (University of Twente)
  • Gabriel Skantze (KTH Royal Institute of Technology)
  • Hayley Hung (TU Delft)
  • Hendrik Buschmeier (Bielefeld University)
  • Ilya Burenko (TU Dresden)
  • Justine Cassell (INRIA & École Polytechnique)
  • Kerstin Fischer (University of Southern Denmark)
  • Martijn Vastenburg (Radboud University)
  • Pat Healey (Queen Mary University of London)
  • Patrizia Paggio (University of Malta)
  • Petra Wagner (Bielefeld University)
  • Rachel McDonnell (Trinity College Dublin)
  • Rada Mihalcea (University of Michigan)
  • Ruben Janssens (Ghent University)
  • Stefan Kopp (Bielefeld University)
  • Taras Kucherenko (Electronic Arts)

Sponsor

We thank the Multimodal Language Department (MLD) at Max Planck Institute for Psycholinguistics, for sponsoring this workshop.