Heading into the last week before the conference, we’d like to announce our Award Winners. The Best Paper and Best Theme Paper winners will present at our closing session, but all papers (including those two) will get their own presentations throughout the conference. We’ve included their presentation details to make them easy to find.

Winners were selected through a blind process managed by our Best Paper Committee (see below), with the exception of the SAC awards, which were selected, also blindly, by the SACs of each track (see https://2025.naacl.org/committees/program/), and the Demo award, selected by our Demo chairs. Some tracks are missing SAC awards, either due to a lack of suitable paper or due to the winner also winning another award.

Congratulations to all of our winners!


Best Paper Awards

  • Best Paper Award:
    The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models \
    Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, Dongkeun Yoon, Guijin Son, Yejin Cho, Sheikh Shafayat, Jinheon Baek, Sue Hyun Park, Hyeonbin Hwang, Jinkyung Jo, Hyowon Cho, Haebin Shin, Seongyun Lee, Hanseok Oh, Noah Lee, Namgyu Ho, Se June Joo, Miyoung Ko, Yoonjoo Lee, Hyungjoo Chae, Jamin Shin, Joel Jang, Seonghyeon Ye, Bill Yuchen Lin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon Seo
    Ballroom A, Session I: Oral/Poster 6, Thursday May 1, 16:00-17:30
  • Best Paper Runner-Up:
    REL-A.I.: An Interaction-Centered Approach To Measuring Human-LM Reliance
    Kaitlyn Zhou, Jena D. Hwang, Xiang Ren, Nouha Dziri, Dan Jurafsky, Maarten Sap
    Ballroom B, Session H: Oral/Poster 5, Thursday May 1, 14:00-15:30
  • Best Demo Award:
    Towards Unified, Dynamic and Annotation-based Visualisations and Exploration of Annotated Big Data Corpora with the Help of Unified Corpus Explorer
    Alexander Mehler; Giuseppe Abrami; Kevin Bönisch
    Posters (Exhibit Hall), Session J: Oral/Poster 7, Friday May 2, 09:00-10:30
  • Best Social Impact Award:
    FLEURS-ASL: Including American Sign Language in Massively Multilingual Multitask Evaluation
    Garrett Tanzer
    Online, Gather Session 1, Tuesday May 6, 09:00-10:30
  • Best Theme Paper Award:
    WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
    Genta Indra Winata, Frederikus Hudi, Patrick Amadeus Irawan, David Anugraha, Rifki Afina Putri, WANG YUTONG, Adam Nohejl, Ubaidillah Ariq Prathama, Nedjma Ousidhoum, Afifa Amriani, Anar Sabuhi Rzayev, Anirban Das, Ashmari Pramodya, Aulia Adila, Bryan Wilie, Candy Olivia Mawalim, CHENG Ching Lam, Daud Abolade, Emmanuele Chersoni, Enrico Santus, Fariz Ikhwantri, Garry Kuwanto, Hanyang Zhao, Haryo Akbarianto Wibowo, Holy Lovenia, Jan Christian Blaise Cruz, Jan Wira Gotama Putra, Junho Myung, Lucky Susanto, Maria Angelica Riera Machin, Marina Zhukova, Michael Anugraha, Muhammad Farid Adilazuarda, Natasha Christabelle Santosa, Peerat Limkonchotiwat, Raj Dabre, Rio Alexander Audino, Samuel Cahyawijaya, Shi-Xiong Zhang, Stephanie Yulia Salim, Yi Zhou, Yinxuan Gui, David Ifeoluwa Adelani, En-Shiun Annie Lee, Shogo Okada, Ayu Purwarianti, Alham Fikri Aji, Taro Watanabe, Derry Tanti Wijaya, Alice Oh, Chong-Wah Ngo
    Posters (Exhibit Hall), Session C: Oral/Poster 2, Wednesday April 30, 14:00-15:30
  • Best Theme Paper Runner-Up:
    Developing multilingual speech synthesis system for Ojibwe, Mi’kmaq, and Maliseet
    Shenran Wang, Changbing Yang, Michael l parkhill, Chad Quinn, Christopher Hammerly, Jian Zhu
    Online, Gather Session 3, Tuesday May 6, 21:00-22:30

Outstanding Papers

  • PeerQA: A Scientific Question Answering Dataset from Peer Reviews
    Tim Baumgärtner, Ted Briscoe, Iryna Gurevych
    Posters (Exhibit Hall), Session H: Oral/Poster 5, Thursday May 1, 14:00-15:30
  • Is your benchmark truly adversarial? AdvScore: Evaluating Human-Grounded Adversarialness
    Yoo Yeon Sung, Maharshi Gor, Eve Fleisig, Ishani Mondal, Jordan Lee Boyd-Graber
    Posters (Exhibit Hall), Session H: Oral/Poster 5, Thursday May 1, 14:00-15:30
  • NLI under the Microscope: What Atomic Hypothesis Decomposition Reveals
    Neha Srikanth, Rachel Rudinger
    Posters (Exhibit Hall), Session C: Oral/Poster 2, Wednesday April 30, 14:00-15:30
  • IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models
    David Ifeoluwa Adelani, Jessica Ojo, Israel Abebe Azime, Jian Yun Zhuang, Jesujoba Oluwadara Alabi, Xuanli He, Millicent Ochieng, Sara Hooker, Andiswa Bukula, En-Shiun Annie Lee, Chiamaka Ijeoma Chukwuneke, Happy Buzaaba, Blessing Kudzaishe Sibanda, Godson Koffi KALIPE, Jonathan Mukiibi, Salomon KABONGO KABENAMUALU, Foutse Yuehgoh, Mmasibidi Setaka, Lolwethu Ndolela, Nkiruka Odu, Rooweither Mabuya, Salomey Osei, Shamsuddeen Hassan Muhammad, Sokhar Samb, Tadesse Kebede Guge, Tombekai Vangoni Sherman, Pontus Stenetorp
    Ruidoso, Session F: Oral/Poster 4, Thursday May 1, 10:30-12:00
  • ACCORD: Closing the Commonsense Measurability Gap
    François Roewer-Després, Jinyue Feng, Zining Zhu, Frank Rudzicz
    Posters (Exhibit Hall), Session B: Oral/Poster 1, Wednesday April 30, 11:00-12:30
  • DrawEduMath: Evaluating Vision Language Models with Expert-Annotated Students’ Hand-Drawn Math Images
    Sami Baral, Li Lucy, Ryan Knight, Alice Ng, Luca Soldaini, Neil Heffernan, Kyle Lo
    Posters (Exhibit Hall), Session K: Oral/Poster 8, Friday May 2, 11:00-12:30
  • A Logical Fallacy-Informed Framework for Argument Generation
    Luca Mouchel, Debjit Paul, Shaobo Cui, Robert West, Antoine Bosselut, Boi Faltings
    Ballroom C, Session I: Oral/Poster 6, Thursday May 1, 16:00-17:30
  • Learning vs Retrieval: The Role of In-Context Examples in Regression with Large Language Models
    Aliakbar Nafar, K. Brent Venable, Parisa Kordjamshidi
    Ballroom C, Session D: Oral/Poster 3, Wednesday April 30, 16:00-17:30
  • Multi3Hate: Multimodal, Multilingual, and Multicultural Hate Speech Detection with Vision–Language Models
    Minh Duc Bui, Katharina von der Wense, Anne Lauscher
    Posters (Exhibit Hall), Session C: Oral/Poster 2, Wednesday April 30, 14:00-15:30
  • How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs
    Ran Zhang, Wei Zhao, Steffen Eger
    Online, Gather Session 1, Tuesday May 6, 09:00-10:30

Senior Area Chair (SAC) Awards

  • SAC Award for Generation: Decoding Speculative Decoding
    Minghao Yan, Saurabh Agarwal, Shivaram Venkataraman
    Mesilla, Session B: Oral/Poster 1, Wednesday April 30, 11:00-12:30
  • SAC Award for Interpretability and Analysis of Models for NLP:
    On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs
    Nitay Calderon, Roi Reichart
    Ballroom C, Session F: Oral/Poster 4, Thursday May 1, 10:30-12:00
  • SAC Award for Language Modeling:
    In-Context Learning with Long-Context Models: An In-Depth Exploration
    Amanda Bertsch, Maor Ivgi, Emily Xiao, Uri Alon, Jonathan Berant, Matthew R. Gormley, Graham Neubig
    Posters (Exhibit Hall), Session C: Oral/Poster 2, Wednesday April 30, 14:00-15:30
  • SAC Award for Linguistic Theories, Cognitive Modeling and Psycholinguistics:
    Language Models Largely Exhibit Human-like Constituent Ordering Preferences
    Ada Tur, Gaurav Kamath, Siva Reddy
    Ballroom B, Session I: Oral/Poster 6, Thursday May 1, 16:00-17:30
  • SAC Award for Low-resource Methods for NLP:
    Advancing MoE Efficiency: A Collaboration-Constrained Routing C2R Strategy for Better Expert Parallelism Design
    Mohan Zhang, Pingzhi Li, Jie Peng, Mufan Qiu, Tianlong Chen
    Online, Gather Session 3, Tuesday May 6, 21:00-22:30
  • SAC Award for Resources and Evaluation:
    Unifying AI Tutor Evaluation: An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors
    Kaushal Kumar Maurya, KV Aditya Srivatsa, Kseniia Petukhova, Ekaterina Kochmar
    Ballroom A, Session J: Oral/Poster 7, Friday May 2, 09:00-10:30
  • SAC Award for Special Theme:
    Meta-Cultural Competence: Climbing the Right Hill of Cultural Awareness
    Sougata Saha, Saurabh Kumar Pandey, Monojit Choudhury
    Mesilla, Session C: Oral/Poster 2, Wednesday April 30, 14:00-15:30
  • SAC Award for Speech Processing and Spoken Language Understanding:
    Behavior-SD: Behaviorally Aware Spoken Dialogue Generation with Large Language Models
    Sehun Lee, Kang-wook Kim, Gunhee Kim
    Ballroom C, Session K: Oral/Poster 8, Friday May 2, 11:00-12:30
  • SAC Award for Summarization:
    Coverage-based Fairness in Multi-document Summarization
    Haoyuan Li, Yusen Zhang, Rui Zhang, Snigdha Chaturvedi
    Posters (Exhibit Hall), Session H: Oral/Poster 5, Thursday May 1, 14:00-15:30

Once again, a hearty congratulations to all the award winners!

The NAACL 2025 best paper committee was:

Chairs:

Anna Rumshisky, University of Massachusetts Lowell

Marine Carpuat, University of Maryland

Committee members:

Ana Marasović, University of Utah

Ani Nenkova, Adobe Research

Chris Callison-Burch, University of Pennsylvania

Dan Roth, University of Pennsylvania

Dan Garrette, Google

Danqi Chen, Princeton University

David Chiang, University of Notre Dame

David Bamman, University of California, Berkeley

Dilek Hakkani-Tur, University of Illinois Urbana-Champaign

Diyi Yang, Stanford University

Ehud Reiter, University of Aberdeen

Greg Durrett, University of Texas at Austin

He He, New York University

Ivan Vladimir Meza Ruiz, IIMAS, UNAM (Universidad Nacional Autónoma de México)

Jackie Cheung, McGill University

Julia Kreutzer, Cohere

Kristina Toutanova, Google

Mike Lewis, Meta

Mohit Bansal, University of North Carolina at Chapel Hill (UNC)

Monojit Choudhury, Mohamed bin Zayed University of Artificial Intelligence (MBZUAI)

Gholamreza Haffari, Monash University

Saif Mohammad, National Research Council Canada

Saleh Soltan, Amazon

Vinodkumar Prabhakaran, Google

Vivek Srikumar, University of Utah

Viviane Moreira, Universidade Federal do Rio Grande do Sul (UFRGS)

Yang Liu, Amazon

Yonatan Bisk, Carnegie Mellon University