Long-Form Analogy Evaluation Challenge

  • Bhavya Bhavya
  • , Chris Palaguachi
  • , Yang Zhou
  • , Suma Bhat
  • , Cheng Xiang Zhai

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

Given the practical applications of analogies, recent work has studied analogy generation to explain concepts. However, not all generated analogies are of high quality and it is unclear how to measure the quality of this new kind of generated text. To address this challenge, we propose a shared task on automatically evaluating the quality of generated analogies based on seven comprehensive criteria. For this, we will set up a leaderboard based on our dataset annotated with manual ratings along the seven criteria, and provide a baseline solution leveraging GPT-4. We hope that this task would advance the progress in development of new evaluation metrics and methods for analogy generation in natural language, particularly for education.

Original languageEnglish (US)
Title of host publicationINLG 2024 - 17th International Natural Language Generation Conference, Proceedings of the Generation Challenges
EditorsSimon Mille, Miruna-Adriana Clinciu
PublisherAssociation for Computational Linguistics (ACL)
Pages1-16
Number of pages16
ISBN (Electronic)9798891761247
DOIs
StatePublished - 2024
Externally publishedYes
Event17th International Natural Language Generation Conference, INLG 2024 - Tokyo, Japan
Duration: Sep 23 2024Sep 27 2024

Publication series

NameINLG 2024 - 17th International Natural Language Generation Conference, Proceedings of the Generation Challenges

Conference

Conference17th International Natural Language Generation Conference, INLG 2024
Country/TerritoryJapan
CityTokyo
Period9/23/249/27/24

All Science Journal Classification (ASJC) codes

  • Computational Theory and Mathematics
  • Computer Science Applications
  • Information Systems
  • Software

Fingerprint

Dive into the research topics of 'Long-Form Analogy Evaluation Challenge'. Together they form a unique fingerprint.

Cite this