TY - GEN
T1 - Ranking Formal Specifications using LLMs
AU - He, Mike
AU - Ang, Zhendong
AU - Desai, Ankush
AU - Gupta, Aarti
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/10/9
Y1 - 2025/10/9
N2 - Formal specifications are essential for reasoning about the correctness of complex systems. While recent advances have explored automatically learning such specifications, the challenge of distinguishing meaningful, non-trivial specifications from a vast and noisy pool of learned candidates remains largely open. In this position paper, we present an approach for specification ranking, aimed at identifying the most critical specifications that contribute to overall system correctness. To this end, we develop a four-metric rating framework that quantifies the importance of a specification. Our approach leverages the reasoning capabilities of Large Language Models to rank specifications from a set of automatically learned candidates. We evaluate the proposed method on a set of specifications inferred for 11 open-source and 3 proprietary distributed system benchmarks, demonstrating its effectiveness in ranking critical specifications.
AB - Formal specifications are essential for reasoning about the correctness of complex systems. While recent advances have explored automatically learning such specifications, the challenge of distinguishing meaningful, non-trivial specifications from a vast and noisy pool of learned candidates remains largely open. In this position paper, we present an approach for specification ranking, aimed at identifying the most critical specifications that contribute to overall system correctness. To this end, we develop a four-metric rating framework that quantifies the importance of a specification. Our approach leverages the reasoning capabilities of Large Language Models to rank specifications from a set of automatically learned candidates. We evaluate the proposed method on a set of specifications inferred for 11 open-source and 3 proprietary distributed system benchmarks, demonstrating its effectiveness in ranking critical specifications.
KW - Ranking specifications, Agentic workflows
UR - https://www.scopus.com/pages/publications/105021598011
UR - https://www.scopus.com/pages/publications/105021598011#tab=citedBy
U2 - 10.1145/3759425.3763386
DO - 10.1145/3759425.3763386
M3 - Conference contribution
AN - SCOPUS:105021598011
T3 - LMPL 2025 - Proceedings of the 1st ACM SIGPLAN International Workshop on Language Models and Programming Languages, Co-located with ICFP/SPLASH 2025
SP - 51
EP - 56
BT - LMPL 2025 - Proceedings of the 1st ACM SIGPLAN International Workshop on Language Models and Programming Languages, Co-located with ICFP/SPLASH 2025
A2 - Le Goues, Claire
A2 - Mechtaev, Sergey
A2 - Shi, Qingkai
A2 - Wang, Chengpeng
A2 - Ye, He
A2 - Zhang, Zhuo
PB - Association for Computing Machinery, Inc
T2 - 1st ACM SIGPLAN International Workshop on Language Models and Programming Languages, LMPL 2025, co-located with ICFP/SPLASH 2025
Y2 - 12 October 2025 through 18 October 2025
ER -