TY - GEN
T1 - Credible Without Credit
T2 - 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023
AU - Peskoff, Denis
AU - Stewart, Brandon M.
N1 - Publisher Copyright: © 2023 Association for Computational Linguistics.
PY - 2023
Y1 - 2023
N2 - Language models have recently broken into the public consciousness with the release of the wildly popular ChatGPT. Commentators have argued that language models could replace search engines, make college essays obsolete, or even write academic research papers. All of these tasks rely on accuracy of specialized information which can be difficult to assess for non-experts. Using 10 domain experts across science and culture, we provide an initial assessment of the coherence, conciseness, accuracy, and sourcing of two language models across 100 expert-written questions. While we find the results are consistently cohesive and concise, we find that they are mixed in their accuracy. These results raise questions of the role language models should play in general-purpose and expert knowledge seeking.
UR - http://www.scopus.com/inward/record.url?scp=85172223217&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85172223217&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85172223217
T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics
SP - 427
EP - 438
BT - Short Papers
PB - Association for Computational Linguistics (ACL)
Y2 - 9 July 2023 through 14 July 2023
ER -