Skip to main navigation
Skip to search
Skip to main content
Princeton University Home
Help & FAQ
Home
Profiles
Research units
Facilities
Projects
Research output
Press/Media
Search by expertise, name or affiliation
Rethinking Math Benchmarks for LLMs using IRT
Jane Castleman
, Nimra Nadeem
, Tanvi Namjoshi
,
Lydia T. Liu
Computer Science
Princeton Language and Intelligence (PLI)
Research output
:
Contribution to journal
›
Conference article
›
peer-review
Overview
Fingerprint
Fingerprint
Dive into the research topics of 'Rethinking Math Benchmarks for LLMs using IRT'. Together they form a unique fingerprint.
Sort by
Weight
Alphabetically
Computer Science
Large Language Model
100%
Response Theory
50%
Mathematical Reasoning
50%
Reasoning Task
50%
Economics, Econometrics and Finance
Item Response Theory
100%
Keyphrases
IRT Analysis
33%
Artificial Intelligence in Education (AIEd)
33%