TY - GEN
T1 - Leveraging preposition ambiguity to assess compositional distributional models of semantics
AU - Ritter, Samuel
AU - Long, Cotie
AU - Paperno, Denis
AU - Baroni, Marco
AU - Botvinick, Matthew
AU - Goldberg, Adele
N1 - Funding Information:
Denis Paperno and Marco Baroni were sup ported by ERC 2011 Starting Independent Research Grant n. 283554 (COMPOSES). Samuel Ritter and Matthew Botvinick were supported by Intelligence Advanced Research Projects Activity (IARPA) Grant n. 102-01.
PY - 2015
Y1 - 2015
N2 - Complex interactions among the meanings of words are important factors in the function that maps word meanings to phrase meanings. Recently, compositional distributional semantics models (CDSM) have been designed with the goal of emulating these complex interactions; however, experimental results on the effectiveness of CDSM have been difficult to interpret because the current metrics for assessing them do not control for the confound of lexical information. We present a new method for assessing the degree to which CDSM capture semantic interactions that dissociates the influences of lexical and compositional information. We then provide a dataset for performing this type of assessment and use it to evaluate six compositional models using both co-occurrence based and neural language model input vectors. Results show that neural language input vectors are consistently superior to co-occurrence based vectors, that several CDSM capture substantial compositional information, and that, surprisingly, vector addition matches and is in many cases superior to purpose-built paramaterized models.
AB - Complex interactions among the meanings of words are important factors in the function that maps word meanings to phrase meanings. Recently, compositional distributional semantics models (CDSM) have been designed with the goal of emulating these complex interactions; however, experimental results on the effectiveness of CDSM have been difficult to interpret because the current metrics for assessing them do not control for the confound of lexical information. We present a new method for assessing the degree to which CDSM capture semantic interactions that dissociates the influences of lexical and compositional information. We then provide a dataset for performing this type of assessment and use it to evaluate six compositional models using both co-occurrence based and neural language model input vectors. Results show that neural language input vectors are consistently superior to co-occurrence based vectors, that several CDSM capture substantial compositional information, and that, surprisingly, vector addition matches and is in many cases superior to purpose-built paramaterized models.
UR - http://www.scopus.com/inward/record.url?scp=84959882770&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84959882770&partnerID=8YFLogxK
U2 - 10.18653/v1/s15-1023
DO - 10.18653/v1/s15-1023
M3 - Conference contribution
AN - SCOPUS:84959882770
T3 - Proceedings of the 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015
SP - 199
EP - 204
BT - Proceedings of the 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015
PB - Association for Computational Linguistics (ACL)
T2 - 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015
Y2 - 4 June 2015 through 5 June 2015
ER -