TY - GEN
T1 - Exploiting Attention to Reveal Shortcomings in Memory Models
AU - Burns, Kaylee
AU - Nematzadeh, Aida
AU - Grant, Erin
AU - Gopnik, Alison
AU - Griffiths, Thomas L.
N1 - Publisher Copyright:
© 2018 Association for Computational Linguistics
PY - 2018
Y1 - 2018
N2 - The decision making processes of deep networks are difficult to understand and while their accuracy often improves with increased architectural complexity, so too does their opacity. Practical use of machine learning models, especially for question and answering applications, demands a system that is interpretable. We analyze the attention of a memory network model to reconcile contradictory performance on a challenging question-answering dataset that is inspired by theory-of-mind experiments. We equate success on questions to task classification, which explains not only test-time failures but also how well the model generalizes to new training conditions.
AB - The decision making processes of deep networks are difficult to understand and while their accuracy often improves with increased architectural complexity, so too does their opacity. Practical use of machine learning models, especially for question and answering applications, demands a system that is interpretable. We analyze the attention of a memory network model to reconcile contradictory performance on a challenging question-answering dataset that is inspired by theory-of-mind experiments. We equate success on questions to task classification, which explains not only test-time failures but also how well the model generalizes to new training conditions.
UR - http://www.scopus.com/inward/record.url?scp=85070342297&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85070342297&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85070342297
T3 - EMNLP 2018 - 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, Proceedings of the 1st Workshop
SP - 378
EP - 380
BT - EMNLP 2018 - 2018 EMNLP Workshop BlackboxNLP
PB - Association for Computational Linguistics (ACL)
T2 - 1st Workshop on BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, co-located with the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018
Y2 - 1 November 2018
ER -