TY - GEN
T1 - Equipping educational applications with domain knowledge
AU - Sakakini, Tarek
AU - Gong, Hongyu
AU - Lee, Jong Yoon
AU - Schloss, Robert
AU - Xiong, Jinjun
AU - Bhat, Suma
N1 - Publisher Copyright:
© BEA 2019.All right reserved.
PY - 2019
Y1 - 2019
N2 - One of the challenges of building natural language processing (NLP) applications for education is finding a large domain-specific corpus for the subject of interest (e.g., history or science). To address this challenge, we propose a tool, Dexter, that extracts a subjectspecific corpus from a heterogeneous corpus, such as Wikipedia, by relying on a small seed corpus and distributed document representations. We empirically show the impact of the generated corpus on language modeling, estimating word embeddings, and consequently, distractor generation, resulting in a better performance than while using a general domain corpus, a heuristically constructed domainspecific corpus, and a corpus generated by a popular system: BootCaT.
AB - One of the challenges of building natural language processing (NLP) applications for education is finding a large domain-specific corpus for the subject of interest (e.g., history or science). To address this challenge, we propose a tool, Dexter, that extracts a subjectspecific corpus from a heterogeneous corpus, such as Wikipedia, by relying on a small seed corpus and distributed document representations. We empirically show the impact of the generated corpus on language modeling, estimating word embeddings, and consequently, distractor generation, resulting in a better performance than while using a general domain corpus, a heuristically constructed domainspecific corpus, and a corpus generated by a popular system: BootCaT.
UR - http://www.scopus.com/inward/record.url?scp=85120969569&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85120969569&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85120969569
T3 - ACL 2019 - Innovative Use of NLP for Building Educational Applications, BEA 2019 - Proceedings of the 14th Workshop
SP - 472
EP - 477
BT - ACL 2019 - Innovative Use of NLP for Building Educational Applications, BEA 2019 - Proceedings of the 14th Workshop
PB - Association for Computational Linguistics (ACL)
T2 - 14th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2019, collocated with ACL 2019
Y2 - 2 August 2019
ER -