TY - GEN
T1 - Unsupervised topic modelling for multi-party spoken discourse
AU - Purver, Matthew
AU - Körding, Konrad P.
AU - Griffiths, Thomas L.
AU - Tenenbaum, Joshua B.
PY - 2006
Y1 - 2006
N2 - We present a method for unsupervised topic modelling which adapts methods used in document classification (Blei et al., 2003; Griffiths and Steyvers, 2004) to unsegmented multi-party discourse transcripts. We show how Bayesian inference in this generative model can be used to simultaneously address the problems of topic segmentation and topic identification: automatically segmenting multi-party meetings into topically coherent segments with performance which compares well with previous unsupervised segmentation-only methods (Galley et al., 2003) while simultaneously extracting topics which rate highly when assessed for coherence by human judges. We also show that this method appears robust in the face of off-topic dialogue and speech recognition errors.
AB - We present a method for unsupervised topic modelling which adapts methods used in document classification (Blei et al., 2003; Griffiths and Steyvers, 2004) to unsegmented multi-party discourse transcripts. We show how Bayesian inference in this generative model can be used to simultaneously address the problems of topic segmentation and topic identification: automatically segmenting multi-party meetings into topically coherent segments with performance which compares well with previous unsupervised segmentation-only methods (Galley et al., 2003) while simultaneously extracting topics which rate highly when assessed for coherence by human judges. We also show that this method appears robust in the face of off-topic dialogue and speech recognition errors.
UR - http://www.scopus.com/inward/record.url?scp=84860524978&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84860524978&partnerID=8YFLogxK
U2 - 10.3115/1220175.1220178
DO - 10.3115/1220175.1220178
M3 - Conference contribution
AN - SCOPUS:84860524978
SN - 1932432655
SN - 9781932432657
T3 - COLING/ACL 2006 - 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference
SP - 17
EP - 24
BT - COLING/ACL 2006 - 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference
PB - Association for Computational Linguistics (ACL)
T2 - 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, COLING/ACL 2006
Y2 - 17 July 2006 through 21 July 2006
ER -