Unsupervised sense disambiguation using bilingual probabilistic models

Title: Unsupervised sense disambiguation using bilingual probabilistic models
Publication Type: Conference Papers
Year of Publication: 2004
Authors: Bhattacharya I, Getoor L, Bengio Y
Conference Name: Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics
Date Published: 2004
Publisher: Association for Computational Linguistics
Conference Location: Stroudsburg, PA, USA
Abstract

We describe two probabilistic models for unsupervised word-sense disambiguation using parallel corpora. The first model, which we call the Sense model, builds on the work of Diab and Resnik (2002), which uses both parallel text and a sense inventory for the target language, and recasts their approach in a probabilistic framework. The second model, which we call the Concept model, is a hierarchical model that uses a concept latent variable to relate different language-specific sense labels. We show that both models improve performance on the word-sense disambiguation task over previous unsupervised approaches, with the Concept model showing the largest improvement. Furthermore, in learning the Concept model, we learn a sense inventory for the parallel language as a by-product.

URL: http://dx.doi.org/10.3115/1218955.1218992
DOI: 10.3115/1218955.1218992