A Comparison of Language Identification Approaches on Short, Query-Style Texts
Gurrin, Cathal (Hrsg). Advances in information retrieval : 32nd European Conference on IR Research, ECIR 2010, Milton Keynes, UK, March 28 - 31, 2010 ; proceedings. Berlin: Springer 2010 S. 611 - 614
Erscheinungsjahr: 2010
ISBN/ISSN: 978-3-642-12274-3
Publikationstyp: Buchbeitrag (Konferenzband)
Sprache: Englisch
Geprüft | Bibliothek |
Inhaltszusammenfassung
In a multi-language Information Retrieval setting, the knowledge about the language of a user query is important for further processing. Hence, we compare the performance of some typical approaches for language detection on very short, query-style texts. The results show that already for single words an accuracy of more than 80Ên be achieved, for slightly longer texts we even observed accuracy values close to 100%.
Klassifikation
DFG Fachgebiet:
Informatik
DDC Sachgruppe:
Informatik