Automatic Language Identification in Texts by Tommi Jauhiainen | ISBN 9783031458248

Automatic Language Identification in Texts

by Tommi Jauhiainen, Marcos Zampieri, Timothy Baldwin and Krister Lindén
Contributors
Author: Tommi Jauhiainen
Author: Marcos Zampieri
Author: Timothy Baldwin
Author: Krister Lindén
Book cover: Automatic Language Identification in Texts | Tommi Jauhiainen | EAN 9783031458248 | ISBN 3-031-45824-9 | ISBN 978-3-031-45824-8

This book gives readers a brief history of Language Identification (LI) research and surveys the features and methods most commonly used in the LI literature. LI is the problem of determining the language in which a document is written, and it is a crucial part of many text processing pipelines. The authors use a unified notation to clarify the relationships between common LI methods. The book introduces methods for evaluating LI performance and takes a detailed look at LI-related shared tasks. The authors identify open issues, discuss the applications of LI and related tasks, and propose future directions for LI research.