Automatic Language Identification in Texts

Author:   Tommi Jauhiainen ,  Marcos Zampieri ,  Timothy Baldwin ,  Krister Lindén
Publisher:   Springer International Publishing AG
ISBN:  

9783031458248


Pages:   148
Publication Date:   04 January 2025
Format:   Paperback
Availability:   Manufactured on demand   Availability explained
We will order this item for you from a manufactured on demand supplier.

Our Price $118.77 Quantity:  
Add to Cart

Share |

Automatic Language Identification in Texts


Overview

Full Product Details

Author:   Tommi Jauhiainen ,  Marcos Zampieri ,  Timothy Baldwin ,  Krister Lindén
Publisher:   Springer International Publishing AG
Imprint:   Springer International Publishing AG
ISBN:  

9783031458248


ISBN 10:   3031458249
Pages:   148
Publication Date:   04 January 2025
Audience:   Professional and scholarly ,  Professional & Vocational
Format:   Paperback
Publisher's Status:   Active
Availability:   Manufactured on demand   Availability explained
We will order this item for you from a manufactured on demand supplier.

Table of Contents

1 Introduction to Language Identification.- 2 Features and Methods.- 3 Evaluation and measurement.- 4 Specific Challenges of Variation and Text Types.- 5 Large scale, Multi-domain Language Identification.- 6 Applications and Related Tasks.- 7 Conclusion and Future Directions.

Reviews

Author Information

Tommi Jauhiainen, Ph.D., is a Post-doctoral Researcher at The University of Helsinki. He wrote his master’s thesis on automatic language identification and continued his research on the same subject as a doctoral student. Dr. Jauhiainen organized the first shared task in Cuneiform Language Identification (CLI) in 2019 as well as the Uralic Language Identification (ULI) shared tasks in 2020 and 2021. He is the first author of approximately 20 peer-reviewed publications on language identification. Marcos Zampieri, Ph.D., is an Assistant Professor at George Mason University. He received his PhD from Saarland University with a thesis on computational modelling of language variation. He has published over 100 peer-reviewed papers on various topics in computational linguistics and NLP such as language and dialect identification, native language identification, machine translation, lexical complexity prediction, and social media mining.  Timothy Baldwin, Ph.D., is the Acting Provost and Chair of the Department of Natural Language Processing at Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) in addition to being a Melbourne Laureate Professor in the School of Computing and Information Systems at The University of Melbourne. Prior to joining The University of Melbourne, he was a Senior Research Engineer at the Center for the Study of Language and Information at Stanford University. He is the author of over 450 peer-reviewed publications across diverse topics in natural language processing and AI, in addition to being an ARC Future Fellow, and the recipient of a number of prestigious awards at top conferences.  Krister Lindén, Ph.D., is the Research Director of Language Technology at the University of Helsinki in addition to the National Coordinator of FIN-CLARIN, the Finnish Node of CLARIN ERIC, which is a European research infrastructure for Social Sciences and the Humanities. Heis the Chair of the CLARIN National Coordinators Forum and a member of CLIC (Committee for Legal and Ethical Issues in CLARIN). He holds a doctoral degree in Language Technology from the University of Helsinki. He is the co-author of more than 160 publications related to language technology and its utilization in digital humanities and language resource processing. He is currently also a deputy team leader in the Centre of Excellence of Ancient Near Eastern Empires.

Tab Content 6

Author Website:  

Countries Available

All regions
Latest Reading Guide

SEPRG2025

 

Shopping Cart
Your cart is empty
Shopping cart
Mailing List