Highly Granular Dialect Normalization and Phonological Dialect Translation for Limburgish

Andreas Simons, Stefano De Pascale, Karlien Franco

Onderzoeksoutput: Conference paper

30 Downloads (Pure)

Samenvatting

We study highly granular dialect normalization and phonological dialect translation on Limburgish, a non-standardized low-resource language with a wide variation in spelling conventions and phonology. We find improvements to the traditional transformer by embedding the geographic coordinates of dialects in dialect normalization tasks and use these geographically-embedded transformers to translate words between the phonologies of different dialects. These results are found to be consistent with notions in traditional Limburgish dialectology.

Originele taal-2English
TitelVarDial 2024 - 11th Workshop on NLP for Similar Languages, Varieties and Dialects, Proceedings of the Workshop
RedacteurenYves Scherrer, Tommi Jauhiainen, Nikola Ljubesic, Marcos Zampieri, Preslav Nakov, Jorg Tiedemann
Plaats van productieMexico City
UitgeverijAssociation for Computational Linguistics (ACL)
Pagina's152-162
Aantal pagina's11
ISBN van elektronische versie9798891761049
ISBN van geprinte versie9798891761049
StatusPublished - 2024
Evenement11th Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial 2024 - Mexico City, Mexico
Duur: 20 jun. 2024 → …

Publicatie series

NaamVarDial 2024 - 11th Workshop on NLP for Similar Languages, Varieties and Dialects, Proceedings of the Workshop

Conference

Conference11th Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial 2024
Land/RegioMexico
StadMexico City
Periode20/06/24 → …

Bibliografische nota

Publisher Copyright:
© 2024 Association for Computational Linguistics.

Citeer dit