publications

2024

GlossLM: A Massively Multilingual Corpus and Pretrained Model for Interlinear Glossed Text
Michael Ginn, Lindia Tjuatja, Taiqi He, Enora Rice, Graham Neubig, Alexis Palmer, Lori Levin
November 2024. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)
Can we teach language models to gloss endangered languages?
Michael Ginn, Mans Hulden, Alexis Palmer
November 2024. Findings of the Association for Computational Linguistics: EMNLP 2024
PyFoma: a Python finite-state compiler module
Mans Hulden, Michael Ginn, Miikka Silfverberg, Michael Hammond
August 2024. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)
BELT: Building Endangered Language Technology
Michael Ginn, David Saavedra-Beltrán, Camilo Robayo, Alexis Palmer
August 2024. Proceedings of the Sixth Workshop on Teaching NLP @ ACL 2024
🏆  Best Paper
Resisting the Lure of the Skyline: Grounding Practices in Active Learning for Morphological Inflection
Saliha Muradoglu, Michael Ginn, Miikka Silfverberg, Mans Hulden
August 2024. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Decomposing Fusional Morphemes with Vector Embeddings
Michael Ginn, Alexis Palmer
June 2024. Proceedings of the 21st SIGMORPHON workshop on Computational Research in Phonetics, Phonology, and Morphology @ NAACL 2024
On the Robustness of Neural Models for Full Sentence Transformation
Michael Ginn, Ali Marashian, Bhargav Shandilya, Claire Post, Enora Rice, Juan Vásquez, Marie McGregor, Matthew Buchholz, Mans Hulden, Alexis Palmer
June 2024. Proceedings of the 4th Workshop on Natural Language Processing for Indigenous Languages of the Americas (AmericasNLP 2024) @ NAACL 2024

2023

Robust Generalization Strategies for Morpheme Glossing in an Endangered Language Documentation Context
Michael Ginn, Alexis Palmer
December 2023. Proceedings of the 1st GenBench Workshop on (Benchmarking) Generalisation in NLP
Findings of the SIGMORPHON 2023 Shared Task on Interlinear Glossing
Michael Ginn, Sarah Moeller, Alexis Palmer, Anna Stacey, Garrett Nicolai, Mans Hulden, Miikka Silfverberg
July 2023. Proceedings of the 20th SIGMORPHON workshop on Computational Research in Phonetics, Phonology, and Morphology
Ginn-Khamov at SemEval-2023 Task 6, Subtask B: Legal Named Entities Extraction for Heterogenous Documents
Michael Ginn, Roman Khamov
July 2023. Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)