Standardizing Heterogeneous Corpora with DUUR: A Dual Data- and Process-Oriented Approach to Enhancing NLP Pipeline Integration
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, The Asian Federation of Natural Language Processing and The Association for Computational Linguistics, 2025, ISBN 979-8-89176-303-6
@inproceedings{Hammerla:et:al:2025a,
author = "L. Hammerla and A. Mehler and G. Abrami",
title = "Standardizing Heterogeneous Corpora with DUUR: A Dual Data- and Process-Oriented Approach to Enhancing NLP Pipeline Integration",
year = 2025,
booktitle = "Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics",
publisher = "The Asian Federation of Natural Language Processing and The Association for Computational Linguistics",
pages = "1410--1425",
keywords = "duui",
url = "https://aclanthology.org/2025.findings-ijcnlp.87/",
isbn = "979-8-89176-303-6"
}