Non-Repeatable Experiments and Non-Reproducible Results: The Reproducibility Crisis in Human Evaluation in NLP
Findings of the Association for Computational Linguistics: ACL 2023. Rogers, A., Boyd-Graber, J., Okazaki, N. (eds.). Association for Computational Linguistics, pp. 3676-3687, 12 pages
Chapters in Books, Reports and Conference Proceedings: Chapters
- Digital Object Identifier
- https://doi.org/10.18653/v1/2023.findings-acl.226
- Open Access
- http://aura.abdn.ac.uk/bitstream/2164/21800/1/Belz_etal_ACL_Non-repetable_Experiments_And_VoR.pdf