Language (Technology) is Power: A Critical Survey of "Bias" in NLP, Su Lin Blodgett, Solon Barocas, Hal Daumé III, Hanna Wallach, 2020. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (Association for Computational Linguistics). DOI: 10.18653/v1/2020.acl-main.485 - This survey critically examines how "bias" is conceptualized in NLP, discusses the forms of harm involved and the challenges of evaluation, and gives broader context for the section's topics.
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models, Samuel Gehman, Suchin Gururangan, Maarten Sap, Yejin Choi, Noah A. Smith, 2020. Findings of the Association for Computational Linguistics: EMNLP 2020 (Association for Computational Linguistics). DOI: 10.18653/v1/2020.findings-emnlp.301 - This paper introduces a method and dataset for systematically evaluating the generation of toxic content by language models, directly addressing toxicity measurement.