Building Safer AI Systems, OpenAI, Ongoing - This resource details OpenAI's methodologies for developing secure AI, including strategies for content moderation and preventing misuse.
Automated Content Moderation: A Survey, Shahriar, B. and Nandi, A., 2020, Journal of Information Systems and Telecommunication, Vol. 8 (Academic Center for Education, Culture and Research (ACECR)), DOI: 10.22061/JIST.2020.10.155.1504 - This survey paper reviews various techniques and systems used for automated content moderation, covering different types of harmful content.
Google AI Principles, Google, 2018 (Google) - This document outlines Google's ethical guidelines for AI development, emphasizing safety and responsible use, which supports the need for content moderation.
Perspective API, Jigsaw, 2017 (Jigsaw (a Google company)) - The official website for an API that helps identify toxic and other harmful comments, offering multi-category content classification.