Anthropic sur Twitter : "We examine which safety techniques for LMs are more robust to human-written, adversarial inputs ..."
Tags:
About This Document
File info