HONEST: Measuring Hurtful Sentence Completion in Language Models

Nozza, Debora; Bianchi, Federico; Hovy, Dirk

doi:10.18653/v1/2021.naacl-main.191

Public

HONEST: Measuring Hurtful Sentence Completion in Language Models

Published • Jan 1, 2021

Authors:

Debora Nozza

Federico Bianchi

Dirk Hovy

Abstract

Language models have revolutionized the field of NLP. However, language models capture and proliferate hurtful stereotypes, especially in text generation. Our results show that 4.3% of the time, language models complete a sentence with a hurtful word. These cases are not random, but follow language ...

View

Open Access

Subject

Sentence

Natural language processing

Lexicon

Generate AI Take for this paper

Highlights, strengths & weaknesses, commercial applications, and societal impact — written for this paper on demand.

Research Assistant

AI chat, annotations, notes & similar papers

Finding related papers...

Discussions

(0)

No comments yet

Be the first to share your thoughts!