Comprehensive stereotype content dictionaries using a semi-automated method

Gandalf Nicolas, Xuechunzi Bai, Susan T. Fiske

Research output: Contribution to journalArticlepeer-review

Abstract

Advances in natural language processing provide accessible approaches to analyze psychological open-ended data. However, comprehensive instruments for text analysis of stereotype content are missing. We developed stereotype content dictionaries using a semi-automated method based on WordNet and word embeddings. These stereotype content dictionaries covered over 80% of open-ended stereotypes about salient American social groups, compared to 20% coverage from words extracted directly from the stereotype content literature. The dictionaries showed high levels of internal consistency and validity, predicting stereotype scale ratings and human judgments of online text. We developed the R package Semi-Automated Dictionary Creation for Analyzing Text (SADCAT; https://github.com/gandalfnicolas/SADCAT) for access to the stereotype content dictionaries and the creation of novel dictionaries for constructs of interest. Potential applications of the dictionaries range from advancing person perception theories through laboratory studies and analysis of online data to identifying social biases in artificial intelligence, social media, and other ubiquitous text sources.

Original languageEnglish (US)
Pages (from-to)178-196
Number of pages19
JournalEuropean Journal of Social Psychology
Volume51
Issue number1
DOIs
StatePublished - Feb 2021

All Science Journal Classification (ASJC) codes

  • Social Psychology

Keywords

  • WordNet
  • dictionaries
  • stereotype content
  • text analysis
  • word embeddings

Fingerprint Dive into the research topics of 'Comprehensive stereotype content dictionaries using a semi-automated method'. Together they form a unique fingerprint.

Cite this