[slides and audio] AI generates covertly racist decisions about people based on their dialect

nature portfolio Corresponding author(s): Valentin Hofmann, Sharese King Last updated by author(s): Jun 21, 2024 # Reporting Summary Nature Portfolio aims to improve the reproducibility of the work it publishes. This form provides structure for consistency and transparency in reporting. For further information on Nature Portfolio policies, see our Editorial Policies and the Editorial Policy Checklist. ## Statistics For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. Confirmed - The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement - A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly - The statistical test(s) used AND whether they are one- or two-sided Only common tests should be described solely by name; describe more complex techniques in the Methods section. - A description of all covariates tested - A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons - A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) - For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted. Give P values as exact values whenever suitable. - For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes - Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated Our web collection on statistics for biologists contains articles on many of the points above. ## Software and code Policy information about availability of computer code Data collection We used Python 3.10 to probe the language models. Specifically, we drew upon the package openai 0.28.1 to probe GPT3.5 and GPT4, and transformers 4.36.2 to probe GPT2, RoBERTa, and T5. Data analysis Data analysis was performed in Python 3.10. The specific packages we used were numpy 1.22.4, pandas 1.5.2, scipy 1.7.3, and statsmodels 0.13.2. All code used for data analysis can be found at https://github.com/valentinhofmann/dialect-prejudice. For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors and reviewers. We strongly encourage codenature portfolio Corresponding author(s): Valentin Hofmann, Sharese King Last updated by author(s): Jun 21, 2024 # Reporting Summary Nature Portfolio aims to improve the reproducibility of the work it publishes. This form provides structure for consistency and transparency in reporting. For further information on Nature Portfolio policies, see our Editorial Policies and the Editorial Policy Checklist. ## Statistics For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. Confirmed - The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement - A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly - The statistical test(s) used AND whether they are one- or two-sided Only common tests should be described solely by name; describe more complex techniques in the Methods section. - A description of all covariates tested - A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons - A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) - For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted. Give P values as exact values whenever suitable. - For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes - Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated Our web collection on statistics for biologists contains articles on many of the points above. ## Software and code Policy information about availability of computer code Data collection We used Python 3.10 to probe the language models. Specifically, we drew upon the package openai 0.28.1 to probe GPT3.5 and GPT4, and transformers 4.36.2 to probe GPT2, RoBERTa, and T5. Data analysis Data analysis was performed in Python 3.10. The specific packages we used were numpy 1.22.4, pandas 1.5.2, scipy 1.7.3, and statsmodels 0.13.2. All code used for data analysis can be found at https://github.com/valentinhofmann/dialect-prejudice. For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors and reviewers. We strongly encourage code

Reporting Summary

Apr 2023 | Valentin Hofmann, Sharese King