Explicitly unbiased large language models still form biased associations

Published in Proceedings of the National Academy of Sciences, 2025

Recommended citation: Xuechunzi Bai and Angelina Wang and Ilia Sucholutsky and Thomas L Griffiths, "Explicitly unbiased large language models still form biased associations." Proceedings of the National Academy of Sciences, 2025. https://www.pnas.org/doi/abs/10.1073/pnas.2416228122

Access paper here