LSE - Small Logo
LSE - Small Logo

Hao Peng

Misha Teplitskiy

David Jurgens

April 27th, 2024

Researchers with minority ethnic names are written out of US science journalism

0 comments | 6 shares

Estimated reading time: 8 minutes

Hao Peng

Misha Teplitskiy

David Jurgens

April 27th, 2024

Researchers with minority ethnic names are written out of US science journalism

0 comments | 6 shares

Estimated reading time: 8 minutes

Drawing on a study of 223,587 science news stories, Hao Peng, Misha Teplitskiy, and David Jurgens find that researchers with non-Anglo names are more likely to not be directly named in news stories and have their names replaced with those of their institutions.


News media plays a key role in disseminating research findings to the public. It also adds to scholars’ academic prestige and shapes the public’s perception of who is doing good science. Yet, we know little about how journalists choose to present researchers in their news stories and the potential downstream effects these choices have on the career and leadership representation of marginalised groups in academia.

To begin to analyse how scientists’ demographics are related to science reporting, it is useful to separate media coverage into two aspects: (1) the likelihood of coverage: whose paper gets reported on and what we might call (2) the quality of coverage: how thorough and accurate the reporting is when it is published. For example, media coverage may give much or little space to describing the researchers, and potentially include their quotes.

we know little about how journalists choose to present researchers in their news stories and the potential downstream effects these choices have on the career and leadership representation of marginalised groups in academia

In our recent paper, we focused on coverage quality, measuring the variation in how journalists choose to describe the researchers behind a paper. Specifically, we assess whether the author is mentioned by name. Focusing on the subset of papers receiving coverage, we sidestep the question of whose research is covered in the news in the first place, which may itself be associated with author demographics and other confounding factors, such as a paper’s newsworthiness and the author’s self-promotion on social media.

We focused on the “ethnicity” of authors rather than their nationality for two reasons: (1) an author’s nationality may be very fluid, especially in the U.S. (2) journalists generally see only author names’ and affiliations upon reading the paper, and the names often reliably signal race and ethnicity. We decided to use ethnicity because it contains richer information than race.

To match the information available to journalists, we based our study on the perceived ethnicity inferred from names. This choice entails substantial trade-offs. On the one hand, self-described identities may occasionally differ from perceived ones, and some authors may identify with multiple ethnicities. Nevertheless, in most of these mismatching cases, journalists will not know how authors self-identify themselves and instead infer identity from names. In these cases, using authors’ self-identities would be problematic, as it would misrepresent the actual perceptions journalists form and possibly use when they write their stories.

we found that most authors with minority-ethnicity names are significantly less likely to be mentioned than those with Anglo names.

Our corpus of news-reported papers was sourced from Altmetric.com. It consists of 223,587 news stories from 288 U.S.-based outlets reporting on 100,486 scientific papers. We additionally obtained papers’ and authors’ metadata from the Microsoft Academic Graph and the Web of Science databases. For each paper, we focused on authors at the highest “risk” of being mentioned (the first author, last author, and all other authors designated as the corresponding author) and treated each (story, paper, author) triplet as an observation in a logistic regression that predicts author mention with author demographics and controls. We developed a computational method to detect three types of attributions, including name mentions, quotations, and institution mentions.

We found substantial disparities in author mentions across name-inferred ethnicities. These disparities are robust to important factors related to the paper, story, and author such as research topics, journal impact, author prestige, corresponding author status, affiliation location, name complexity, article length, etc. Specifically, we found that most authors with minority-ethnicity names are significantly less likely to be mentioned than those with Anglo names. Of these minority groups, authors with European names were disadvantaged the least, while East Asian and African names disadvantaged the most. There is a up to six percentage points decrease in the mention probability for East Asian and African names, which equals to a 15% decrease relative to Anglo-named authors in media representation.

We find that journalists are more likely to replace African and East Asian named authors with their institutions.

Second, the disparities are similarly large for authors with African and East Asian names who are affiliated with U.S. institutions. In particular, the U.S.-based Chinese, non-Chinese East Asian, and African-named authors experience a 4.8, 3.8, and 4.6 percentage points drop in mention rate compared to their Anglo-named U.S. counterparts. This suggests that being affiliated with U.S. institutions in the same geographic region does not eliminate the observed disparities.

These findings could be due to media bias. For example, journalists may consider certain names as less authoritative than Anglo names. However, our result so far does not directly imply media bias because all U.S.-based authors may still differ in other factors, such as the perceived or actual English proficiency that could impact journalists’ decision on reaching out to them. To directly test rhetorical bias on part of the media, we examined “institution-substitution” where the author is mentioned by their institution but not by name, e.g., being named as “researchers at the University of Michigan.” We find that journalists are more likely to replace African and East Asian named authors with their institutions. Among U.S.-based authors, this mention type should not depend on pragmatic factors such as English fluency. Thus, this substitution effect likely reveals that journalists place less rhetorical value on authors with minority names.

The disparity in Press Releases outlets is particularly notable, as stories in these outlets typically reuse content from university press-releases, suggesting that universities’ press offices themselves, while having less disparity than other outlet types, still prefer to mention scholars with Anglo names

Third, we also find consistent disparities across three types of outlets, including Press Releases, General News, and stories published in outlets focusing on Science & Technology. The disparity in Press Releases outlets is particularly notable, as stories in these outlets typically reuse content from university press-releases, suggesting that universities’ press offices themselves, while having less disparity than other outlet types, still prefer to mention scholars with Anglo names. This result is unexpected because local press offices are expected to have greater direct familiarity with their researchers, reduce the misuse of stereotypes, and to be more responsible for representing minority researchers equitably.

The largest disparities are seen in General News outlets, e.g., The New York Times and The Washington Post, where again scholars with Chinese- and African-associated names have 6.0-8.0 percentage points drop in mention rates. This significant drop reduces nearly one third of the deserved media representation of a large community of scientists (General News outlets mention authors with a 24.2% chance on average). As General News outlets often have well trained editorial staff and science journalists dedicated to accurately reporting science and tend to publish longer stories that have room to mention and engage with authors, this result is alarming. Historically, these ethnic minorities have been stereotyped and underrepresented in U.S. media and leadership roles, which has continued in objective science reporting across all outlet types. The mechanisms of this variation deserve further investigation.

Our work thus suggests that disparities in science media are likely to compound across different aspects of coverage, yielding differences in outcomes much larger than those shown by studies of any one stage.

Our work shows that science journalism is rife with disparities in terms of which authors receive name attributions when their research papers are reported in U.S. news. Mention rates are especially low for East Asian and African names, less pronounced for European names, and even less pronounced for Indian and Middle Eastern names. As science continues to globalise and is increasingly produced by authors from non-Western countries, the way English-language media responds to non-Anglo-named scholars will only grow in importance.

While our study only focuses on the “second stage” of science media coverage and its quality, it is likely that such ethnic disparities are even larger in the first stage of coverage, where media outlets choose whose papers to report on in the first place. Our work thus suggests that disparities in science media are likely to compound across different aspects of coverage, yielding differences in outcomes much larger than those shown by studies of any one stage.


About the author

Hao Peng

Hao Peng is a Postdoc at the Kellogg School of Management of Northwestern University. He is affiliated with the Northwestern Institute on Complex Systems. He studies human-centered topics at the intersection of computational social science, science of innovation, and social networks. His research aims to generate novel insights that can help organizations leverage the full potential of human capital to accelerate discoveries and breakthroughs.

Misha Teplitskiy

Misha Teplitskiy is an Assistant Professor at the University of Michigan School of Information. He is a sociologist of innovation, studying how institutions and technology can help accelerate scientific discovery. His recent work focuses on understanding and mitigating biases in research assessment.

David Jurgens

David Jurgens is an associate professor at the University of Michigan jointly in the School of Information and Department of Computer Science & Engineering. His research develops methods in Natural Language Processing and Computational Social Science to study human behavior and language.

Posted In: Democracy and culture | Uncategorized

Leave a Reply

Your email address will not be published. Required fields are marked *

LSE Review of Books Visit our sister blog: British Politics and Policy at LSE

RSS Latest LSE Events podcasts