Drawing on insights from a recent international survey on research integrity and a recent high-profile case, Nick Allum and Robin Brooker find previous work on scientific plagiarism may have underestimated its prevalence.
The resignation of Claudine Gay as President of Harvard University has pitched the subject of research integrity into the mainstream. Gay, along with the presidents of MIT and UPenn, fell foul of questioning from Rep. Elise Stefanik (R-N.Y.) at a House committee hearing on antisemitism in November 2023 concerning their institutions’ response to calls for the genocide of Jews. However, if demands for her resignation had simply concerned her performance in front of the House committee, there is an even chance she would still be in post. It was the additional questions surrounding her scholarship and the opening salvo of what has been called the ‘plagiarism wars’ that tipped the scales in favour of her departure.
Plagiarism can be defined as:
“When somebody presents the work of others (data, words or theories) as if they were his/her own and without proper acknowledgment”.
Claudine Gay’s work in political science is currently facing rigorous examination for possible plagiarism. Recent reports have revealed as many as 50 instances of plagiarism in her work, stretching back to the 1990s and encompassing her PhD thesis.
Appropriately acknowledging the work of others is fundamental to research integrity in a well-functioning scholarly system but behind this broad definition there are numerous forms of plagiarism, some of which are arguably more or less damaging. For example, one of us (Nick Allum) discovered recently that a co-authored article published in a scholarly journal was completely replicated and published under different names in another journal. A particularly egregious example, encouraged by the rise of predatory publishers with little regard for research integrity.
In Gay’s case, many examples involve exact copying of short sections of text without attribution. While this still sounds like an open and shut case, there have been variable responses as to its seriousness – not least from researchers, such as David Canon, whose work Gay now admits to copying. Some of the examples could be regarded as minor infractions as they are technical descriptions of statistical models, or set out only background information. Arguably, they do not claim an important original insight that is actually due to someone else. Not everyone agrees. For example, Carol Swain, who alleges that her work was demonstrably plagiarised by Gay.
In light of this furore, it is useful to ask: how widespread are the kind of behaviours of which Gay has been accused and in some cases admitted? A recent meta-analysis (combining the results of multiple previous studies) estimated that 2.9 percent of researchers had admitted to plagiarism or other types of misconduct – falsification or fabrication of data. Another study estimates that 1.7 percent of researchers have plagiarised while 30 percent have witnessed others plagiarising. This discrepancy might be due to individuals’ reluctance to admit their own misconduct and the greater likelihood of observing others’ behaviour.
In our recent International Research Integrity Survey (IRIS), part of a larger research integrity project, we asked over 35,000 respondents, amongst other things, about how often they had engaged in questionable research practices (The supporting data can be found on here). One of the specific questions we asked was:
“Thinking about research carried out for your publications over the last three years, how often has the following occurred?
“Deliberately using another researcher’s unpublished idea without giving credit. For example, publishing an idea voiced by a colleague at an informal meeting without giving them credit.”
The response options were often, sometimes, rarely, never. We compared those who answered ‘never’ with all other responses. This is because even someone who reports rarely doing this is still saying that they have done it at least once in the past three years, which, considering academics are unlikely to publish large numbers of papers and books over this period, is not trivial. The question does not ask about copying published work, but captures the essence of plagiarism in that it concerns passing off an idea as one’s own that was actually originated by somebody else.
We found that around 7 percent of researchers across the whole sample owned up to stealing someone else’s idea in this way. This is higher than previous estimates. Perhaps it is because stealing an idea rather than text is much harder to prove and therefore less likely to be detected and punished. This average also conceals some geographical differences. The heatmap below illustrates the varying percentages of researchers in different countries who admit to stealing ideas.
While many countries are in the 4-8 percent range, a few have even higher rates – typically, but not limited to, states that have joined the European Union more recently. Beyond the country where they are employed, it is also interesting to look at what kind of researchers are more likely to report stealing an idea.
In the plot below, we show how much more probable it is for a researcher to report this behaviour based on different attributes. The dots show the average increase or decrease in probability of reporting stealing an idea for someone with that attribute. The bars represent the margin of error (95 percent confidence interval). We model all these attributes together to predict who is most likely to report stealing an idea.
The biggest contrast is between industry and academia, where for those in industry the probability increases by about 3 percent. Non-profit and government researchers are also a little more likely to admit to stealing an idea, as are those working in humanities and social sciences. Of the characteristics associated with a lower probability, being female and expressing stronger adherence to classical ‘Mertonian norms’ of science are the most pertinent, along with working in a more collegial and supportive working environment.
Does Claudine Gay fit this profile? A ‘risk factor’ is that she works in social science (political science). However, she also works in academia – generally the least probable sector where plagiarism occurs. Women also have a lower probability of admitting to taking ideas from others. Unusually, Gay has worked almost exclusively in high status universities (Stanford and Harvard), where the pressure to publish is strong. It is possible that this environment is less collegial and incentives to cut corners to publish may be greater. For whatever reason, she does not fit the profile well.
That said, our models are not very precise in their prediction. Rather, our research indicates that the tendency to engage in questionable research practices is more an individual habit, with minimal correlation to the country, organisation, or observable attributes of the researchers. One thing we do know is that it is relatively uncommon, which makes it a serious accusation that shouldn’t be shrugged off.
Ultimately, plagiarism undermines research integrity. And despite varied views on the severity of Gay’s acts, citing sources is fundamental in scholarship. Her downfall shows that this ethical obligation extends even to those at the top.
The content generated on this blog is for information purposes only. This Article gives the views and opinions of the authors and does not reflect the views and opinions of the Impact of Social Science blog (the blog), nor of the London School of Economics and Political Science. Please review our comments policy if you have any concerns on posting a comment below.
Image Credit: fizkes on Shutterstock.
One wonders : if Gay who reached the highest level being president of Harvard cheats , what is the case of much less known scientists ? Gay was under scrutiny (as a result of her position) how could she plagiarize and not be afraid to be caught . This means the situation is much much serious than it is recognized to be and that a big chunk of what is published is , to say the least, suspicious. With the advent of IA, things will be more dishearteneing and out of control.
That is exactly the point of the post. There are empirical estimates for the extent of plagiarism.
My favorite quote from George Orwell … Winston Smith is contemplating the nature of his job. He reflects that it wasn’t even lying, it was just substituting one piece of nonsense for another.
The vast majority of published academic work falls into this category. It is boring, turgid crap that nobody is interested in. At the end of the day, who cares if its plagarised or not??
An interesting article thanks for posting! Always difficult to tell how honest people will be on self-report measures about topics like this, I wonder if there is any other (ethical) way of researching this.
I think much of plagiarism is pressure-most researchers do believe they have a unique voice that they feel should be listened to. However, many folks are vulnerable due to a variety of pressures-young researchers looking to become established and those perceived to have ‘trajectory’, the favoured ones, often have the greatest pressure and say yes to everything, with the result they need to write 40000 words a day for 7 days a week for a year to make the deadlines. So it is not surprising really. Since our only metric for recognition is publication, nothing will change, indeed, AI will make it things so muddy that the whole system might even disintegrate. I tend to write in a very individualistic way, but I bet there is an AI programme that has my prejudices/hobby-horses/obsessions already out there….
Is plagiarism always an objective issue?
Clearly, copy-and-paste the full text of others hardly will not be considered plagiarism by anyone, but what about the ideas that emerge in my mind from a coffee talk? in a conference? from a piece of news? from an academic paper?
May be for a researcher going deeper in something a colleague marginally commented in a coffee talk is not plagiarism (so, he will not report it as plagiarism, even he will say the full work is his) while another one sincerely believes that this idea that he publicly shared belongs to him and should be credited. May be, that idea was the result of deep thoughts and preliminary tests but he did not present it like that.. Those different perceptions of whether the ideas inspired in such environments (or the developments that follow from those ideas) are clearly culturally-based. Even more, for example more individualistic/competitive environments may lead to credit each minimum point to his/her author… while others can see that if someone is leaving breadcrumbs behind him, others have the right to use them.
Besides, does every sentence (data, conclusion, …) that another researcher has said must be attributed? We do not expect (and texts will be un-readable) if everytime someone writes that Earth is round must cite Pitagoras or that days last 24 hours… So, not crediting ideas/data results/.. from others can be considered “normal” and then non-plagiarism at the extent one does not atribute to oneself obviously. Then, I honestly think I can say in my research that “I depart from the fact that the earth is round” without crediting the author. How many other statements (“increasing taxes reduce the money on private hands”, “increasing taxes reduces entrepreneurship” / “increasing taxes reduces the potential economic growth” Someones will see the three statements as obvious that do not need to be credited, while other ones will see a deep analysis behind that must be credited) ? Are the factors that classify them as “must be attributed” or “it is unnecesary to attribute them” objective and universally shared? Personally, I have doubts.
Thanks for the article!
These are really great points. Thank you for pulling apart assumptions about the scope of that question and how to interpret it. Food for a lot of thought!