The Green Open Access route, which encourages the use of institutional repositories for depositing manuscripts as drafts or after a publisher’s embargo period, seems to many in the Humanities and Social Sciences like a more agreeable way to widen access to scholarly work. Mike Taylor is not convinced that the Green route is able to adequately address the key concerns of the access problem. He outlines five limitations that Green access will have to address if institutional repositories are to become truly useful.
The progressive RCUK policy on open access has recently come under fire, particularly from humanities scholars, for favouring Gold Open Access (OA) over Green. For various reasons — and I won’t, for now, go into the question of which of these reasons are and aren’t sound — they favour an approach to open access where publishers keep final versions of their papers behind paywalls, but drafts are deposited in institutional repositories (IRs) and people who want to read the paper can have access to the drafts.
It’s appealing to think that this relatively lightweight way of solving the access problem can work. Unfortunately, I’m not convinced it can, for several reasons. I’ll discuss these below, not so much with the intention of persuading people that Gold is a better approach, but with the hope that those of you who are Green advocates have seen things that I’ve missed and you’ll be able to explain why it can work after all.
1. Two-class system
Most fundamentally, I worry that Green OA creates a two-class system which I can’t approve of. It does this in two ways:
First, it necessarily creates two classes of papers: author’s draft and publishers’ final versions. These will differ in some respects, and it’s hard in general to know what those respects are. Of course pagination will differ — which means you can’t cite page-numbers reliably. But other changes are possible as well. For example, Matt and I have a paper in press now for which a whole additional figure — an important one — was added at the proofing stage. In our case, the paper will be OA anyway, but if it were not then the authors’ manuscript would be a poor substitute.
And by implication, Green OA creates two classes of researchers — a potentially harmful division between those privileged few who have the “proper” papers and an underclass who have only manuscripts. (It doesn’t help that for stupid historical reasons, our manuscripts are often butt-ugly: double-spaced, line-numbered, all the figures at the end instead of where they’re needed, etc.) Admittedly, the two-classes-of-researcher problem is not created by Green OA: it already exists, in a worse form where the underclass doesn’t have access to any version of the paper. But whereas Gold OA solves this problem (everyone has exactly the same access to PLOS and PeerJ papers), Green doesn’t.
(To me, it’s obvious that democratising access is a good thing. But now that I’ve made the notion explicit, I can’t help the uncharitable thought that there may be those out there who want to maintain a two-class system — to retain a notion that they are “in” while others are “out”. I hope I’m wrong, and I’m certainly not accusing Green OA advocates of having this motivation. It just seems like it might be implicit in some of the broader struggles over access. Anyway, let’s not confuse this separate potential problem with access with the actual problems with Green OA I’m addressing in this post.)
2. Expense of continuing subscriptions
I find it baffling that people keep talking as though Green OA is cheaper than Gold. It isn’t, at all. As I’ve shown previously, the cost to the world of a paywalled paper (aggregated across all subscriptions) is about $5333. There is no reason to think that will change under the Green model, in which we continue to give the final and best version of our work to publishers. By contrast, even the publisher-influenced Finch estimates typical Gold APCs as £1500-£2000 (about $2270-$3030), which amounts to 43%-57% as much. (Conveniently, the midpoint of the Finch range, £1750, is about $2650, which is almost exactly half of what we pay by the subscription model.
But the true cost of Gold OA is much, much less. Follow the link for the detail, but one credible banner figure starts with the observation that half of all Gold OA articles are published at no cost to the author and that the average APC of the other half is $906, to arrive at a true average APC of $453 — about one twelfth of the cost for a paywalled article. So for purely pragmatic financial reasons, Green seems like a silly path. There’s a very short-term saving, sure, as we avoid paying APCs. But we have to look further ahead than the next five years.
3. Embargoes
Now there is nothing intrinsic to Green OA that means embargoes must be in place. It’s perfectly possible, and manifestly desirable, that no-embargo Green-OA mandate should be enacted, requiring that authors’ final manuscripts become available immediately on publication. But for whatever historical reasons (and I admit I find this baffling) there are few or no Green-OA mandates that do this. Even the best of them seem to allow a six-month delay; twelve months is not uncommon (and Michael Eisen worries that the new White House policy with further establish twelve months as the norm.
I will have more to say about embargoes in a subsequent post. (SPOILER: it’s not going to be pretty.) But for now it suffices to say that any system that makes research freely available only a year after it’s published is wholly inadequate. Not to mention stupid. Stupid and inadequate. So if Green OA is going to be the solution we need, it has to break free from embargoes.
4. Non-open licences
Similarly, there is no intrinsic reason why Green OA should mean non-open licences and Gold OA should mean truly open (BOAI-compliant) open access. And yet history has brought us to a point where is often how things are. For example, the RCUK policy (even before its progressive erosion got properly under way) says of its Gold arm that “The CC-BY license should be used in this case”, but contains weasel words in its Green arm:
“the journal must allow deposit of Accepted Manuscripts that include all changes resulting from peer review (but not necessarily incorporating the publisher’s formatting) in other repositories, without restrictions on non-commercial re-use.”
This just won’t do. It’s not open access. To quote Heather Piwowar’s pithy statement once more, “We do basic research not only to know more, but to do more”. Non-commercial licences impede the use of research, and that’s not to the benefit of wider society. (I won’t labour this point now, because I’ll have more to say on non-commercial clauses in a subsequent post.) So as with embargoes, if Green OA is going to be the solution we need, it has to break free to its habitual acceptance of non-commercial clauses.
5. Practical failings
On top of the fundamental problems already discussed (two-class system, expense of continuing subscriptions, embargoes and non-open licences), the repository system as it exists today suffers from a suite of practical problems that render it pretty inadequate.
- Many institutions don’t even have an IR; or if they do, it doesn’t work.
- Many scholars aren’t associated with an institution and so don’t know where they should reposit their manuscripts. (That this is overlooked is a symptom of an unfortunate elitist tendency among academics.) [UPDATE 4th March: thanks to Neil Stewart, whose comment below points out Open Depot as a solution to this.]
- The use of IRs involves an institution-by-institution fragmentation, with different user interfaces, policies, etc.
- For whatever reasons, many scholars do not bother to reposit their manuscripts in institution repositories.
- Even when mandates are in place, compliance is often miserable, to the point where Peter Suber considers the 80% NIH compliance rate as “respectable”. It really isn’t. 100% is acceptable; 99% is respectable.
- Many IRs have abject search facilities, often for example lacking the ability to restrict searches to papers that are actually available.
- Many IRs impose unnecessary restrictions on the use of the materials they contain: for example, Bath’s repo prohibits further redistribution.
- There is no central point for searching all IRs (at least not one that is half-decent; I know about OAIster).
- The quality of metadata within most IRs variable at best
- Use of metadata across IRs is inconsistent — hence many of the problems that render OAIster near-useless.
… and, I am sure, many more that I’ve not thought of right now.
Could these issues be addressed? Yes, probably; but ten years have unfortunately not done much to resolve them, so I don’t feel all that confident that the next ten will. Do the IR advocates have a plan for solving these problems? Because they are much more political/sociological than technical, and those always seem to be the hardest ones to solve.
This article was originally published on Mike Taylor’s blog, and is published here with permission.
Note: This article gives the views of the author(s), and not the position of the Impact of Social Sciences blog, nor of the London School of Economics.
About the author:
Mike Taylor is a dinosaur palaeontologist, computer programmer and open access advocate, affiliated with the University of Bristol. He has named two new dinosaurs, Xenoposeidon (“alien earthquake god”) and Brontomerus (“thunder thighs”) and written other papers so boring that his wife fell asleep while he was explaining one of them to her.
MANDATE DEPOSIT IN INSTITUTIONAL REPOSITORIES AND THEY WILL SOLVE THE ACCESS PROBLEM
MT: “[Green OA] necessarily creates two classes of papers: author’s draft and publishers’ final versions.”Actually what Green OA does is provide access to the author’s draft for those who don’t have access to the publisher’s final version. The difference between night and day for those who have no access at all.
No “class” differences. Just a remedy for the difference between the Haves and the Have-Nots.
Negligible loss. Cite by section heading and paragraph number. (Page numbers are obsolescent anyway.)
Add the figure to your author’s draft.
(As Green grows, authors will learn to be more attentive about the needs of the Have-Nots among their potential users in a Green world: Scholarly practice will adapt to the medium, as it always does.)
Gold solves the problem at a hefty price, not just in terms of having to double-pay Gold fees out of scarce research funds, over and above existing subscription fees (which already pay for Green), but also in terms of restrictions on authors’ free choice of journal: a forced choice based on journal economic model instead of journal quality and track-record.
The RCUK Gold-preference mandate also encourages publishers to offer hybrid Gold OA and to extend embargoes on Green OA beyond RCUK limits (to force RCUK authors to pick and pay for Gold), thereby making it gratuitously harder for Have-Not nations (who cannot afford Gold) to mandate and provide Green. Pre-emptive Gold also locks-in journals’ current revenues and modus operandi — and does so even if journals offer a full subscription rebate on all Gold OA revenue.
Mandatory Gold also engenders author resentment and resistance.
In contrast, mandatory Green (if effectively mandated, with compliance verification and as an eligibility condition for research evaluation, as HEFCE has proposed for REF) provides OA for the Have-Nots (about 60% immediate-OA and about 40% Button-mediated Almost-OA for embargoed deposits) at no extra cost, with no constraint on authors’ free choice of journals.
Again, no “class” differences. Just a remedy for the difference between the Haves and the Have-Nots.
There is indeed reason. It’s through the (hybrid) Gold model, which RCUK is perversely reinforcing, that current overall publisher revenues will be locked in (along with double-dipping too, for sure). Even if all Gold payment is given back as a subscription rebate, the total amount paid to publishers remains unchanged.
In contrast, universal Green will force cost-cutting and downsizing by making subscriptions unsustainable:
What those rosy estimates (based on a fantasy of universal conversion of publishers to pure Gold, under pressure from the RCUK mandate!) overlook is the double-payment that must continue while UK subscriptions remain the only way for UK institutional users to access subscription content.
Yes indeed, but that cost-free Gold half is unfortunately not the mainstream international journals that are really at issue in all this, for UK authors and users. And it’s that half that is spuriously lowering the average price of APCs well below what the UK Must-Have journals are charging, especially for hybrid Gold.
Mike, I think you too will eventually come to realize that the only way to attain what we both want — which, is not just embargoed Gratis Green, but an end of embargoes, as much Libre OA as users need and researchers want to provide, license reform, publishing reform, and Gold OA at a fair, affordable, sustainable price — is by first taking the compromise step of universally mandating immediate deposit of the author’s draft in the author’s institutional repository, and then letting Nature take its course.
The only thing standing between us and what we all want is keystrokes. Until we mandate those, there will be little OA of any sort: Gratis or Libre, Green or Gold, immediate or embargoed.
Let me unbaffle you then, Mike:
It’s pushback from publishers, who then intimidate authors as well as institutional lawyers — while also lobbying and intimidating politicians. The result is that no one dares mandate un-embargoed Gratis Green (let alone unembargoed Libre Green), and most authors wouldn’t dare provide it even if it were mandated.
Solution: mandate immediate deposit (no exceptions) and allow (minimal) embargoes on the allowable length of the embargo on access to the deposit. (The “ID/OA mandate.”)
That will ensure that the Have-Nots at least gain 60% immediate OA + 40% Almost-OA (Button-mediated).
And then let Nature take its course. Once the keystrokes are being universally done, what you seek will not be far behind.
But it will take much longer if you delay (embargo!) the universal adoption of the ID/OA compromise mandate by over-reaching instead for what is not within reach, rather than first grasping what is already fully within reach.
That is called letting the “best” get in the way of the better. And in doing so, you are playing into the hands of the publisher lobby, which is also using embargoes (of their own making) along with license restrictions as an excuse for delaying the inevitable transition to OA as long as possible, and making sure it only happens on their terms, preserving their current revenue streams and modus operandi.
Once again: Grasp first what is within immediate reach and the rest will come. Join Finch instead, in deprecating Green, and you will get next to nothing.
Most research-active institutions in the UK (and Europe, and the US and Canada and Australia) already have an IR, but it doesn’t “work” without an (effective) Green OA mandate from funders and institutions.
Any institution is a just piece of free software, some space on a server and some sysad start-up time away from having an IR.
Few researchers are unaffiliated, but for them there is, for example, OpenDepot — which is still just as empty as IRs — for want of mandates…
Most IRs are highly interoperable. Mandate Green OA and they will be even more so.
(And distributed local deposit with central harvesting is not “fragmentation”: it’s the way of the Web! No one deposits directly in Google. The rest is down to metadata, interoperablity, and harvesting. But there’s no incentive to enrich those while the OA content itself is still so impoverished — for lack of mandates.)
You have just casually mentioned OA’s #1 problem for the past 20 years!
But what you forget to say is that even fewer scholars bother to publish in a Gold OA journal.
(With Green deposit {ID/OA], the only deterrent is keystrokes; but with Gold OA there’s price and journal-choice restrictions as further deterrents.)
The remedy is of course mandates. But mandates have to be adopted, and complied with. And that’s why they have to have all the parameters you are lamenting: Gratis, Green, author draft, embargoed. That’s the immediately reachable path of least resistance for mandate adoption and compliance.
But you have stop thinking of what you’d ideally like to have right away, and think practically about how to get it, not spurning approximations and compromises only to end up with next to nothing.
There it is again: Reality for the last 20 years has been at 10-40% OA, and you are dismissing as “miserable” a tried and proven means of generating at least 80%!
I don’t wish my 20 miserable years trying to reach OA on anyone, but maybe a dose would not do you any harm, Mike, to help you appreciate the difference between principled armchair wish-lists and practical delivery.
We don’t have another decade to waste on ineffectual over-reaching. (And that’s what’s been holding OA up for the past two decades too.)
(Your questions here are almost all a litany or repetition of the 38+ causes of Zeno’s Paralysis in this 15-year-old list. I could almost answer them by number!)
No one (except maybe institutional administrators and window-shopping prospective-students or staff) searches at the IR level!
IR metadata (and/or full-texts) are harvested (or imported/exported) at the central harvester/search-engine level (Scirus, BASE, MS Academic Search, Google Scholar) and that’s the level at which they are searched.
The central harvester-level search capabilities can be enriched greatly, and they will be, but there’s absolutely no point doing that now, with the sparse OA content that there is in IRs (or in any repository) today. Without mandates to provide that content, nuclear-powered search (and text-mining) capabilities would be spinning wheels.
(And in case you imagine that the solution is direct deposit in institution-external repositories: far from it. That just makes the problem of mandating OA worse, forcing authors to deposit willy-nilly in institution-external repositories — Arxiv, PMC, EuroPMC, etc. — and prevents institutions from being able to monitor compliance with deposit mandates, whether institutional or funder mandates.)
Most IRs are sensible (though they all make craven — and sometimes excess — efforts to comply with publisher copyright conditions and embargoes).
Once Green OA mandates become sufficiently widespread, IRs will get their acts together. For now, the essential thing is to get papers deposited. Once that is being done, globally, everything else we seek will come, and probably surprisingly quickly.
But not if we continue to carp at minor details like Bath’s overzealousness, as if they were symptoms of ineffectiveness of the Green mandate strategy. They are not. They are simply symptoms of ineffective institutional policy, easily fixed under pressure from other IRs that are doing it right.
As above: IR metadata (and/or full-texts) are harvested (or imported/exported) at the central harvester/search-engine level (Scirus, BASE, MS Academic Search, Google Scholar) and that’s the level at which they are searched.
The central harvester-level search capabilities can be enriched greatly, and they will be, but there’s absolutely no point doing that now, with the sparse OA content that there is in IRs (or in any repository) today.
Without effective mandates to fill the IRs, central search is not much more “decent” than IR-level search: the OA content is simply far too sparse.
Without mandates to provide the content (and motivate the metadata enrichment) rich metadata on impoverished content are no help.
Scirus, BASE, MS Academic Search, Google Scholar and OAIster are all equally useless without the full-text content. The motivation to enrich and conform the IR metadata will grow with the content, not just as an end in itself.
There is in reality only one issue: Getting the keystrokes to be mandated (and hence done). That’s what’s held us up for 20 years, while we ran off in every direction except the one that would get us to our goal.
It is time to pool efforts toward getting institutions and funders worldwide to adopt the Green OA mandates that will get us there. For that we have to stop focussing on fixing frills that are useless until and unless we first get the content deposited, and stop insisting on organic haute cuisine before we have even taken care of the famine of the Have-Nots.
Just to follow up on your point about restrictions on reuse of materials in IRs. The ‘prohibition’ on reproduction in our end user licence is not an extra prohibition we impose. It’s a reminder that for material in copyright, the copyright owner has the exclusive right to reproduce and distribute the work. So you cannot reproduce or further distribute it unless you have either the copyright holders express permission (e.g. given via a CC licence) or are covered by one of the statutory exceptions (e.g. review and criticism, non-commercial research or private study).
Thanks for your posting, Steven Harnad, on Mike Taylor’s blog. It is dispiriting that there are academics who cannot see the broader strategic issue, which is the established publishers with very big margins exploiting free academic labour and charging them for the privilege; this not only limits access but reduces funds available to research and other legitimate academic activities (like maintaining professional associations). Instead of finding fault with IR and Green Open Access, let’s find solutions and work to the broader (and, yes, longer-term) strategic goal. There are also other problems with OA – like predatory publishers, and publishing charges – but they should not stop us proceeding down that track over the long haul.