When researchers reach the point of actually writing up their analyses, the writing can often centre around the data itself. Howard Aldrich argues this kind of “data first” strategy to writing goes against the spirit of disciplined inquiry and also severely limits creativity and imagination. Literature reviews and conceptual planning phases in particular would benefit if researchers explored the range of ideas associated with their study, rather than the constraining reality of data limitations.
At a conference, when you ask somebody to tell you about their current project, what do they typically say? I often get a puzzling response: instead of beginning by telling me about an idea, the person starts by describing their data. They tell me they are using survey data they have collected, or data from an archive, or data they’ve scraped from the web. As they go on at length about the nature of the data, I have to interrupt them and ask for what purpose the data will be used. Then, I’m likely to get a description of an analytic method or computer software. It’s almost as if they have devoted most of their working hours to thinking about what they can do with the data they have collected –– or will collect –– and very little time to the question of where their project fits into some larger scheme.
First and most obviously, beginning with data considerations may lead to the unintended outcome of writing a theoretical framework and conceptual model, complete with hypotheses, that are totally framed around what the data permits. In the worst-case scenario, this can resemble the kinds of narratives corporate historians write when they begin with what they know about their firms in the present and then build a story to suit. Researchers may anticipate journal reviewers’ biases toward “significant” results and may simply wait to begin writing their story until they’ve conducted preliminary analyses. I’ve realized that this response partially explains why many graduate students have such a difficult time in writing a thesis proposal. Two kinds of problems result from a “data first” strategy.
In the writing workshops that I offer at conferences, I often have students tell me that they wait to write the introduction to their paper or thesis until after they’ve done the “analysis and results” section. This is certainly a safe strategy to follow if one wants to economize on doing multiple drafts of a paper, but it goes against the spirit of disciplined inquiry that we try to engender in our theory and methods classes.
Image credit: Whiteboard Startup Stock Photos pixabay public domain
Second and far more damaging from my point of view, following a data first strategy severely constrains creativity and imagination. Writing a theoretical introduction and conceptual model that is implicitly tailored to a specific research design or data set preemptively grounds any flights of fancy that might have tempted an unconstrained author. By contrast, beginning with a completely open mind in the free writing phase of preparing a proposal or paper allows an author to pursue promising ideas, regardless of whether they are “testable” with what is currently known about available data.
When I say “write as if you don’t have the data,” I’m referring to the literature review and planning phase of a project, preferably before it has been locked into a specific research design. Writing about ideas without worrying about whether they can be operationalized –– whether in field work, surveys, or simulations –– frees authors of the burden they will eventually face in writing their “methods” section. Eventually, a researcher will have to explain what compromises have been made, given the gap between the ideas they set out to explore and the reality of data limitations, but that bridge will be crossed later. Rushing over that bridge during the idea generation stage almost guarantees that the journey will be a lifeless one.
Even if someone is locked into a mentor’s or principal investigator’s research design and data set, I would recommend they still begin their literature review and conceptual modeling as if they had the luxury of a blank slate. In their initial musings and doodles, as they write interpretive summaries of what they read, they might picture a stone wall that temporarily buffers them from the data obligations that come with their positions as data supplicants. Writing without data constraints will, I believe, free their imaginations to range widely over the realm of possibilities, before they are brought to earth by practical necessities.
So, the next time someone asks you about what you are working on, don’t begin by talking about the data. Instead, tell them about the ideas that emerged as you wrote about the theories and models that you would like to explore, rather than about the compromises you will eventually be forced to make. The conversation will be a lot more interesting for both of you!
This piece originally appeared on the author’s personal blog and is reposted with permission.
Note: This article gives the views of the author(s), and not the position of the LSE Impact blog, nor of the London School of Economics.
Howard E. Aldrich is Kenan Professor of Sociology, Adjunct Professor of Business at the University of North Carolina, Chapel Hill, Faculty Research Associate at the Department of Strategy & Entrepreneurship, Fuqua School of Business, Duke University, and Fellow, Sidney Sussex College, Cambridge University. His main research interests are entrepreneurship, entrepreneurial team formation, gender and entrepreneurship, and evolutionary theory.
This is very interesting. We’ve once tried writing a Nature Neuroscience paper this way once, where we had a specific hypothesis that we tried evaluating in the data. In a nutshell: had to do with a certain variable correlating with the differences between 2 variables: A and B, but with no clear prediction about whether there would be a significant difference between A and B themselves (the paper was about the predicted correlation). Here’s a reviewer’s response that I kept in my ‘don’t forget’ file..
“It [The manuscript] espouses the unfortunate narrative stye of “we did this, then this didn’t work, then we tried that, or this led to the next thing, so then we thought we should do z”, etc. Just tell the shortest, most clear story about the actual scientific results taken in total, rather than about your process in reaching there!”
I told my post-doc to chill out, and have taken the point that high impact journals in our field will only accept elegant stories about data. God forbid your hypothesis ended in a null result. And don’t get me started on any descriptive data being described as double dipping. We’ve become a group of slogan throwers, thinking in schematic and rigid ways established by a few powerful trend setters.