A Tale from the Replication Crypt

I got an email this morning from a colleague asking for the replication files for a paper I published in 2005 (PDF). Sheepishly, I had to admit that I didn’t have them.

Data-sharing and replication weren’t the professional norm in political science 10 years ago. Best I can recall, it never even occurred to me to put the files where future me could easily find them. I did the research, submitted the paper, and moved on to the next project. During peer review, no one asked to see the data and .do files I used, and the email I got today was, I think, the first time anyone had asked for them.

I’ve probably changed PCs three or four times in the intervening decade and haven’t kept all of the retired machines. I spent some time this afternoon looking on a DVD with files from one of those out-to-pasture PCs, but to no avail. Now, I’m staring at a frozen blue Microsoft ScanDisk screen on a laptop running Windows 98 and realizing that this path is probably a dead end, too. Those were all my options.

There’s a simple lesson here: if you’re going to do something you want to construe as science, you need to store your data—quantitative, qualitative, audio, imagery, whatever—where you can easily find and share it in perpetuity.

That’s a helluva lot easier now than it was 10 years ago, thanks to things like GitHub, Google Drive, Dataverse, and various other backup and cloud-storage services. It still doesn’t happen by itself, though. You still have to choose to do it. Today, I’m relearning why that’s important—for science, of course, but also for my professional reputation.


  1. Grant

     /  January 17, 2014

    I’m not going to say that people shouldn’t use cloud storage; indeed, it’s shown itself to be useful. But people should remember that the storage is only as reliable and lasting as the company behind it.

  2. abedgell

     /  January 18, 2014

    The newest issue of PS: Political Science & Politics has an excellent symposium on replication and data sharing. You can access it here [Gated]: https://journals.cambridge.org/action/displayJournal?jid=PSC
