Epidemiology

Skip Navigation LinksHome > Blogs > The Editors of Epidemiology > A TIDBIT by Jay Kaufman
The Editors of Epidemiology
The goal of this blog is to help EPIDEMIOLOGY authors produce papers that clearly and effectively communicate their science.
Wednesday, December 8, 2010
A TIDBIT by Jay Kaufman
As I mentioned in the editorial announcing this blog, I am open to ideas, great and small to blog about. This is one volunteered by Jay Kaufman:
 
“DataThief”
 
“DataThief III is a program to extract (reverse engineer) data points from a graph.  Typically, you scan a graph from a publication, load it into DataThief, and save the resulting coordinates, so you can use them in calculations or graphs that include your own data.”  [from the link]
 
What a nifty idea!  For many teaching examples, we’d like to analyze data presented in graphs and figures. It can be a painstaking process to read the data points off of the image, especially if the type is small or the resolution poor. We asked a colleague to do a validation run of test cases (where the original data were available). He reports that, while not completely user-friendly (getting the marker exactly on the data point isn't easy), the answers on a test case with known values were within rounding error. With some effort this could really come in handy.
 
 
Jan P Vandenbroucke
 
If you like to comment, Email me directly at epidemiologyblog@gmail.com or submt your comment via the journal which requires a password protected login. Unfortunately, comments are limited to 1000 characters.
12/8/2010
blog reader said:
Arnout Standaert writes: The idea of using software to digitize published data is pretty old, and several software packages exist. It’s indeed a great idea to improve the accuracy and availability of background information in research. One thing that slightly bothers me is the mentioned software, Data Thief. Research should be an open process, relying on transparency, open standards, and open tools. That includes open source software. While Data Thief is without a doubt a useful piece of software, it is shareware and usage is prohibited without payment. However, there are perfectly fine open source alternatives. One that I have used myself and appreciate is Engauge Digitizer, available at http://digitizer.sourceforge.net/. It would benefit an open science process to promote not only commercial tools, but also the open source alternatives. This not only allows everyone to enjoy the benefits without cost, it also allows anyone to contribute to the tools and improve them as desired.
About the Author

Editors
A team of EPIDEMIOLOGY editors, led by Deputy Editor Emily DeVoto (emily.devoto@epidemiology-journal.com), contribute to this blog.

Blogs Archive
Share