Still Relevant: “Analyzing the Analyzers”

I just happened across Analyzing the Analyzers and while it is several years old, I found it to be still relevant. Congratulations to Harlan D. Harris, Sean Patrick Murphy and Marck Vaisman for writing something that stood the test of time. Anyone doing a survey of data scientists should look at this book.

There are several fascinating charts based on a survey of self-identified data scientists. However, I was fascinated to see that most data scientists weren’t working with big data (see the chart below). I assume that in 2015 a much larger percentage of data scientists are working on a terabyte or petabyte scale.

fromAnalyzingtheAnalysts-OReilly
Respondents Working With Different Scales of Data, by Primary Skills Group