Many recent posts in the data science media have emphasised the importance of the modern data scientist, who knows not only statistics, machine learning and programming with R and Python, but also cloud technology, code parallelisation and software development for designing better