Author: Ed Thewlis
Continuous Integration with Python
Confession time… I missed something really important! Most CI tools don’t work too well with Python. Here’s something that’ll help you overcome some of the issues you’ll come across…
Ed Thewlis, 3rd January 2017
Announcing DataShed Labs event – getting hands-on with Apache Spark & Hadoop
In an effort to give developers a view of the tools and technologies that we think are worth trying, we’re hosting a series of hands-on events in Leeds.
Ed Thewlis, 17th February 2016
Cloudera Director 2.0 – Truly cloud ready?
Cloudera seems to have made some huge leaps forward in terms of enterprise integration, workload automation and High Availability in Cloudera Director 2.0…
Ed Thewlis, 22nd January 2016
Why has the world gone Open Source Mad?
Open Source software seems to be taking over. But why? And what does Steve Ballmer think about this?
Ed Thewlis, 5th January 2016
How to increase the ROI of a data warehouse
Data warehouse projects are expensive!
Ed Thewlis, 2nd December 2015
Why bother with data visualisation?
Sometimes, the visualisation can be something as simple as a chart showing how a measure has changed over time. Other times, it can be as complex as those created by Mike Bostock of the New York Times, the creator of d3.
Ed Thewlis, 25th August 2015
Do I make the pipe bigger, or buy more pipes?
Scale up vs scale out is a pretty standard conundrum nowadays, particularly with the advent of distributed compute models such as Hadoop. A few years ago, the standard approach in the Business Intelligence & data management world was simply to throw more hardware at it.
Ed Thewlis, 2nd February 2015
We are data scientist
I was reading an interesting article today on the skills required to be a 'Data Scientist'... All very interesting!
Ed Thewlis, 5th September 2014