• Open Data Handbook Revised

    Originally published in 2012 the “Open Data Handbook” has been revamped to inspire Open Data Newcomers. The new version of the online site allows access to: The Open Data Book Value Stories Resource Library For those who have not yet been introduced to the Open Data Handbook, the below is a good overview of what…

  • DevOps Survey 2015 Underway

    Puppet Labs are running their DevOps Survey again this year. The survey takes around 15 minutes to complete. It will be interesting to see how DevOps has changed over the past 12 months from the 2014 report which found that: High-performing organizations are still deploying code 30 times more frequently, with 50 percent fewer failures…

  • Open Source Web Crawlers and Data Sets

    A great list of 50 Open Source Web Crawlers has been produced by Baiju NT on a Big Data Blog Web Crawlers are useful in gathering data from other sites when performing research, although caution should be used as with today’s levels of protection some sites defenses may consider your data gathering as an attack.…

  • Raspberry PI Development Projects – Are these the next big thing?

    Element 14 has been running a competition recently called “Sci Fi Your PI” which looked to make science out of science fiction using Raspberry PI’s. 25 challengers have now been selected to continue forward. The ideas that have been selected are both creative and interesting. Below is the list of the 25 chosen with their…

  • Microsoft Ignite Conference

    Hot on the Heels of the Microsoft Build Conference, the Microsoft Ignite Conference is under way. Once again the conference is being streamed with sessions online for those who cant get to the conference. You can follow the conference at http://ignite.microsoft.com/ Replays available here http://ignite.microsoft.com/Sessions

  • Microsoft Build Conference

    The Microsoft Build conference is in fully swing with Day 2 coming up. Although I am not there in person, its good to see that the sessions are being streamed and recorded. You can follow the conference at http://www.buildwindows.com/ Replays available here There is already a lot of news stories coming out on the latest Microsoft…

  • Data Mining Courses

    Via Coursera the University of Illinois at Urbana-Champaign is running a specialisation on Data Mining.  As with all Coursera courses, you don’t have to take the specialisation, but can take the courses individually or one after each other. Taking the courses outside of the specialisation means that you wont get to complete the capstone project and…

  • Big Data – 4V’s + Verification

    IBM have released an Infographic on the “Four V’s of Big Data” which covers: Volume – Scale of Data Variety – Different forms of Data Velocity – Analysis of Streaming Data Veracity – Uncertainty of Data There should be another V for “Verification” which covers the questions you ask of the data in order to…

  • Faster Smaller Raspberry Pi Cluster

    Following on from my blog “Race to the largest Raspberry Pi Cluster“, James J. Guthrie has built a 3 node cluster out of Raspberry Pi 2’s using 3 nodes, out performing the 64 node cluster. The Iridis Pi has a peak CPU performance benchmark was around 1 GIGAFLOPS (floating point operations per second) The cluster presented…

  • R {swirls} – Learning R by doing

    A swirl is an interactive way of learning R by installing a package called {swirl} into R and then installing a course. I have used swirls in the Data Science Courses on Coursera and found them a useful way of learning and testing your knowledge. swirl is installed as a package into R using the following command…