Just enough Azure for Hadoop – Part 4

This blog is part 4 of a series that covers relevant Azure fundamentals – concepts/terminology you need to know, in the context of Hadoop.  While the first three touched on Azure infrastructure aspects, this one covers Azure PaaS Data Services.  There are a number of them – I have touched on the ones relevant.  In…


Just enough Azure for Hadoop – Part 2

This blog is part 2 of a series that covers relevant Azure fundamentals – concepts/terminology you need to know, in the context of Hadoop.  Some of the content is a copy of Azure documentation (full credit to the Azure documentation team).  I have compiled relevant information into a single post, along with my commentary, to…


Just enough Azure for Hadoop – Part 3

This blog is part 3 of a series that covers relevant Azure fundamentals – concepts/terminology that you need to know, in the context of Hadoop.  Some of the content is a copy of Azure documentation (full credit to the Azure documentation team).  I have attempted to compile relevant information into a single post, along with…


Just enough Azure for Hadoop – Part 1

Motivation for this blog… On my last day at my former workplace, where I mostly worked on customer Hadoop projects on AWS, a colleague got a project that involved provisioning an IaaS Hadoop cluster on Azure; We were stumped and scrambling to figure out – there was no guide with just enough information about Azure…


Provisioning a Cloudera Hadoop cluster on Azure

This post covers how to provision a Cloudera-certified Hadoop IaaS cluster on Azure, for Production, from the Azure Preview Portal using an Azure Resource Manager template available in the marketplace that was developed by Cloudera.   At the time of writing the blog, the CDH version was 5.6.0. Details covered are: 1.  Cluster options 2.  Instructions on provisioning…