Avkash Chauhan's Blog

Windows Azure, Windows 8, Cloud Computing, Big Data and Hadoop: All together at one place.. One problem, One solution at One time...

Windows Azure CDN and Referrer Header

The Windows Azure CDN, like any other CDN, attempts to be a transparent caching layer. The...

Author: Avkash Chauhan - MSFT Date: 02/27/2012

What is ignoreRoleInstanceStatus setting in Windows Azure?

ignoreRoleInstanceStatus is described in WebRole and WorkerRole Schema as below...

Author: Avkash Chauhan - MSFT Date: 02/23/2012

Windows Azure Blob Upload Scenarios

The Windows Azure Blob storage API provides the following scenarios for uploading a blob: Scenario [1]:...

Author: Avkash Chauhan - MSFT Date: 02/21/2012

Keys to understand relationship between MapReduce and HDFS

Map Task (HDFS data localization): The unit of input for a map task is an HDFS data block of the...

Author: Avkash Chauhan - MSFT Date: 02/15/2012

Solving SSL Certificate expiration problem with an existing Windows Azure Application

Recently I was working on an issue where the SSL certificate had expired and, because of it, the user...

Author: Avkash Chauhan - MSFT Date: 02/14/2012

Hadoop Performance: How storage disk types in individual node will impact the job performance?

As you may already know, a Hadoop cluster is network and disk I/O intensive. Recently I was...

Author: Avkash Chauhan - MSFT Date: 02/14/2012

How many copies of your blob are stored in Windows Azure Blob Storage?

I was recently asked: if someone stores their content in Windows Azure Storage, how secure is it...

Author: Avkash Chauhan - MSFT Date: 02/09/2012

Internals of Hadoop Pig Operators as MapReduce Job

I was recently asked to show that Pig scripts are actually MapReduce jobs, so to explain it in very...

Author: Avkash Chauhan - MSFT Date: 02/08/2012

Which one to choose between Pig and Hive?

Technically they will both do the job; you are looking at it from an "either Hive or Pig" perspective, which means...

Author: Avkash Chauhan - MSFT Date: 02/07/2012

Handling two known issues with Windows Azure node.js SDK 0.5.2

My recent development work with Windows Azure Node.js SDK 0.5.2 helped me find two issues. Until...

Author: Avkash Chauhan - MSFT Date: 01/31/2012

NougakuDoCompanion: A “Ruby on Rails” companion for Windows Azure

I was recently informed about a great “Ruby on Rails” companion package to run your Ruby...

Author: Avkash Chauhan - MSFT Date: 01/31/2012

Customizing your Hadoop cluster running on your own Windows Azure Subscription

In this article we will learn how to configure the same Hadoop cluster to do some customization. To...

Author: Avkash Chauhan - MSFT Date: 01/29/2012

Creating your own Hadoop cluster on Windows Azure by using your own Windows Azure Subscription account

[As of now this functionality is not available with Hadoop on Windows Azure. These instructions are...

Author: Avkash Chauhan - MSFT Date: 01/28/2012

Windows Azure Troubleshooting - Taking specific Windows Azure Instance offline

When you have lots of instances running on Windows Azure, you may need to investigate issues on a...

Author: Avkash Chauhan - MSFT Date: 01/27/2012

Setting Amazon S3 Storage as data source (s3n://) in Hadoop on Azure (hadooponazure.com) portal

To get your Amazon S3 account set up with an Apache Hadoop cluster on Windows Azure, you just need your...

Author: Avkash Chauhan - MSFT Date: 01/27/2012

Real World Windows Azure Guidance: Troubleshooting Best Practices for Developing Windows Azure Applications

One of my teammates, William Bellamy, has written a great article “Troubleshooting Best Practices...

Author: Avkash Chauhan - MSFT Date: 01/26/2012

Node.js and Windows Azure: Creating a blog application using Node.JS and Windows Azure Table & Blob Storage Part 1

In this example I will create a node.js based blog application which will store all the blog...

Author: Avkash Chauhan - MSFT Date: 01/25/2012

Understanding Map/Reduce job in Apache Hadoop on Windows Azure (A Reverse Approach)

When you run a Map/Reduce job in a Hadoop cluster on Windows Azure, you will get an aggregated progress...

Author: Avkash Chauhan - MSFT Date: 01/20/2012

Windows Azure application VM and (virtual) IP Address

From time to time, I get involved with our Windows Azure partners to discuss IP address configuration...

Author: Avkash Chauhan - MSFT Date: 01/18/2012

Setting Windows Azure Blob Storage (asv) as data source directly from Portal at Hadoop on Azure

After you log in to your Hadoop portal and configure your cluster, you can select “Manage...

Author: Avkash Chauhan - MSFT Date: 01/13/2012

By default, the application drive in a Windows Azure VM has a maximum size limit of 1GB with any...

Author: Avkash Chauhan - MSFT Date: 01/13/2012

With Azure SDK 1.6, Azure Diagnostics is enabled by default and can cause thousands of daily transactions to Azure Storage

In a Windows Azure VM, diagnostics is enabled by default so that basic logs can be collected and transferred...

Author: Avkash Chauhan - MSFT Date: 01/11/2012

Running Apache Pig (Pig Latin) at Apache Hadoop on Windows Azure

Microsoft Distribution of Apache Hadoop comes with Pig Support along with an Interactive JavaScript...

Author: Avkash Chauhan - MSFT Date: 01/10/2012

Accessing Windows Azure blob Storage as Network Drive

If you want to access Windows Azure Blob Storage the same way as a drive on your development machine, you...

Author: Avkash Chauhan - MSFT Date: 01/10/2012

Apache Hadoop on Windows Azure: Few tips and tricks to manage your Hadoop cluster in Windows Azure

In a Hadoop cluster, the namenode communicates with all the other nodes. Apache Hadoop on Windows Azure...

Author: Avkash Chauhan - MSFT Date: 01/06/2012

Apache Hadoop on Windows Azure: Connecting to Windows Azure Storage from Hadoop Cluster

The Microsoft distribution of Apache Hadoop comes with direct connectivity to cloud storage, i.e. Windows...

Author: Avkash Chauhan - MSFT Date: 01/05/2012

Apache Hadoop on Windows Azure: How Hadoop cluster was setup on Windows Azure

Once you provide the following information to set up your Hadoop cluster in Azure: Cluster DNS Name Type...

Author: Avkash Chauhan - MSFT Date: 01/04/2012

Assess your SQL Server to SQL Azure migration using SQL Azure Compatibility Assessment Tool by SQL Azure Labs

The SQL Azure team announced today the release of a new experimental cloud service, "SQL Azure...

Author: Avkash Chauhan - MSFT Date: 01/04/2012

Apache Hadoop on Windows Azure Part 10 - Running a JavaScript Map/Reduce Job from Interactive JavaScript Console

The Microsoft distribution of Apache Hadoop on Windows Azure lets you run JavaScript Map/Reduce jobs...

Author: Avkash Chauhan - MSFT Date: 01/03/2012

Apache Hadoop on Windows Azure Part 9 – Using Interactive JavaScript for Data Visualization

Apache Hadoop on Windows Azure is integrated with a web-based interactive JavaScript console, which...

Author: Avkash Chauhan - MSFT Date: 01/02/2012

Apache Hadoop on Windows Azure Part 8 – Hadoop Map/Reduce Administration from command line in Cluster

After you have created your Hadoop cluster in Windows Azure, you can remote into it to start the...

Author: Avkash Chauhan - MSFT Date: 01/01/2012

Wish you all a very Happy New Year 2012!!

Dear Readers, Thank you so much for all your support and comments!! You are the inspiration behind...

Author: Avkash Chauhan - MSFT Date: 01/01/2012

Apache Hadoop on Windows Azure Part 7 – Writing your very own WordCount Hadoop Job in Java and deploying to Windows Azure Cluster

In this article, I will help you write your own WordCount Hadoop job and then deploy it to Windows...

Author: Avkash Chauhan - MSFT Date: 12/31/2011

Apache Hadoop on Windows Azure Part 6 - Running 10GB Sort Hadoop Job with TeraSort Option and understanding MapReduce Job administration

In this section we will run the same 10GB sorting Hadoop job with TERASORT option. With TeraSort...

Author: Avkash Chauhan - MSFT Date: 12/30/2011

Apache Hadoop on Windows Azure Part 5 - Running 10GB Sort Hadoop Job with Teragen, TeraSort and TeraValidate Options

This example consists of the 3 map/reduce applications that Owen O'Malley and Arun Murthy used to win...

Author: Avkash Chauhan - MSFT Date: 12/30/2011

Apache Hadoop on Windows Azure Part 4- Remote Login to Hadoop node for MapReduce Job and HDFS administration

When you are running an Apache Hadoop job in Windows Azure, you have the ability to remote into the main...

Author: Avkash Chauhan - MSFT Date: 12/29/2011

Apache Hadoop on Windows Azure Part 3 - Creating a Word Count Hadoop Job with a few twists

In this example I am starting a new Hadoop job with a few intentional errors to understand the...

Author: Avkash Chauhan - MSFT Date: 12/29/2011

Apache Hadoop on Windows Azure Part 2 - Creating a Pi Estimator Hadoop Job

Once you have created a cluster in Windows Azure, you will have a few prebuilt samples provided in...

Author: Avkash Chauhan - MSFT Date: 12/29/2011

Apache Hadoop on Windows Azure Part 1- Creating a new Windows Azure Cluster for Hadoop Job

Once you have applied for an Apache Hadoop on Windows Azure CTP account, you can create a new cluster...

Author: Avkash Chauhan - MSFT Date: 12/28/2011

Top 12 Articles on Cloud Services and Big Data on Windows Azure in December

Windows Azure Cloud Services Newly Designed Windows Azure Developer Center Article Tutorial: Running...

Author: Avkash Chauhan - MSFT Date: 12/28/2011

How to Modify Registry keys in Windows Azure Virtual Machine from a web or worker role?

If you have a requirement and have decided to modify VM registry keys, you have two options: Do it from a...

Author: Avkash Chauhan - MSFT Date: 12/23/2011

Windows Azure: Hands on Lab for Moving Applications to the Cloud

The Windows Azure team created a detailed hands-on lab to help everyone who wants to move their...

Author: Avkash Chauhan - MSFT Date: 12/19/2011

Windows Azure Resource: A Guide to Claims-Based Identity and Access Control, Second Edition - eBook Download

Map of the book: What is this book about: An Introduction to Claims explains what a claim is and...

Author: Avkash Chauhan - MSFT Date: 12/19/2011

Resources to write .Net based MapReduce jobs for Hadoop using F#

What is Hadoop Streaming: Start learning about Hadoop Streaming:...

Author: Avkash Chauhan - MSFT Date: 12/18/2011

Windows Azure Plugin for Eclipse with Java, December 2011 CTP is now available

Windows Azure Plugin for Eclipse with Java, December 2011 CTP is now available for public download...

Author: Avkash Chauhan - MSFT Date: 12/18/2011
