Nodes in HDInsight

Knowing the types and functions of nodes in HDInsight is key to taking full advantage of the service. This article is aimed at users who are familiar with big data concepts but are newer to HDInsight. Please feel free to read the article and provide me feedback even if you’re beyond the target audience for…


Apache HBase/Phoenix – Tips , Tricks & Best Practices in HDInsight

We will keep this page updated with HDInsight HBase/ Phoenix related commonly asked questions. You can leave comments/questions on this blog. Also, official channel to provide HDInsight related feedback and make feature requests is here What is the advantage of using HBase in Azure HDInsight? Azure HDInsight HBase – A NoSql database like no other  …


HDInsight HBase: How to Improve HBase cluster restart time by Flushing tables?

This blog is written by Nitin Verma, Sr. Software Engineer, HDInsight. Do you restart or re-create your HDInsight HBase clusters often? and wished restart/re-create times were faster? if yes, please read on- This blog introduces a new script for HDInsight HBase service through which you can flush the MemStore of all HBase tables conveniently. The script…


HDInsight HBase: 9 things you must do to get great HBase performance

HBase is a fantastic high end NoSql BigData machine that gives you many options to get great performance, there are no shortage of levers that you can’t tweak to further optimize it. Below is the general list of impact-full considerations for great HBase performance in HDInsight Don’t have HDInsight HBase cluster yet ? don’t worry…


HDInsight -New self-paced trainings and labs

This week Microsoft Learning Experiences released/updated 3 HDInsight courses ( These are free , $49 if you need a course Certificate) Create HDInsight cluster Processing Big Data with Azure HDInsight Start course More and more organizations are taking on the challenge of analyzing big data. This course teaches you how to use the Hadoop technologies…


HDinsight – How to use Spark-HBase connector?

Apache Spark is an open-source parallel processing framework that supports in-memory processing to boost the performance of big-data analytic applications. Azure HDInsight offers a fully managed Spark service with many benefits. Apache HBase is an open Source No SQL Hadoop database, a distributed, scalable, big data store. It provides real-time read/write access to large datasets….

1