HDInsight : BUILD Hive Labs are Available Now

Got some time to learn Big Data Technologies? How about starting with Hive which is considered the de facto standard for SQL queries in Hadoop We just released  HDInsight labs used during the BUILD conference code challenge. You will need 2 things to run these labs 1- HDInsight Cluster – How to create? 2- Step by Step Instructions…


Provision HBase cluster with Azure Data Lake Store in a few easy steps

Last week, we announced public preview of HDInsight HBase with Azure Data Lake Store. You can read the announcement here With this release there is virtually no limit on amount of data you can store in your HBase cluster. I created a GIF that will guide you on how to create HBase Cluster with Azure Data…


Azure HDInsight: How to run Presto in one simple step and query across data sources such as Cosmos DB, SQL DB & Hive

I have seen in past few months many inquiries on how to run Presto in HDInsight In this post we have provided an easy way for you to install Presto in HDInsight as well as configure various data sources which Presto can query. One of the unique advantages of HDInsight is decoupling between storage and…


Exposing Hive!

I sat down with Justin Scott (Application Development Manager at Microsoft working with our top customers) to talk about Apache Hive and where it’s heading. You can listen to channel 9 podcast now


Azure HDInsight 3.6 – Five things that will make a data developer happy

Working with Hive, I regularly find myself staring at a csv/tsv/json files wondering where to start…. Hive View 2.0 is a new Web Experience in HDInsight 3.6 that greatly simplifies many common Hive Tasks and makes it easy to author and debug hive queries. In this post, we will look into 5 key feature that…


Hive Metastore in HDInsight –Tips, Tricks & Best Practices

When you create a Hive table, the table definition (column names, data types, comments, etc.) are stored in the Hive Metastore. Hive Metastore is critical part of Hadoop architecture as it acts as a central schema repository which can be used by other access tools like Spark, Interactive Hive (LLAP), Presto, Pig and many other…


HDInsight HBase: 9 things you must do to get great HBase performance

Cross post from ADL blog This post is based on learnings from numerous  HDInsight HBase customer interactions. HBase is a fantastic high end NoSql BigData machine that gives you many options to get great performance, there are no shortage of levers that you can’t tweak to further optimize it. Below is the general list of impact-full considerations…


HDInsight -New self-paced trainings and labs on Hadoop, Hive, HBase, Spark & Storm

cross post from https://blogs.msdn.microsoft.com/azuredatalake/2016/08/28/hdinsight-new-self-paced-trainings-and-labs/ This week Microsoft Learning Experiences released/updated 3 HDInsight courses ( These are free , $49 if you need a course Certificate) Create HDInsight cluster Processing Big Data with Azure HDInsight Start course More and more organizations are taking on the challenge of analyzing big data. This course teaches you how to…


Apache HBase/Phoenix – Tips , Tricks & Best Practices in Azure HDInsight

We will keep this page updated with HDInsight HBase/ Phoenix related commonly asked questions. You can leave comments/questions on this blog. Also, official channel to provide HDInsight related feedback and make feature requests is here What is the advantage of using HBase in Azure HDInsight? Azure HDInsight HBase – A NoSql database like no other  …

0