HDInsight : BUILD Hive Lab is available now

Got some time to learn Big Data Technologies? How about starting with Hive which is considered the de facto standard for SQL queries in Hadoop We just released  HDInsight labs used during the BUILD conference code challenge. You will need 2 things to run these labs 1- HDInsight Cluster – How to create? 2- Step by Step Instructions…


Cloudera clusters now run with Azure Data Lake Store

We are excited to announce that with today’s release of Cloudera Enterprise 5.11 you can now run Spark, Hive, and MapReduce workloads in a Cloudera cluster on Azure Data Lake Store (ADLS). Cloudera customers can now take advantage of the many benefits of running clusters on ADLS. And ADLS brings to its customers another valuable…

0

Azure HDInsight 3.6 – Five things that will make a data developer happy

Working with Hive, I regularly find myself staring at a csv/tsv/json files wondering where to start…. Hive View 2.0 is a new Web Experience in HDInsight 3.6 that greatly simplifies many common Hive Tasks and makes it easy to author and debug hive queries. In this post, we will look into 5 key feature that…


Apache HBase/Phoenix – Tips , Tricks & Best Practices in HDInsight

We will keep this page updated with HDInsight HBase/ Phoenix related commonly asked questions. You can leave comments/questions on this blog. Also, official channel to provide HDInsight related feedback and make feature requests is here What is the advantage of using HBase in Azure HDInsight? Azure HDInsight HBase – A NoSql database like no other  …


HDInsight HBase: How to Improve HBase cluster restart time by Flushing tables?

This blog is written by Nitin Verma, Sr. Software Engineer, HDInsight. Do you restart or re-create your HDInsight HBase clusters often? and wished restart/re-create times were faster? if yes, please read on- This blog introduces a new script for HDInsight HBase service through which you can flush the MemStore of all HBase tables conveniently. The script…