HDInsight HBase: Migrating to new HDInsight version

The following are short steps to upgrade your HDInsight HBase cluster with minimal downtime. Before you migrate, please note that there may be incompatibilities between HBase major/minor versions, and the steps below only work if there are no version compatibility issues between the source and destination clusters. We recommend that you review the HBase book before undertaking an upgrade…


Xbox: Analytics on petabytes of gaming data with Azure HDInsight

Cross post from https://azure.microsoft.com/en-us/blog/how-xbox-uses-hdinsight-to-drive-analytics-on-petabytes-of-telemetry-data/ Microsoft Studios produces some of the world’s most popular game titles, including the Halo, Minecraft, and Forza Motorsport series. The Xbox product services team manages thousands of datasets and hundreds of active pipelines consuming hundreds of gigabytes of data each hour for first-party studios. Game developers need to know the health…


Azure HDInsight Integration with Azure Log Analytics is now generally available

Cross post from https://azure.microsoft.com/en-us/blog/azure-hdinsight-integration-with-azure-log-analytics-is-now-generally-available/ I am excited to announce the general availability of HDInsight Integration with Azure Log Analytics. Azure HDInsight is a fully managed cloud service for customers to do analytics at scale using the most popular open-source engines such as Hadoop, Hive/LLAP, Presto, Spark, Kafka, Storm, HBase, etc. Thousands of our customers…


General availability of HDInsight Interactive Query – blazing fast queries on hyper-scale data

Cross post from https://azure.microsoft.com/en-gb/blog/general-availability-of-hdinsight-interactive-query-blazing-fast-data-warehouse-style-queries-on-hyper-scale-data-2/ It’s 2017, and big data challenges are as real as they get. Our customers have petabytes of data living in elastic and scalable commodity storage systems such as Azure Data Lake Store and Azure Blob storage. One of the central questions today is finding insights from data in these storage systems…


Azure HDInsight 3.6 – Five things that will make a data developer happy

Working with Hive, I regularly find myself staring at csv/tsv/json files, wondering where to start… Hive View 2.0 is a new web experience in HDInsight 3.6 that greatly simplifies many common Hive tasks and makes it easy to author and debug Hive queries. In this post, we will look into 5 key features that…


Hive Metastore in HDInsight – Tips, Tricks & Best Practices

When you create a Hive table, the table definition (column names, data types, comments, etc.) is stored in the Hive Metastore. The Hive Metastore is a critical part of the Hadoop architecture, as it acts as a central schema repository which can be used by other access tools like Spark, Interactive Hive (LLAP), Presto, Pig and many other…


HDInsight HBase: 9 things you must do to get great HBase performance

Cross post from ADL blog. This post is based on learnings from numerous HDInsight HBase customer interactions. HBase is a fantastic high-end NoSQL big data machine that gives you many options to get great performance; there is no shortage of levers that you can tweak to further optimize it. Below is the general list of impactful considerations…


Apache HBase/Phoenix – Tips, Tricks & Best Practices in Azure HDInsight

We will keep this page updated with commonly asked questions related to HDInsight HBase/Phoenix. You can leave comments/questions on this blog. Also, the official channel to provide HDInsight-related feedback and make feature requests is here. What is the advantage of using HBase in Azure HDInsight? Azure HDInsight HBase – A NoSql database like no other …


Azure HDInsight HBase – A NoSql database like no other

Apache HBase is an open-source NoSQL database that provides real-time read/write access to large datasets. Facebook’s messaging infrastructure, Apple’s Siri, Bloomberg’s price history service, and the world’s largest biometric identity system, India’s Aadhaar, all run on HBase. HBase has a fantastic track record of success at the highest levels of data scale,…