Ten tools to analyze big data faster

Customers use HDInsight Interactive Query (also called Hive LLAP, or Low Latency Analytical Processing) to query data stored in Azure storage & Azure Data Lake Storage in super-fast manner. Interactive query makes it easy for developers and data scientist to work with the big data using BI tools they love the most. HDInsight Interactive Query…


Azure HDInsight Performance Insights: Interactive Query, Spark and Presto

Cross post from https://azure.microsoft.com/en-us/blog/hdinsight-interactive-query-performance-benchmarks-and-integration-with-power-bi-direct-query/ Fast SQL query processing at scale is often a key consideration for our customers. In this blog post, we compare HDInsight Interactive Query, Spark and Presto using an industry standard benchmark derived from the TPC-DS Benchmark. These benchmarks are run using out of the box default HDInsight configurations, with no special optimizations….


Azure HDInsight Integration with Azure Log Analytics is now generally available

Cross post from https://azure.microsoft.com/en-us/blog/azure-hdinsight-integration-with-azure-log-analytics-is-now-generally-available/   I am excited to announce the general availability of HDInsight Integration with Azure Log Analytics. Azure HDInsight is a fully managed cloud service for customers to do analytics at scale using the most popular open-source engines such as Hadoop, Hive/LLAP, Presto, Spark, Kafka, Storm, HBase etc. ​ Thousands of our customers…


Azure #HDInsight @Microsoft Openness Day

I will be speaking at Microsoft Openness day on December 5th. Register here Session Abstract: Hadoop, Kafka, Spark and NoSQL solutions have emerged as the most appropriate open source technologies for data processing at massive scale and cost effectively. Azure HDInsight is Microsoft’s OSS analytics offering available to Azure customers since 2013. It is a…


Exposing Hive!

I sat down with Justin Scott (Application Development Manager at Microsoft working with our top customers) to talk about Apache Hive and where it’s heading. You can listen to channel 9 podcast now


Hive Metastore in HDInsight –Tips, Tricks & Best Practices

When you create a Hive table, the table definition (column names, data types, comments, etc.) are stored in the Hive Metastore. Hive Metastore is critical part of Hadoop architecture as it acts as a central schema repository which can be used by other access tools like Spark, Interactive Hive (LLAP), Presto, Pig and many other…


HDInsight -New self-paced trainings and labs on Hadoop, Hive, HBase, Spark & Storm

cross post from https://blogs.msdn.microsoft.com/azuredatalake/2016/08/28/hdinsight-new-self-paced-trainings-and-labs/ This week Microsoft Learning Experiences released/updated 3 HDInsight courses ( These are free , $49 if you need a course Certificate) Create HDInsight cluster Processing Big Data with Azure HDInsight Start course More and more organizations are taking on the challenge of analyzing big data. This course teaches you how to…


HDInsight:- Attach additional Azure storage accounts to the cluster

HDInsight supports a notion of the default file system. The default file system implies a default scheme and authority. It can also be used to resolve relative paths. During the HDInsight creation process, an Azure Storage account and a specific Azure Blob storage container from that account is designated as the default file system. In…

0