Microsoft R Server 9.1 on HDInsight is available!

Today, we are excited to announce that Microsoft R Server 9.1 on Azure HDInsight is generally available. With this, we bring the power and innovation of our latest 9.1 release to the cloud on Spark 2.1 on HDInsight 3.6. This release of R Server on HDInsight includes the following features: State of the art new parallel machine…


Announcing Microsoft Machine Learning Library for Apache Spark

This post is authored by Roope Astala, Senior Program Manager, and Sudarshan Raghunathan, Principal Software Engineering Manager, at Microsoft. This is a cross post and its original post is in Cortana Intelligence and Machine Learning Blog. We’re excited to announce the Microsoft Machine Learning library for Apache Spark – a library designed to make data scientists…


HDInsight : BUILD Hive Lab is available now

Got some time to learn Big Data Technologies? How about starting with Hive which is considered the de facto standard for SQL queries in Hadoop We just released  HDInsight labs used during the BUILD conference code challenge. You will need 2 things to run these labs 1- HDInsight Cluster – How to create? 2- Step by Step Instructions…


Allowing multiple users to access R Server on HDInsight

Recently there are a few customers asking me how to enable multiple users to access R Server on HDInsight CONCURRENTLY, so I think blogging all the ways might be a good idea. The basic idea here is that we will simply add more users in the Edge node where the RStudio community version is currently…


SCP.Net with HDInsight Linux Storm clusters

SCP.Net is now available on HDInsight Linux clusters 3.4 and above. Versions Note: HDInsight Storm team recommends HDI 3.5 or above clusters for users looking to migrate their SCP.Net topologies from Windows to Linux. HDInsight custom script actions can be used to update the Mono version on HDI Clusters. For more details please look at:…


HDInsight tools for IntelliJ & Eclipse April Updates

  We are pleased to announce the April updates of HDInsight Tools for IntelliJ & Eclipse. This is a quality milestone and we focus primarily on refactoring the components and fixing bugs. We also added Azure Data Lake Store support and Eclipse local emulator support in this release. The HDInsight Tools for IntelliJ & Eclipse…

0

Exposing Hive!

I sat down with Justin Scott (Application Development Manager at Microsoft working with our top customers) to talk about Apache Hive and where it’s heading. You can listen to channel 9 podcast now


Azure HDInsight 3.6 – Five things that will make a data developer happy

Working with Hive, I regularly find myself staring at a csv/tsv/json files wondering where to start…. Hive View 2.0 is a new Web Experience in HDInsight 3.6 that greatly simplifies many common Hive Tasks and makes it easy to author and debug hive queries. In this post, we will look into 5 key feature that…


Nodes in HDInsight

Knowing the types and functions of nodes in HDInsight is key to taking full advantage of the service. This article is aimed at users who are familiar with big data concepts but are newer to HDInsight. Please feel free to read the article and provide me feedback even if you’re beyond the target audience for…


How WebHCat Works and How to Debug (Part 2)

Link to Part 1 2. How to debug WebHCat 2.1. BadGateway (HTTP status code 502) This is a very generic message from Gateway nodes. We will cover some common cases and possible mitigations. This is the most common Templeton problems customer are seeing right now. 2.1.1. WebHcat service down This happens in-case WebHCat server on…

2