Azure HDInsight: How to run Presto in one simple step and query across data sources such as Cosmos DB, SQL DB & Hive

Over the past few months I have seen many inquiries on how to run Presto in HDInsight. In this post we provide an easy way to install Presto in HDInsight and to configure the various data sources that Presto can query. One of the unique advantages of HDInsight is decoupling between storage and…


Webinar with Datameer: Modern Data Preparation, Analytics and Insights – Key ingredients for an ML- and AI-ready organization

“Excited about using Big Data to build intelligent applications, but unsure of how to proceed?” Join me for a webinar with Andrew Brust (Senior Director at Datameer and a Microsoft Data Platform MVP) who specializes in this domain and will explain how Datameer can help users be more productive. Big Data has come a long…


Allowing multiple users to access R Server on HDInsight

Recently a few customers have asked me how to enable multiple users to access R Server on HDInsight CONCURRENTLY, so I thought blogging all the ways might be a good idea. The basic idea here is that we will simply add more users in the Edge node where the RStudio community version is currently…


SCP.Net with HDInsight Linux Storm clusters

SCP.Net is now available on HDInsight Linux clusters 3.4 and above. Versions Note: The HDInsight Storm team recommends HDI 3.5 or above clusters for users looking to migrate their SCP.Net topologies from Windows to Linux. HDInsight custom script actions can be used to update the Mono version on HDI clusters. For more details please look at:…


HDInsight tools for IntelliJ & Eclipse April Updates

We are pleased to announce the April updates of HDInsight Tools for IntelliJ & Eclipse. This is a quality milestone, and we focused primarily on refactoring the components and fixing bugs. We also added Azure Data Lake Store support and Eclipse local emulator support in this release. The HDInsight Tools for IntelliJ & Eclipse…


Azure Data Lake U-SQL April 25 2017 Updates: Introducing Packages, UNPIVOT INCLUDE NULLS, fast file set preview flag, R extension returns dataframes, exporting your cluster database with sample data to your local run and more!

Today we concluded the rollout of our April 2017 refresh to all regions. Here are the April 2017 updates for Azure Data Lake U-SQL and Developer Tooling! The main items are the release of the package feature that allows you to bundle assembly reference statements and variable declarations into a shareable package and reduce…


Exposing Hive!

I sat down with Justin Scott (Application Development Manager at Microsoft working with our top customers) to talk about Apache Hive and where it’s heading. You can listen to the Channel 9 podcast now.


Cloudera clusters now run with Azure Data Lake Store

We are excited to announce that with today’s release of Cloudera Enterprise 5.11 you can now run Spark, Hive, and MapReduce workloads in a Cloudera cluster on Azure Data Lake Store (ADLS). Cloudera customers can now take advantage of the many benefits of running clusters on ADLS. And ADLS brings to its customers another valuable…
