U-SQL Tip: Generating ranges of numbers and dates

Many common scenarios for U-SQL developers require constructing a RowSet made up of a simple range of numbers or dates, for example the integers from 1 to 10. In this blog post we’ll take a look at options for doing this in U-SQL. In the process, we’ll get a chance to learn how to use some common U-SQL features:…

Running XGBoost on Azure HDInsight

XGBoost is a popular open-source distributed gradient boosting library used by many companies in production. Azure HDInsight is a fully managed Hadoop and Spark solution that lets you easily create a highly extensible Spark cluster. In this blog post, we will walk you through the detailed steps to compile and run XGBoost…


Create U-SQL EXTRACT Script Automatically

In this blog post, you will learn how to create a U-SQL EXTRACT script automatically using the latest version of Azure Data Lake Tools for Visual Studio. Watch this 3-minute video to learn more. One of U-SQL’s core capabilities is the ability to schematize unstructured data on the fly without having to create a…


Integrating Kyligence Analytics Platform with Microsoft Azure HDInsight

This is a guest blog post from Shaofeng Shi, Senior Architect at Kyligence Inc. Introducing Kyligence Analytics Platform: Kyligence Analytics Platform (KAP) is an enterprise-ready big data warehouse on Apache Hadoop. Created by the same development team as Apache Kylin, an open-source distributed OLAP engine for big data, KAP inherits all of Kylin’s advantages and has more innovations,…


Analyze data in Azure Data Lake Store using familiar-and-powerful Excel 2016

We are excited to announce that, as part of the June 2017 updates of Excel 2016, Azure Data Lake Store is now supported as a data source. Many enterprise data analysts prefer sophisticated and powerful tools like Excel and Power BI to access and analyze data. As enterprises are building cloud-based data…

Solving the problem of “Problem with the SSL CA cert (path? access rights?)” for R Server on HDInsight

R Server on HDInsight is an ideal platform for performing big data analysis using the R interface. If you install packages only from the CRAN R repository, you probably won’t encounter the problem mentioned in the title. However, if you want to install a package that is still under development, or if you want to use…


Webinar with Talena: Migrate open source big data applications to HDInsight, add backup & restore to your existing apps running on Apache Hadoop & Spark

“Are you building big data applications using open source platforms such as Apache Hadoop or Spark, but unsure of how to add cloud to your architecture and manage your data assets better?” Join me for a webinar on June 29th, 2017 at 10am PST with Hari (CTO of Talena) as we explore how Talena can help customers migrate their big…

Run H2O.ai in R on Azure HDInsight

In our previous blog post, we introduced H2O.ai on Azure HDInsight. Currently, H2O can run on Azure HDInsight through its Python or Scala APIs. However, R support doesn’t come out of the box. R is popular in the data science community, and support for R in H2O.ai on Azure HDInsight has been sought after by many of our customers. Today, we…


Microsoft R Server 9.1 on HDInsight is available!

Today, we are excited to announce that Microsoft R Server 9.1 on Azure HDInsight is generally available. With this, we bring the power and innovation of our latest 9.1 release to the cloud on Spark 2.1 on HDInsight 3.6. This release of R Server on HDInsight includes the following features: State-of-the-art new parallel machine…