Microsoft R Server 9.0.1 is Packed with Exciting New Capabilities

Microsoft R Server has just released a feature-packed version 9.0.1. Customers can download from MSDN from this link. Enterprise customers with EA agreement will be able to download from Microsoft Volume Licensing Software Center in January 2017. There are major enhancements and brand new face lifts in some of the capabilities in Microsoft R Server that worth mentioning: -…


Data Exploration in XDF Files using ScaleR Functions

In this article, let us see few examples to explore data in XDF Files using ScaleR functions. For all the following examples, we will be using input XDF file AirlineDemoSmall.xdf from the SampleData Directory in Microsoft R Server. 1. Obtain the different types of variables present in the dataset OUTPUT :       2….


Pattern Matching on xdf files in Microsoft R Server

Pattern Matching: R uses regular expressions for pattern matching. To find patterns on non-xdf files, it’s pretty straightforward using R’s grep function. Example: Output: [1] “Microsoft01” “Microsoft03” “Microsoft05” [1] “Microsoft01”   When using xdf format files (the binary compressed file format used by RevoScaleR, that is a part of the Microsoft R Server) we can…


Importing .dbf files in Microsoft R Server

The dBASE database file with the .dbf file extension is not supported by Microsoft R Server’s rxImport. rxImport function is used to import data into an ‘.xdf’ file or data.frame. Alternatively, .dbf file can be converted into a data.frame using the CRAN package called foreign. The data.frame can then be converted into an .xdf file,…


Performance Optimization When Using rxExec() to Parallelize Algorithms

Microsoft R Server is designed around the concept of PEMAs (Parallel External Memory Algorithms). An external memory algorithm is an algorithm which does not require all data to be present in memory, it can process data sequentially, generating intermediate results and then iterating on those results. These algorithms are widely used for statistical analysis and…


Data Wrangling in XDF files using ScaleR Functions

The RevoScaleR package provides a set of over one hundred portable, scalable, and distributable data analysis functions. In this article, we will see some examples of using ScaleR Functions to do Data Wrangling in XDF files. For all the following examples, we will be using input XDF files from the SampleData Directory in Microsoft R…


Integrating PolyBase with Cloudera using Active Directory Authentication

This article outlines the steps to use PolyBase in SQL 2016(including R-Services) with a Cloudera Cluster and setup authentication using Active Directory in both SQL 2016 and Cloudera. Prerequisites Cloudera Cluster Active Directory with Domain Controller SQL Server 2016 with PolyBase and R-Services installed NOTE: We have tested the configuration using the Cloudera Cluster 5.5 running on…


Loan Classification using SQL Server 2016 R Services

R Services (In-database) provides a platform for developing and deploying intelligent applications that uncover new insights. You can use the rich and powerful R language and the many packages from the community to create models and generate predictions using your SQL Server data. Because R Services (In-database) integrates the R language with SQL Server, you…


Joseph Sirosh’s Keynote on 1 Million Predictions Per Second

We are thrilled to announce Joseph Sirosh, Corporate Vice President of Data Group at Microsoft will be presenting a demo at Microsoft Machine Learning & Data Science Summit on using online transaction processing (OLTP) database in SQL Server 2016 and latest algorithms in R service to predict a million events per second. As the number…


SQL R Services optimization for concurrent execution of sp_execute_external_script

Tips and Trick for SQL R Services With SQL Server 2016, we have introduced in-database analytics by bringing R closer to the database. This allows the compute to happen closer to the data,  and also leverage the power of SQL Server including resource governance. For production scenario, our guideline includes embedding ‘R’ scripts inside sp_execute_external_script…