Performance Optimization When Using rxExec() to Parallelize Algorithms

Microsoft R Server is designed around the concept of PEMAs (Parallel External Memory Algorithms). An external memory algorithm is an algorithm which does not require all data to be present in memory, it can process data sequentially, generating intermediate results and then iterating on those results. These algorithms are widely used for statistical analysis and…


Data Wrangling in XDF files using ScaleR Functions

The RevoScaleR package provides a set of over one hundred portable, scalable, and distributable data analysis functions. In this article, we will see some examples of using ScaleR Functions to do Data Wrangling in XDF files. For all the following examples, we will be using input XDF files from the SampleData Directory in Microsoft R…


Integrating PolyBase with Cloudera using Active Directory Authentication

This article outlines the steps to use PolyBase in SQL 2016(including R-Services) with a Cloudera Cluster and setup authentication using Active Directory in both SQL 2016 and Cloudera. Prerequisites Cloudera Cluster Active Directory with Domain Controller SQL Server 2016 with PolyBase and R-Services installed NOTE: We have tested the configuration using the Cloudera Cluster 5.5 running on…


Loan Classification using SQL Server 2016 R Services

R Services (In-database) provides a platform for developing and deploying intelligent applications that uncover new insights. You can use the rich and powerful R language and the many packages from the community to create models and generate predictions using your SQL Server data. Because R Services (In-database) integrates the R language with SQL Server, you…


Joseph Sirosh’s Keynote on 1 Million Predictions Per Second

We are thrilled to announce Joseph Sirosh, Corporate Vice President of Data Group at Microsoft will be presenting a demo at Microsoft Machine Learning & Data Science Summit on using online transaction processing (OLTP) database in SQL Server 2016 and latest algorithms in R service to predict a million events per second. As the number…


SQL R Services optimization for concurrent execution of sp_execute_external_script

Tips and Trick for SQL R Services With SQL Server 2016, we have introduced in-database analytics by bringing R closer to the database. This allows the compute to happen closer to the data,  and also leverage the power of SQL Server including resource governance. For production scenario, our guideline includes embedding ‘R’ scripts inside sp_execute_external_script…


DeployR log rotation to avoid large catalina.out

Are you facing a situation where your DeployR installations catalina.out file is growing huge after being in operation for a long time? DeployR has a webapp which is deployed on tomcat servlet container. Tomcat’s default log file catalina.out has no rotation facility. So if tomcat has been running for a while without restarts which could be…


MRS Capability Extension: Importing and Exporting Large In-Memory Data Frames

Introduction Microsoft R Server is an advanced analytics platform. Enterprise-ready, Microsoft R Server scales and accelerates R. R being an open source, statistical programming language, is a great tool to start building intelligent applications and realizing value in predictive analytics. While powerful, R is single threaded and memory bound. In order to handle Big Data,…


Understanding Grid Nodes and Slots in DeployR

DeployR has the concept of Grid nodes for scalable execution of R scripts. One DeployR instance can have multiple Grid nodes in order to horizontally scale R script execution. By default a grid node is installed for you when you install DeployR and you can add new grid nodes on separate machines. Each Grid node…


Enterprise Security in DeployR 8.0.5

DeployR Enterprise Security Model: DeployR Enterprise is designed to deliver analytics solutions at scale to whomever needs it: inside or outside the enterprise. It also guarantees secure delivery of your analytics via DeployR web services. These secure web services integrate seamlessly with existing enterprise security solutions: Single Sign-On, LDAP, Active Directory, PAM, and Basic Authentication,…