Classify Yelp restaurant reviews’ food origin with MicrosoftML

Yelp restaurant reviews are one of the most useful resources people use to pick restaurants. Reviews themselves not only carry sentiment towards the dining experience but also contain “meta-information” about the restaurant. For example, looking at a review that says We can tell that this is a Japanese restaurant since it mentions omakase and sushi. Natural language processing and machine…


Importing .dbf files in Microsoft R Server

The dBASE database file with the .dbf file extension is not supported by Microsoft R Server’s rxImport. rxImport function is used to import data into an ‘.xdf’ file or data.frame. Alternatively, .dbf file can be converted into a data.frame using the CRAN package called foreign. The data.frame can then be converted into an .xdf file,…


Data Wrangling in XDF files using ScaleR Functions

The RevoScaleR package provides a set of over one hundred portable, scalable, and distributable data analysis functions. In this article, we will see some examples of using ScaleR Functions to do Data Wrangling in XDF files. For all the following examples, we will be using input XDF files from the SampleData Directory in Microsoft R…


MRS Capability Extension: Importing and Exporting Large In-Memory Data Frames

Introduction Microsoft R Server is an advanced analytics platform. Enterprise-ready, Microsoft R Server scales and accelerates R. R being an open source, statistical programming language, is a great tool to start building intelligent applications and realizing value in predictive analytics. While powerful, R is single threaded and memory bound. In order to handle Big Data,…