Data Exploration in XDF Files using ScaleR Functions

This article walks through a few examples of exploring data in XDF files using ScaleR functions. All of the examples use the input XDF file AirlineDemoSmall.xdf from the SampleData directory in Microsoft R Server.

1. Obtain the different types of variables present in the dataset
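
The original code for this step is not preserved, so the following is a minimal sketch of one way to do it, assuming RevoScaleR is installed. The name `airXdf` is illustrative. `rxGetVarInfo` lists each variable with its type, and `rxGetInfo` with `getVarInfo = TRUE` adds file-level details.

```r
library(RevoScaleR)  # ships with Microsoft R Server

# Locate the sample file; airXdf is an illustrative name, not from the article
airXdf <- file.path(rxGetOption("sampleDataDir"), "AirlineDemoSmall.xdf")

# Print each variable with its type, plus range or factor levels
rxGetVarInfo(airXdf)

# Same variable information, together with row and variable counts
rxGetInfo(airXdf, getVarInfo = TRUE)
```

For this sample file the listing should contain three variables: ArrDelay, CRSDepTime and DayOfWeek.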

OUTPUT : p1

2. Obtain the frequency table of different types of variables present in the dataset
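
One possible approach, sketched here as an assumption rather than the article's original code, is to pull the type of each variable out of the list returned by `rxGetVarInfo` and tabulate it with base R. The sketch assumes each list element carries a `varType` field; `airXdf`, `varInfo` and `varTypes` are illustrative names.

```r
library(RevoScaleR)

# Locate the sample file (illustrative variable name)
airXdf <- file.path(rxGetOption("sampleDataDir"), "AirlineDemoSmall.xdf")

# One list element per variable; varType is assumed to hold the type name
varInfo <- rxGetVarInfo(airXdf)
varTypes <- vapply(varInfo, function(v) v$varType, character(1))

# Count how many variables there are of each type
table(varTypes)
```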

OUTPUT : p2

3. Obtain the quantiles of numeric variables in the dataset
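
A hedged sketch of this step using `rxQuantile`, which computes approximate quantiles for one variable at a time. The variable names ArrDelay and CRSDepTime are the numeric columns of the sample file; `airXdf` and `numericVars` are illustrative names, and the original article may have used different arguments.

```r
library(RevoScaleR)

# Locate the sample file (illustrative variable name)
airXdf <- file.path(rxGetOption("sampleDataDir"), "AirlineDemoSmall.xdf")

# Numeric variables in AirlineDemoSmall.xdf
numericVars <- c("ArrDelay", "CRSDepTime")

# Default probabilities are 0, 0.25, 0.5, 0.75 and 1;
# the result has one column per variable
sapply(numericVars, function(v) rxQuantile(v, airXdf))

# Custom probabilities for a single variable
rxQuantile("ArrDelay", airXdf, probs = c(0.1, 0.5, 0.9))
```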

OUTPUT : p3

4. Obtain the TOTAL_OBSERVATIONS, VALID_OBSERVATIONS, MISSING_OBSERVATIONS, MISSING_PERCENTAGE, DISTINCT_CATEGORIES of all variables in the dataset
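
The sketch below shows one way these columns might be assembled, not necessarily how the article did it. It assumes that the `sDataFrame` component of an `rxSummary` result has `Name`, `ValidObs` and `MissingObs` columns, that `rxGetInfo` returns `numRows`, and that factor variables expose their `levels` through `rxGetVarInfo`. All object names are illustrative.

```r
library(RevoScaleR)

# Locate the sample file (illustrative variable name)
airXdf <- file.path(rxGetOption("sampleDataDir"), "AirlineDemoSmall.xdf")

# Per-variable valid and missing counts
stats <- rxSummary(~ ., data = airXdf)$sDataFrame
varNames <- as.character(stats$Name)

# Total number of rows in the file
totalObs <- rxGetInfo(airXdf)$numRows

# Number of factor levels per variable; NA for non-factor variables
varInfo <- rxGetVarInfo(airXdf)
distinct <- vapply(varInfo[varNames], function(v) {
  if (is.null(v$levels)) NA_integer_ else length(v$levels)
}, integer(1))

# Assemble the report, one row per variable
data.frame(
  VARIABLE             = varNames,
  TOTAL_OBSERVATIONS   = totalObs,
  VALID_OBSERVATIONS   = stats$ValidObs,
  MISSING_OBSERVATIONS = stats$MissingObs,
  MISSING_PERCENTAGE   = round(100 * stats$MissingObs / totalObs, 2),
  DISTINCT_CATEGORIES  = unname(distinct)
)
```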

OUTPUT : p4


5. Obtain the frequency table of a particular variable in the dataset
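
As a minimal sketch, assuming the variable of interest is the factor DayOfWeek, `rxCrossTabs` with a one-term formula returns the count for each category. The choice of variable and the name `airXdf` are assumptions for illustration.

```r
library(RevoScaleR)

# Locate the sample file (illustrative variable name)
airXdf <- file.path(rxGetOption("sampleDataDir"), "AirlineDemoSmall.xdf")

# Counts of observations for each day of the week
rxCrossTabs(~ DayOfWeek, data = airXdf)

# rxCube reports the same counts in long format, one row per category
rxCube(~ DayOfWeek, data = airXdf)
```

Both functions expect categorical terms in the formula, so a numeric variable would need to be converted first, for example with `F()` for integer data.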

OUTPUT : p5

REFERENCES

RevoScaleR Functions
Comparison of Base R and ScaleR Functions
A simple Big Data analysis using the RevoScaleR package in Revolution R