This post from Sherry LaMonica is the first in a series from members of the Revolution Analytics Engineering team — ed.
RevoScaleR provides a framework for fast and efficient multi-core processing of large data sets. You can visualize and model data sets with millions of records on your local machine using syntax like:
myLinMod <- rxLinMod(y ~ x + z, data=myData)
Some highlights of the RevoScaleR package include:
- The XDF file format, a binary file format with an R interface that optimizes row and column processing and analysis.
- Data transformation tools for exploring and preparing large data sets for analysis.
- Statistical algorithms optimized for large data sets.