We're nearing completion of the package of statistical tools for very large data sets that I gave an early preview of at R/Finance 2010. It will be released for Revolution R Enterprise later this year, but we're looking for some R users with big data sets to put the 1.0 version through its paces in the beta program and offer suggestions for subsequent versions.
For now, it's a limited beta program -- we're looking for testers with some fairly specific attributes:
- You'll need to be running a recent version of Windows (other platforms will be open for testing later) on a machine with at least a 2-core processor, and ideally 4 or more cores.
- You'll need to have a large data set (larger than 1 GB, up to around 200 GB for now) for analysis with linear regression, logistic regression, or crosstabulations. The data should be in text (ideally CSV) format, or exportable to it.
If you think you might fit the bill, send me an email at [email protected] with a brief description of your setup and we'll see if we can get you into the beta program. Thanks!
Will the package be released under an open source license?
Posted by: Albert Gifi | July 16, 2010 at 00:11
No, this will be a proprietary module for Revolution R Enterprise. (Revolution's products are configured under an Open Core software model.)
Posted by: David Smith | July 16, 2010 at 07:46